Michael E. Taylor

Partial Differential
Equations II

Third Edition

Springer


Applied Mathematical Sciences 


Founding Editors 


F. John 
J. P. LaSalle 
L. Sirovich 


Volume 116 


Series Editors 


Anthony Bloch, Department of Mathematics, University of Michigan, Ann 
Arbor, MI, USA 


C. L. Epstein, Department of Mathematics, University of Pennsylvania, 
Philadelphia, PA, USA 


Alain Goriely, Department of Mathematics, University of Oxford, Oxford, UK 
Leslie Greengard, New York University, New York, NY, USA 


Advisory Editors 


J. Bell, Center for Computational Sciences and Engineering, Lawrence Berkeley 
National Laboratory, Berkeley, CA, USA 


P. Constantin, Department of Mathematics, Princeton University, Princeton, NJ, 
USA 


R. Durrett, Department of Mathematics, Duke University, Durham, NC, USA


R. Kohn, Courant Institute of Mathematical Sciences, New York University, 
New York, NY, USA 


R. Pego, Department of Mathematical Sciences, Carnegie Mellon University, 
Pittsburgh, PA, USA 


L. Ryzhik, Department of Mathematics, Stanford University, Stanford, CA, USA 
A. Singer, Department of Mathematics, Princeton University, Princeton, NJ, USA 


A. Stevens, Department of Applied Mathematics, University of Münster,
Münster, Germany


S. Wright, Computer Sciences Department, University of Wisconsin, Madison, 
WI, USA 


The mathematization of all sciences, the fading of traditional scientific 
boundaries, the impact of computer technology, the growing importance of 
computer modeling and the necessity of scientific planning all create the need 
both in education and research for books that are introductory to and abreast 
of these developments. The purpose of this series is to provide such books, 
suitable for the user of mathematics, the mathematician interested in applications, 
and the student scientist. In particular, this series will provide an outlet for topics 
of immediate interest because of the novelty of its treatment of an application or 
of mathematics being applied or lying close to applications. These books should 
be accessible to readers versed in mathematics or science and engineering, and 
will feature a lively tutorial style, a focus on topics of current interest, and present 
clear exposition of broad appeal. A complement to the Applied Mathematical
Sciences series is the Texts in Applied Mathematics series, which publishes 
textbooks suitable for advanced undergraduate and beginning graduate courses. 


Michael E. Taylor 


Partial Differential 
Equations II 


Qualitative Studies of Linear Equations 


Third Edition 


Springer


Michael E. Taylor 
Department of Mathematics 
University of North Carolina 
Chapel Hill, NC, USA 


ISSN 0066-5452 ISSN 2196-968X (electronic) 
Applied Mathematical Sciences 
ISBN 978-3-031-33699-7 ISBN 978-3-031-33700-0 (eBook) 


https://doi.org/10.1007/978-3-031-33700-0 
Mathematics Subject Classification: 35-01 


1st & 2nd editions: © Springer Science+Business Media, LLC 1996, 2011
3rd edition: © The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer
Nature Switzerland AG 2023 


This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher, 
whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, 
reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical 
way, and transmission or information storage and retrieval, electronic adaptation, computer software, 
or by similar or dissimilar methodology now known or hereafter developed. 

The use of general descriptive names, registered names, trademarks, service marks, etc. in this 
publication does not imply, even in the absence of a specific statement, that such names are exempt 
from the relevant protective laws and regulations and therefore free for general use. 

The publisher, the authors, and the editors are safe to assume that the advice and information in this 
book are believed to be true and accurate at the date of publication. Neither the publisher nor the 
authors or the editors give a warranty, expressed or implied, with respect to the material contained 
herein or for any errors or omissions that may have been made. The publisher remains neutral with 
regard to jurisdictional claims in published maps and institutional affiliations. 


This Springer imprint is published by the registered company Springer Nature Switzerland AG 
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland 


To my wife and daughter, Jane Hawkins 
and Diane Hart 


Contents of Volumes I and III 


Volume I: Basic Theory 


1 Basic Theory of ODE and Vector Fields
2 The Laplace Equation and Wave Equation
3 Fourier Analysis, Distributions, and Constant-Coefficient Linear PDE
4 Sobolev Spaces
5 Linear Elliptic Equations
6 Linear Evolution Equations
A Outline of Functional Analysis
B Manifolds, Vector Bundles, and Lie Groups


Volume III: Nonlinear Equations 


13 Function Space and Operator Theory for Nonlinear Analysis
14 Nonlinear Elliptic Equations
15 Nonlinear Parabolic Equations
16 Nonlinear Hyperbolic Equations
17 Euler and Navier-Stokes Equations for Incompressible Fluids
18 Einstein’s Equations


Preface 


The theory of partial differential equations is a many-faceted subject. Created to
describe the mechanical behavior of objects such as vibrating strings and blowing
winds, it has developed into a body of material that interacts with many branches
of mathematics, such as differential geometry, complex analysis, and harmonic
analysis, and it is a ubiquitous factor in the description and elucidation of
problems in mathematical physics.

This work is intended to provide a course of study of some of the major 
aspects of PDE. It is addressed to readers with a background in the basic
introductory graduate mathematics courses in American universities: elementary real
and complex analysis, differential geometry, and measure theory. 

Chapter 1 provides background material on the theory of ordinary differential
equations (ODE). This includes both very basic material — on topics such as the 
existence and uniqueness of solutions to ODE and explicit solutions to equations 
with constant coefficients and relations to linear algebra — and more sophisticated 
results — on flows generated by vector fields, connections with differential 
geometry, the calculus of differential forms, stationary action principles in 
mechanics, and their relation to Hamiltonian systems. We discuss equations of 
relativistic motion as well as equations of classical Newtonian mechanics. There 
are also applications to topological results, such as degree theory, the Brouwer 
fixed-point theorem, and the Jordan-Brouwer separation theorem. In this chapter, 
we also treat scalar first-order PDE, via the Hamilton—Jacobi theory. 

Chapters 2–6 constitute a survey of basic linear PDE. Chapter 2 begins with
the derivation of some equations of continuum mechanics in a fashion similar to 
the derivation of ODE in mechanics in Chap. 1, via variational principles. We 
obtain equations for vibrating strings and membranes; these equations are not 
necessarily linear, and hence they will also provide sources of problems later, 
when nonlinear PDE is taken up. Further material in Chap. 2 centers around the 
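Laplace operator, which on Euclidean space $\mathbb{R}^n$ is

(1)  $\Delta = \frac{\partial^2}{\partial x_1^2} + \cdots + \frac{\partial^2}{\partial x_n^2},$

and the linear wave equation,

(2)  $\frac{\partial^2 u}{\partial t^2} - \Delta u = 0.$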


We also consider the Laplace operator on a general Riemannian manifold and the 
wave equation on a general Lorentz manifold. We discuss the basic consequences 
of Green’s formula, including energy conservation and finite propagation speed 
for solutions to linear wave equations. We also discuss Maxwell’s equations for 
electromagnetic fields and their relation with special relativity. Before we can 
establish general results on the solvability of these equations, it is necessary to 
develop some analytical techniques. This is done in the next couple of chapters. 

Chapter 3 is devoted to Fourier analysis and the theory of distributions. These 
topics are crucial for the study of linear PDE. We give a number of basic 
applications to the study of linear PDE with constant coefficients. Among these 
applications are results on harmonic and holomorphic functions in the plane, 
including a short treatment of elementary complex function theory. We derive 
explicit formulas for solutions to Laplace and wave equations on Euclidean 
space, and also the heat equation, 


(3)  $\frac{\partial u}{\partial t} - \Delta u = 0.$

We also produce solutions on certain subsets, such as rectangular regions, using 
the method of images. We include material on the discrete Fourier transform, 
germane to the discrete approximation of PDE, and on the fast evaluation of this 
transform, the FFT. Chapter 3 is the first chapter to make extensive use of 
functional analysis. Basic results on this topic are compiled in Appendix A, 
Outline of Functional Analysis. 
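
As a small illustration of the discrete Fourier transform and its fast evaluation, the following sketch compares a direct evaluation of the defining sums with NumPy's FFT routine. The function name `naive_dft` and the sign and normalization convention (that of `numpy.fft.fft`) are assumptions made for this illustration, not notation taken from the text.

```python
import numpy as np

def naive_dft(x):
    """Direct O(n^2) evaluation of X[k] = sum_j x[j] * exp(-2*pi*i*j*k/n)."""
    x = np.asarray(x, dtype=complex)
    n = x.shape[0]
    j = np.arange(n)
    # n-by-n matrix with entries exp(-2*pi*i*j*k/n)
    w = np.exp(-2j * np.pi * np.outer(j, j) / n)
    return w @ x

rng = np.random.default_rng(0)
samples = rng.standard_normal(64)
# The FFT evaluates the same sums in O(n log n) operations.
max_diff = np.max(np.abs(naive_dft(samples) - np.fft.fft(samples)))
print(max_diff)  # difference at the level of rounding error
```

The two results should agree up to floating-point rounding; the gain from the FFT is in operation count, not in the values computed.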

Sobolev spaces have proven to be a very effective tool in the existence theory 
of PDE, and in the study of regularity of solutions. In Chap. 4 we introduce 
Sobolev spaces and study some of their basic properties. We restrict attention to 
$L^2$-Sobolev spaces, such as $H^k(\mathbb{R}^n)$, which consists of $L^2$ functions whose
derivatives of order $\le k$ (defined in a distributional sense, in Chap. 3) belong to
$L^2(\mathbb{R}^n)$, when $k$ is a positive integer. We also replace $k$ by a general real number
$s$. The $L^p$-Sobolev spaces, which are very useful for nonlinear PDE, are treated
later, in Chap. 13.

Chapter 5 is devoted to the study of the existence and regularity of solutions to 
linear elliptic PDE, on bounded regions. We begin with the Dirichlet problem for 
the Laplace operator, 


(4)  $\Delta u = f$ on $\Omega$, $\quad u = g$ on $\partial\Omega$,

and then treat the Neumann problem and various other boundary problems,
including some that apply to electromagnetic fields. We also study general
boundary problems for linear elliptic operators, giving a condition that guarantees
regularity and solvability (perhaps given a finite number of linear conditions on 
the data). Also in Chap. 5 are some applications to other areas, such as a proof 
of the Riemann mapping theorem, first for smooth simply connected domains in 
the complex plane C, then, after a treatment of the Dirichlet problem for the 
Laplace operator on domains with rough boundary, for general simply connected 
domains in C. We also develop the Hodge theory and apply it to de Rham 
cohomology, extending the study of topological applications of differential forms 
begun in Chap. 1. 

In Chap. 6 we study linear evolution equations, in which there is a “time” 
variable t, and initial data are given at t = 0. We discuss the heat and wave 
equations. We also treat Maxwell’s equations, for an electromagnetic field, and 
more general hyperbolic systems. We prove the Cauchy—Kowalewsky theorem, 
in the linear case, establishing local solvability of the Cauchy initial value 
problem for general linear PDE with analytic coefficients, and analytic data, as 
long as the initial surface is “noncharacteristic.” The nonlinear case is treated in 
Chap. 16. Also in Chap. 6 we treat geometrical optics, providing approximations 
to solutions of wave equations whose initial data either are highly oscillatory or 
possess simple singularities, such as a jump across a smooth hypersurface. 

Chapters 1–6, together with Appendix A, Outline of Functional Analysis, and
Appendix B, Manifolds, Vector Bundles, and Lie Groups, make up the first volume
of this work. The second volume consists of Chaps. 7–12, covering a selection of
more advanced topics in linear PDE, together with Appendix C, Connections and
Curvature.

Chapter 7 deals with pseudodifferential operators ($\Psi$DOs). This class of
operators includes both differential operators and parametrices of elliptic
operators, that is, inverses modulo smoothing operators. There is a “symbol calculus”
allowing one to analyze products of $\Psi$DOs, useful for such a parametrix
construction. The $L^2$-boundedness of operators of order zero and the Gårding
inequality for elliptic $\Psi$DOs with positive symbol provide very useful tools in
linear PDE, which will be used in many subsequent chapters.

Chapter 8 is devoted to spectral theory, particularly for self-adjoint elliptic 
operators. First we give a proof of the spectral theorem for general self-adjoint 
operators on Hilbert space. Then we discuss conditions under which a differential 
operator yields a self-adjoint operator. We then discuss the asymptotic
distribution of eigenvalues of the Laplace operator on a bounded domain, making use of
a construction of a parametrix for the heat equation from Chap. 7. Further 
material in Chap. 8 includes results on the spectral behavior of various specific 
differential operators, such as the Laplace operator on a sphere, and on hyperbolic 
space, the “harmonic oscillator” 


(5)  $-\Delta + |x|^2,$

and the operator

(6)  $-\Delta - \frac{K}{|x|},$

which arises in the simplest quantum mechanical model of the hydrogen atom. 
We also consider the Laplace operator on cones. 
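
To make the spectral behavior of the harmonic oscillator $-\Delta + |x|^2$ concrete, here is a minimal numerical sketch in one space dimension, where the exact eigenvalues of $-d^2/dx^2 + x^2$ are $1, 3, 5, \ldots$. The finite-difference discretization, the truncation of the line to a bounded interval, and the name `oscillator_eigenvalues` with its parameters are all assumptions made for illustration; this is not the method of analysis used in Chap. 8.

```python
import numpy as np

def oscillator_eigenvalues(num_points=801, half_width=8.0, count=4):
    """Lowest eigenvalues of a finite-difference model of -d^2/dx^2 + x^2
    on [-half_width, half_width] with Dirichlet boundary conditions."""
    x = np.linspace(-half_width, half_width, num_points)
    h = x[1] - x[0]
    interior = x[1:-1]
    m = interior.size
    # Second-difference approximation of -d^2/dx^2, plus the potential x^2.
    main = 2.0 / h**2 + interior**2
    off = -np.ones(m - 1) / h**2
    matrix = np.diag(main) + np.diag(off, 1) + np.diag(off, -1)
    # eigvalsh returns the eigenvalues of a symmetric matrix in ascending order.
    return np.linalg.eigvalsh(matrix)[:count]

eigs = oscillator_eigenvalues()
print(eigs)  # expected to lie near 1, 3, 5, 7
```

The computed values differ from the odd integers by a discretization error that shrinks as the grid is refined.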

In Chap. 9 we study the scattering of waves by a compact obstacle $K$ in $\mathbb{R}^3$.
This scattering theory is to some degree an extension of the spectral theory of the
Laplace operator on $\mathbb{R}^3 \setminus K$, with the Dirichlet boundary condition. In addition to
studying how a given obstacle scatters waves, we consider the inverse problem: 
how to determine an obstacle given data on how it scatters waves. 

Chapter 10 is devoted to the Atiyah–Singer index theorem. This gives a
formula for the index of an elliptic operator $D$ on a compact manifold $M$, defined
by


(7)  $\operatorname{Index} D = \dim \ker D - \dim \ker D^*.$


We establish this formula, which is an integral over M of a certain differential 
form defined by a pair of “curvatures,” when D is a first-order differential 
operator of “Dirac type,” a class that contains many important operators arising 
from differential geometry and complex analysis. Special cases of such a formula 
include the Chern—Gauss—Bonnet formula and the Riemann—Roch formula. We 
also discuss the significance of the latter formula in the study of Riemann 
surfaces. 
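
A finite-dimensional toy model may help fix the definition of the index. For a linear map given by a real matrix with `rows` rows and `cols` columns, the rank-nullity theorem gives dim ker D - dim ker D^T = cols - rows, whatever the entries are. The sketch below checks this numerically; the function name `fredholm_index` and the sample matrices are hypothetical, and the example is only an analogue of the elliptic-operator setting, not part of the text's argument.

```python
import numpy as np

def fredholm_index(matrix):
    """dim ker D - dim ker D^T for a real matrix D, computed from its rank."""
    d = np.asarray(matrix, dtype=float)
    rows, cols = d.shape
    rank = np.linalg.matrix_rank(d)
    dim_ker = cols - rank      # dimension of the kernel of D
    dim_coker = rows - rank    # dimension of the kernel of the transpose
    return dim_ker - dim_coker

full_rank = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])
degenerate = np.zeros((2, 3))
# Both kernels change with the matrix, but their difference does not.
print(fredholm_index(full_rank), fredholm_index(degenerate))  # 1 1
```

The point of the toy model is the stability of the index: the individual kernel dimensions jump as the matrix degenerates, while their difference stays fixed.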

In Chap. 11 we study Brownian motion, described mathematically by Wiener 
measure on the space of continuous paths in $\mathbb{R}^n$. This provides a probabilistic
approach to diffusion and it both uses and provides new tools for the analysis 
of the heat equation and variants, such as 


(8)  $\frac{\partial u}{\partial t} = -\Delta u + Vu,$


where V is a real-valued function. There is an integral formula for solutions to (8), 
known as the Feynman—Kac formula; it is an integral over path space with respect 
to the Wiener measure, of a fairly explicit integrand. We also derive an analogous 
integral formula for solutions to 


(9)  $\frac{\partial u}{\partial t} = -\Delta u + Xu,$


where X is a vector field. In this case, another tool is involved in constructing the 
integrand, the stochastic integral. We also study stochastic differential equations 
and applications to more general diffusion equations. 

In Chap. 12 we tackle the $\bar\partial$-Neumann problem, a boundary problem for an
elliptic operator (essentially the Laplace operator) on a domain $\Omega \subset \mathbb{C}^n$, which is
very important in the theory of functions of several complex variables. From a
technical point of view, it is of particular interest that this boundary problem does
not satisfy the regularity criteria investigated in Chap. 5. If $\Omega$ is “strongly
pseudo-convex,” one has instead certain “subelliptic estimates,” which are 
established in Chap. 12. 

The third and final volume of this work contains Chaps. 13-18. It is here that 
we study nonlinear PDE. 

We prepare the way in Chap. 13 with a further development of function space 
and operator theory, for use in nonlinear analysis. This includes the theory of
$L^p$-Sobolev spaces and Hölder spaces. We derive estimates in these spaces on
nonlinear functions $F(u)$, known as “Moser estimates,” which are very useful. We
extend the theory of pseudodifferential operators to cases where the symbols have
limited smoothness, and also develop a variant of $\Psi$DO theory, the theory of
“paradifferential operators,” which has had a significant impact on nonlinear PDE
since about 1980. We also estimate these operators, acting on the function spaces
mentioned above. Other topics treated in Chap. 13 include Hardy spaces,
compensated compactness, and “fuzzy functions.”

Chapter 14 is devoted to nonlinear elliptic PDE, with an emphasis on 
second-order equations. There are three successive degrees of nonlinearity: 
semilinear equations, such as 


(10)  $\Delta u = F(x, u, \nabla u),$

quasi-linear equations, such as

(11)  $\sum a^{jk}(x, u, \nabla u)\, \partial_j \partial_k u = F(x, u, \nabla u),$

and completely nonlinear equations, of the form

(12)  $G(x, D^2 u) = 0.$

Differential geometry provides a rich source of such PDE, and Chap. 14 contains 
a number of geometrical applications. For example, to deform conformally a 
metric on a surface so its Gauss curvature changes from k(x) to K(x), one needs to 
solve the semilinear equation 

(13)  $\Delta u = k(x) - K(x) e^{2u}.$


As another example, the graph of a function y = u(x) is a minimal submanifold of 
Euclidean space provided u solves the quasi-linear equation 


(14)  $(1 + |\nabla u|^2)\Delta u - (\nabla u) \cdot H(u)(\nabla u) = 0,$

called the minimal surface equation. Here, $H(u) = (\partial_j \partial_k u)$ is the Hessian matrix
of $u$. On the other hand, this graph has Gauss curvature $K(x)$ provided $u$ solves the
completely nonlinear equation

(15)  $\det H(u) = K(x)(1 + |\nabla u|^2)^{(n+2)/2},$

a Monge–Ampère equation. Equations (13)–(15) are all scalar, and the maximum
principle plays a useful role in the analysis, together with a number of other tools. 
Chapter 14 also treats nonlinear systems. Important physical examples arise in 
studies of elastic bodies, as well as in other areas, such as the theory of liquid 
crystals. Geometric examples of systems considered in Chap. 14 include
equations for harmonic maps and equations for isometric embeddings of a
Riemannian manifold in Euclidean space. 

In Chap. 15, we treat nonlinear parabolic equations. Partly echoing Chap. 14, 
we progress from a treatment of semilinear equations, 


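(16)  $\frac{\partial u}{\partial t} = Lu + F(x, u, \nabla u),$

where $L$ is a linear operator, such as $L = \Delta$, to a treatment of quasi-linear
equations, such as

(17)  $\frac{\partial u}{\partial t} = \sum A^{jk}(t, x, u)\, \partial_j \partial_k u + X(u).$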


(We do very little with completely nonlinear equations in this chapter.) We study 
systems as well as scalar equations. The first application of (16) we consider is to 
the parabolic equation method of constructing harmonic maps. We also consider 
“reaction–diffusion” equations, $\ell \times \ell$ systems of the form (16), in which
$F(x, u, \nabla u) = X(u)$, where $X$ is a vector field on $\mathbb{R}^\ell$, and $L$ is a diagonal
operator, with diagonal elements $a_j \Delta$, $a_j > 0$. These equations arise in
mathematical models in biology and in chemistry. For example, $u = (u_1, \ldots, u_\ell)$ might
represent the population densities of each of $\ell$ species of living creatures,
distributed over an area of land, interacting in a manner described by $X$ and diffusing
in a manner described by $a_j \Delta$. If there is a nonlinear (density-dependent)
diffusion, one might have a system of the form (17).

Another problem considered in Chap. 15 models the melting of ice; one has a 
linear heat equation in a region (filled with water) whose boundary (where the 
water touches the ice) is moving (as the ice melts). The nonlinearity in the 
problem involves the description of the boundary. We confine our analysis to a 
relatively simple one-dimensional case. 

Nonlinear hyperbolic equations are studied in Chap. 16. Here continuum 
mechanics is the major source of examples, and most of them are systems, rather 
than scalar equations. We establish local existence for solutions to first-order
hyperbolic systems, which are either “symmetric” or “symmetrizable.” An
example of the latter class is the following system describing compressible fluid
flow:

(18)  $\frac{\partial v}{\partial t} + \nabla_v v + \frac{1}{\rho} \operatorname{grad} p = 0, \quad \frac{\partial \rho}{\partial t} + \nabla_v \rho + \rho \operatorname{div} v = 0,$

for a fluid with velocity $v$, density $\rho$, and pressure $p$, assumed to satisfy a relation
$p = p(\rho)$, called an “equation of state.” Solutions to such nonlinear systems tend
to break down, due to shock formation. We devote a bit of attention to the study 
of weak solutions to nonlinear hyperbolic systems, with shocks. 

We also study second-order hyperbolic systems, such as systems for a 
$k$-dimensional membrane vibrating in $\mathbb{R}^n$, derived in Chap. 2. Another topic
covered in Chap. 16 is the Cauchy—Kowalewsky theorem, in the nonlinear case. 
We use a method introduced by P. Garabedian to transform the Cauchy problem 
for an analytic equation into a symmetric hyperbolic system. 

In Chap. 17 we study incompressible fluid flow. This is governed by the Euler 
equation 


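(19)  $\frac{\partial v}{\partial t} + \nabla_v v = -\operatorname{grad} p, \quad \operatorname{div} v = 0,$

in the absence of viscosity, and by the Navier–Stokes equation

(20)  $\frac{\partial v}{\partial t} + \nabla_v v = \nu \mathcal{L} v - \operatorname{grad} p, \quad \operatorname{div} v = 0,$

in the presence of viscosity. Here $\mathcal{L}$ is a second-order operator, the Laplace
operator for a flow on flat space; the “viscosity” $\nu$ is a positive quantity. Equation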
(19) shares some features with quasi-linear hyperbolic systems, though there are 
also significant differences. Similarly, (20) has a lot in common with semilinear 
parabolic systems. 

Chapter 18, the last chapter of this work, is devoted to Einstein’s gravitational 
equations: 


(21)  $G_{jk} = 8\pi\kappa T_{jk}.$


Here $G_{jk}$ is the Einstein tensor, given by $G_{jk} = \operatorname{Ric}_{jk} - (1/2) S g_{jk}$, where $\operatorname{Ric}_{jk}$
is the Ricci tensor and $S$ the scalar curvature, of a Lorentz manifold (or
“spacetime”) with metric tensor $g_{jk}$. On the right side of (21), $T_{jk}$ is the
stress–energy tensor of the matter in the spacetime, and $\kappa$ is a positive constant, which
can be identified with the gravitational constant of the Newtonian theory of
gravity. In local coordinates, $G_{jk}$ has a nonlinear expression in terms of $g_{jk}$ and
its second-order derivatives. In the empty-space case, where $T_{jk} = 0$, (21) is a
quasi-linear second-order system for $g_{jk}$. The freedom to change coordinates
provides an obstruction to this equation being hyperbolic, but one can impose the
use of “harmonic” coordinates as a constraint and transform (21) into a
hyperbolic system. In the presence of matter one couples (21) to other systems,
obtaining more elaborate PDE. We treat this in two cases, in the presence of an 
electromagnetic field, and in the presence of a relativistic fluid. 

In addition to the 18 chapters just described, there are three appendices, 
already mentioned above. Appendix A gives definitions and basic properties of 
Banach and Hilbert spaces (of which $L^p$-spaces and Sobolev spaces are
examples), Fréchet spaces (such as $C^\infty(\mathbb{R}^n)$), and other locally convex spaces (such as
spaces of distributions). It discusses some basic facts about bounded linear 
operators, including some special properties of compact operators, and also 
considers certain classes of unbounded linear operators. This functional analytic 
material plays a major role in the development of PDE from Chap. 3 onward. 

Appendix B gives definitions and basic properties of manifolds and vector 
bundles. It also discusses some elementary properties of Lie groups, including a 
little representation theory, useful in Chap. 8, on spectral theory, as well as in the 
Chern—Weil construction. 

Appendix C, Connections and Curvature, contains material of a differential 
geometric nature, crucial for understanding many things done in Chaps. 10-18. 
We consider connections on general vector bundles, and their curvature. We 
discuss in detail the special properties of the primary case: the Levi—Civita 
connection and Riemann curvature tensor on a Riemannian manifold. We discuss 
the basic properties of the geometry of submanifolds, relating the second
fundamental form to curvature via the Gauss–Codazzi equations. We describe how
vector bundles arise from principal bundles, which themselves carry various 
connections and curvature forms. We then discuss the Chern—Weil construction, 
yielding certain closed differential forms associated to curvatures of connections 
on principal bundles. We give several proofs of the classical Gauss—Bonnet 
theorem and some related results on two-dimensional surfaces, which are useful 
particularly in Chaps. 10 and 14. We also give a geometrical proof of the Chern— 
Gauss—Bonnet theorem, which can be contrasted with the proof in Chap. 10, as a 
consequence of the Atiyah—Singer index theorem. 

We mention that, in addition to these “global” appendices, there are appendices 
to some chapters. For example, Chap. 3 has an appendix on the gamma function. 
Chapter 6 has two appendices; Appendix A has some results on Banach spaces of 
harmonic functions useful for the proof of the linear Cauchy–Kowalewsky
theorem, and Appendix B deals with the stationary phase formula, useful for the study
of geometrical optics in Chap. 6 and also for results later, in Chap. 9. There are 
other chapters with such “local” appendices. Furthermore, there are two sections, 
both in Chap. 14, with appendices. Section 6, on minimal surfaces, has a
companion, Sect. 6B, on the second variation of area and consequences, and Sect. 12,
on nonlinear elliptic systems, has a companion, Sect. 12B, with complementary
material.




Having described the scope of this work, we find it necessary to mention a 
number of topics in PDE that are not covered here or are touched on only very 
briefly. 

For example, we devote little attention to the real analytic theory of PDE. We 
note that harmonic functions on domains in $\mathbb{R}^n$ are real analytic, but we do not
discuss the analyticity of solutions to more general elliptic equations. We do 
prove the Cauchy—Kowalewsky theorem, on analytic PDE with analytic Cauchy 
data. We derive some simple results on unique continuation from these few 
analyticity results, but there is a large body of lore on unique continuation, for 
solutions to nonanalytic PDE, neglected here. 

There is little material on numerical methods. There are a few references to 
applications of the FFT and of “splitting methods.” Difference schemes for PDE 
are mentioned just once, in a set of exercises on scalar conservation laws. Finite 
element methods are neglected, as are many other numerical techniques. 

There is a large body of work on free boundary problems, but the only one 
considered here is a simple one-space dimensional problem, in Chap. 15. 

While we have considered a variety of equations arising from classical physics 
and from relativity, we have devoted relatively little attention to quantum 
mechanics. We have considered a few quantum systems in Chap. 8, including 
models of the hydrogen atom and the deuteron. Also, there are some exercises on 
potential scattering mentioned in Chap. 9. However, the physical theories behind 
these equations are not discussed here. 

There are a number of nonlinear evolution equations, such as the
Korteweg–de Vries equation, that have been perceived to provide infinite dimensional
analogues of completely integrable Hamiltonian systems, and to arise “universally”
in asymptotic analyses of solutions to various nonlinear wave equations. They are 
not here. Nor is there a treatment of the Yang—Mills equations for gauge fields, 
with their wonderful applications to the geometry and topology of 
four-dimensional manifolds. 

Of course, this is not a complete list of omitted material. One can go on and on 
listing important topics in this vast subject. The author can at best hope that the 
reader will find it easier to understand many of these topics with this book, than 
without it. 


Acknowledgments 


I have had the good fortune to teach at least one course relevant to the material of
this book, almost every year since 1971. These courses led to many course notes, 
and I am grateful to many colleagues at Rice University, SUNY at Stony Brook, 
the California Institute of Technology, and the University of North Carolina, for 
the supportive atmospheres at these institutions. Also, a number of individuals 
provided valuable advice on various portions of the manuscript, as it grew over 
the years. I particularly want to thank Florin David, David Ebin, Frank Jones,
Anna Mazzucato, Richard Melrose, James Ralston, Jeffrey Rauch, Santiago 
Simanca, and James York. The final touches were put on the manuscript while I 
was visiting the Institute for Mathematics and its Applications, at the University 
of Minnesota, which I thank for its hospitality and excellent facilities. 

Finally, I would like to acknowledge the impact on my studies of my senior 
thesis and Ph.D. thesis advisors, Edward Nelson and Heinz Cordes. 


Preface to the Second Edition 


In addition to making numerous small corrections to this work, collected over the 
past dozen years, I have taken the opportunity to make some very significant 
changes, some of which broaden the scope of the work, some of which clarify 
previous presentations, and a few of which correct errors that have come to my 
attention. 

There are seven additional sections in this edition, two in Volume 1, two in 
Volume 2, and three in Volume 3. Chapter 4 has a new section, “Sobolev spaces 
on rough domains,” which serves to clarify the treatment of the Dirichlet problem 
on rough domains in Chap. 5. Chapter 6 has a new section, “Boundary layer 
phenomena for the heat equation,” which will prove useful in one of the new 
sections in Chap. 17. Chapter 7 has a new section, “Operators of harmonic 
oscillator type,” and Chap. 10 has a section that presents an index formula for 
elliptic systems of operators of harmonic oscillator type. Chapter 13 has a new 
appendix, “Variations on complex interpolation,” which has material that is 
useful in the study of Zygmund spaces. Finally, Chap. 17 has two new sections, 
“Vanishing viscosity limits” and “From velocity convergence to flow 
convergence.” 

In addition, several other sections have been substantially rewritten, and 
numerous others polished to reflect insights gained through the use of these books 
over time. 


Preface to the Third Edition 


I have provided further polishings and supplements for this third edition. New 
material in Volume 1 includes a section on rigid body motion in Chapter 1, which
will tie in to the derivation of the Euler equation of incompressible fluid flow in 
Chapter 17. Chapter 3 has a new appendix on the central limit theorem, related to a 
random walk, which will tie in to the treatment of Brownian motion in Chapter 11. 
In addition there is an expanded treatment of the Poisson integral in Chapter 5, a 
section on the Schrödinger equation in Chapter 6, and an expanded treatment of
holomorphic functional calculus in Appendix A. 




New material in Volume 2 includes sections on a quantum model of the 
deuteron, a quantum adiabatic theorem, and a quantum ergodic theorem, and 
appendices on the classical ergodic theorem and on shifted wave equations in 
Chapter 8, as well as expanded treatments of the spectral theorem and of analysis 
on hyperbolic space in that chapter. In Chapter 11 I have added a section on 
diffusion on Riemannian manifolds, with application to models of relativistic 
diffusion. 

New material in Volume 3 includes a section on overdetermined elliptic 
systems in Chapter 14 and a section on Euler flows on rotating surfaces,
influenced by the Coriolis force, in Chapter 17.


Chapel Hill, USA Michael E. Taylor 


Contents 


Contents of Volumes I and III
Preface
7 Pseudodifferential Operators
   1 The Fourier integral representation and symbol classes
   2 Schwartz kernels of pseudodifferential operators
   3 Adjoints and products
   4 Elliptic operators and parametrices
   5 $L^2$-estimates
   6 Gårding's inequality
   7 Hyperbolic evolution equations
   8 Egorov's theorem
   9 Microlocal regularity
   10 Operators on manifolds
   11 The method of layer potentials
   12 Parametrix for regular elliptic boundary problems
   13 Parametrix for the heat equation
   14 The Weyl calculus
   15 Operators of harmonic oscillator type
   16 Positive quantization of $C^\infty(S^*M)$
   References

8 Spectral Theory
   1 The spectral theorem
   2 Self-adjoint differential operators
   3 Heat asymptotics and eigenvalue asymptotics
   4 The Laplace operator on $S^n$
   5 The Laplace operator on hyperbolic space
   6 The harmonic oscillator
   7 The quantum Coulomb problem
   8 Potential well-quantum model of a deuteron
   9 The Laplace operator on cones
   10 Quantum adiabatic limit and parallel transport
   11 A quantum ergodic theorem
   A Von Neumann's mean ergodic theorem
   B Wave equations and shifted wave equations
   References

9 Scattering by Obstacles


   The scattering problem
   Connections with the wave equation
   Wave operators
   Translation representation and the Lax-Phillips semigroup $Z(t)$
   Integral equations and scattering poles
   Trace formulas; the scattering phase

10 Dirac Operators and Index Theory
   Operators of Dirac type
   Spinors
   Weitzenbock formulas
   Index of Dirac operators
   Proof of the local index formula
   The Chern-Gauss-Bonnet theorem
   Index of operators of harmonic oscillator type
   References

11 Brownian Motion and Potential Theory
   Brownian motion and Wiener measure
   The Feynman-Kac formula
   The Dirichlet problem and diffusion on domains
   First exit time and the Poisson integral


   7 Stochastic integrals
   8 Stochastic integrals, II
   9 Stochastic differential equations
   10 Application to equations of diffusion
   11 Diffusion on Riemannian manifolds
   A The Trotter product formula
   References

12 The $\bar\partial$-Neumann Problem
   6 The Hodge decomposition and the $\bar\partial$-equation
   7 The Bergman projection and Toeplitz operators
   8 The $\bar\partial$-Neumann problem on $(0,q)$-forms
   9 Reduction to pseudodifferential equations on the boundary
   10 The $\bar\partial$-equation on complex manifolds and almost complex manifolds
   B Complements on the Levi form
   C The Neumann operator for the Dirichlet problem
   References

C Connections and Curvature
   Covariant derivatives and curvature on general vector bundles
   References

Index


7. Pseudodifferential Operators


Introduction 


In this chapter we discuss the basic theory of pseudodifferential operators as it
has been developed to treat problems in linear PDE. We define pseudodifferential
operators with symbols in classes denoted $S^m_{\rho,\delta}$, introduced by L. Hörmander. In
§2 we derive some useful properties of their Schwartz kernels. In §3 we discuss
adjoints and products of pseudodifferential operators. In §4 we show how the
algebraic properties can be used to establish the regularity of solutions to elliptic
PDE with smooth coefficients. In §5 we discuss mapping properties on $L^2$ and on
the Sobolev spaces $H^s$. In §6 we establish Gårding's inequality.

In §7 we apply some of the previous material to establish the existence of
solutions to hyperbolic equations. In §8 we show that certain important classes
of pseudodifferential operators are preserved under the action of conjugation by
solution operators to (scalar) hyperbolic equations, a result of Y. Egorov. We
introduce the notion of wave front set in §9 and discuss the microlocal regularity of
solutions to elliptic equations. We also discuss how solution operators to a class
of hyperbolic equations propagate wave front sets. In §10 there is a brief
discussion of pseudodifferential operators on manifolds.

We give some further applications of pseudodifferential operators in the next
three sections. In §11 we discuss, from the perspective of the pseudodifferential
operator calculus, the classical method of layer potentials, applied particularly to
the Dirichlet and Neumann boundary problems for the Laplace operator.
Historically, this sort of application was one of the earliest stimuli for the development
of the theory of singular integral equations. One function of §11 is to provide a
warm-up for the use of similar integral equations to tackle problems in scattering
theory, in §7 of Chap. 9. We also discuss the development of layer potential theory
on various classes of domains with rough boundary. Section 12 looks at general
regular elliptic boundary problems and includes material complementary to that
developed in §11 of Chap. 5. In §13 we construct a parametrix for the heat
equation and apply this to obtain an asymptotic expansion of the trace of the solution
operator. This expansion will be useful in studies of the spectrum in Chap. 8 and
in index theory in Chap. 10.


© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
M. E. Taylor, Partial Differential Equations II, Applied Mathematical Sciences 116,
https://doi.org/10.1007/978-3-031-33700-0_7


In §14 we introduce the Weyl calculus. This can provide a powerful alternative
to the operator calculus developed in §§1-6, as can be seen in [Ho4] and in Vol. 3
of [Ho5]. Here we concentrate on identities, tied to symmetries in the Weyl
calculus. We show how this leads to a quicker construction of a parametrix for the
heat equation than the method used in §13. We will make use of this in §10 of
Chap. 10, on a direct attack on the index theorem for elliptic differential operators
on two-dimensional manifolds.

In §15, we study a class of pseudodifferential operators of “harmonic oscillator
type.” This class contains the harmonic oscillator,

     $H = -\Delta + |x|^2,$

with symbol $|x|^2 + |\xi|^2$, and results on these operators are interesting variants on
those with symbols in $S^m_{1,0}$.

In §16, we return to the issue of defining pseudodifferential operators on a
compact manifold $M$, phrasing it as a quantization of smooth functions on $S^*M$.
We discuss the existence of positive quantizations of $C^\infty(S^*M)$. Results here
will be of use in the treatment of quantum ergodic theorems, in Chapter 8.

Material in §§1-10 is taken from Chap. 0 of [T4], and the author thanks
Birkhäuser Boston for permission to use this material. We also mention some
books that take the theory of pseudodifferential operators farther than is done
here: [Ho5, Kg, T1], and [Tre].


1. The Fourier integral representation and symbol classes 


Using a slightly different convention from that established in Chap. 3, we write
the Fourier inversion formula as

(1.1)     $f(x) = \int \hat f(\xi)\, e^{ix\cdot\xi}\, d\xi,$

where $\hat f(\xi) = (2\pi)^{-n} \int f(x)\, e^{-ix\cdot\xi}\, dx$ is the Fourier transform of a function on
$\mathbb{R}^n$. If one differentiates (1.1), one obtains

(1.2)     $D^\alpha f(x) = \int \xi^\alpha \hat f(\xi)\, e^{ix\cdot\xi}\, d\xi,$

where $D^\alpha = D_1^{\alpha_1} \cdots D_n^{\alpha_n}$, $D_j = (1/i)\, \partial/\partial x_j$. Hence, if

     $p(x, D) = \sum_{|\alpha| \le k} a_\alpha(x) D^\alpha$

is a differential operator, we have

(1.3)     $p(x, D) f(x) = \int p(x, \xi)\, \hat f(\xi)\, e^{ix\cdot\xi}\, d\xi,$

where

     $p(x, \xi) = \sum_{|\alpha| \le k} a_\alpha(x)\, \xi^\alpha.$

One uses the Fourier integral representation (1.3) to define pseudodifferential
operators, taking the function $p(x,\xi)$ to belong to one of a number of different
classes of symbols. In this chapter we consider the following symbol classes, first
defined by Hörmander [Ho2].

Assuming $\rho, \delta \in [0,1]$, $m \in \mathbb{R}$, we define $S^m_{\rho,\delta}$ to consist of $C^\infty$-functions
$p(x, \xi)$ satisfying

(1.4)     $|D_x^\beta D_\xi^\alpha p(x, \xi)| \le C_{\alpha\beta}\, \langle\xi\rangle^{m - \rho|\alpha| + \delta|\beta|},$

for all $\alpha, \beta$, where $\langle\xi\rangle = (1 + |\xi|^2)^{1/2}$. In such a case we say the associated
operator defined by (1.3) belongs to $OPS^m_{\rho,\delta}$. We say that $p(x,\xi)$ is the symbol of
$p(x, D)$. The case of principal interest is $\rho = 1$, $\delta = 0$. This class is defined by
[KN].
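As a small numerical illustration of the estimate (1.4), the following sketch checks, in one variable, that the $x$-independent symbol $\langle\xi\rangle^s$ satisfies the first-order bound $|\partial_\xi \langle\xi\rangle^s| \le |s|\,\langle\xi\rangle^{s-1}$, which is the case $|\alpha| = 1$, $\rho = 1$ of (1.4). The function names, the finite-difference step, and the sample grid are choices made here for illustration; this is a sanity check, not a proof.

```python
import math

def bracket(xi: float) -> float:
    """Japanese bracket <xi> = (1 + xi^2)^(1/2) in one dimension."""
    return math.sqrt(1.0 + xi * xi)

def symbol(xi: float, s: float) -> float:
    """The model symbol <xi>^s of order s."""
    return bracket(xi) ** s

def derivative(xi: float, s: float, h: float = 1e-5) -> float:
    """Central finite-difference approximation of d/dxi <xi>^s."""
    return (symbol(xi + h, s) - symbol(xi - h, s)) / (2.0 * h)

def worst_ratio(s: float, xis) -> float:
    """Largest value of |d/dxi <xi>^s| / <xi>^(s-1) over the sample points."""
    return max(abs(derivative(xi, s)) / bracket(xi) ** (s - 1.0) for xi in xis)

# Sample points covering roughly [-500, 500].
sample_points = [k * 0.37 - 500.0 for k in range(2703)]

for s in (-1.5, 0.5, 2.0):
    # Each ratio should stay below |s|, uniformly in xi.
    print(s, worst_ratio(s, sample_points))
```

The exact ratio is $|s|\,|\xi|/\langle\xi\rangle$, so the printed values approach $|s|$ from below as the grid extends to larger $|\xi|$.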

Recall that in Chap. 3, §8, we defined $P(\xi) \in S^m_1(\mathbb{R}^n)$ to satisfy (1.4), with
$\rho = 1$, and with no $x$-derivatives involved. Thus $S^m_{1,0}$ contains $S^m_1(\mathbb{R}^n)$.

If there are smooth $p_{m-j}(x, \xi)$, homogeneous in $\xi$ of degree $m - j$ for $|\xi| \ge 1$,
that is, $p_{m-j}(x, r\xi) = r^{m-j} p_{m-j}(x, \xi)$ for $r, |\xi| \ge 1$, and if

(1.5)     $p(x, \xi) \sim \sum_{j \ge 0} p_{m-j}(x, \xi)$

in the sense that

(1.6)     $p(x, \xi) - \sum_{j=0}^{N} p_{m-j}(x, \xi) \in S^{m-N-1}_{1,0},$

for all $N$, then we say $p(x,\xi) \in S^m_{cl}$, or just $p(x,\xi) \in S^m$. We call $p_m(x, \xi)$
the principal symbol of $p(x, D)$. We will give a more general definition of the
principal symbol in §10.

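It is easy to see that if $p(x, \xi) \in S^m_{\rho,\delta}$ and $\rho, \delta \in [0,1]$, then $p(x, D) : \mathcal{S}(\mathbb{R}^n) \to
C^\infty(\mathbb{R}^n)$. In fact, multiplying (1.3) by $x^\alpha$, writing $x^\alpha e^{ix\cdot\xi} = (-D_\xi)^\alpha e^{ix\cdot\xi}$, and
integrating by parts yield

(1.7)     $p(x, D) : \mathcal{S}(\mathbb{R}^n) \longrightarrow \mathcal{S}(\mathbb{R}^n).$

Under one restriction, $p(x, D)$ also acts on tempered distributions:

Lemma 1.1. If $\delta < 1$, then

(1.8)     $p(x, D) : \mathcal{S}'(\mathbb{R}^n) \longrightarrow \mathcal{S}'(\mathbb{R}^n).$

Proof. Given $u \in \mathcal{S}'$, $v \in \mathcal{S}$, we have (formally)

(1.9)     $\langle v, p(x, D)u \rangle = \langle p_v, \hat u \rangle,$

where

     $p_v(\xi) = (2\pi)^{-n} \int v(x)\, p(x, \xi)\, e^{ix\cdot\xi}\, dx.$

Now integration by parts gives

     $\xi^\alpha p_v(\xi) = (2\pi)^{-n} \int (-D_x)^\alpha \bigl( v(x)\, p(x, \xi) \bigr)\, e^{ix\cdot\xi}\, dx,$

so

     $|p_v(\xi)| \le C_\alpha \langle\xi\rangle^{m + \delta|\alpha| - |\alpha|}.$

Thus if $\delta < 1$, we have rapid decrease of $p_v(\xi)$. Similarly, we get rapid decrease
of derivatives of $p_v(\xi)$, so it belongs to $\mathcal{S}$. Thus the right side of (1.9) is well
defined.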


In §5 we will analyze the action of pseudodifferential operators on Sobolev
spaces.

Classes of symbols more general than $S^m_{\rho,\delta}$ have been introduced by R. Beals
and C. Fefferman [BF, Be], and still more general classes were studied by
Hörmander [Ho4]. These classes have some deep applications, but they will
not be used in this book.


Exercises 


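1. Show that, for $a(x, \xi) \in \mathcal{S}(\mathbb{R}^{2n})$,

(1.10)     $a(x, D)u = \int \hat a(q, p)\, e^{iq\cdot X} e^{ip\cdot D} u(x)\, dq\, dp,$

where $\hat a(q, p)$ is the Fourier transform of $a(x, \xi)$, and the operators $e^{iq\cdot X}$ and $e^{ip\cdot D}$
are defined by

     $e^{iq\cdot X} u(x) = e^{iq\cdot x} u(x), \qquad e^{ip\cdot D} u(x) = u(x + p).$

2. Establish the identity

(1.11)     $e^{ip\cdot D} e^{iq\cdot X} = e^{iq\cdot p}\, e^{iq\cdot X} e^{ip\cdot D}.$

Deduce that, for $(t, q, p) \in \mathbb{R} \times \mathbb{R}^n \times \mathbb{R}^n = \mathbb{H}^n$, the binary operation

(1.12)     $(t, q, p) \circ (t', q', p') = (t + t' + p\cdot q',\ q + q',\ p + p')$

gives a group and that

(1.13)     $\tilde\pi(t, q, p) = e^{it}\, e^{iq\cdot X} e^{ip\cdot D}$

defines a unitary representation of $\mathbb{H}^n$ on $L^2(\mathbb{R}^n)$; in particular, it is a group
homomorphism: $\tilde\pi(z \circ z') = \tilde\pi(z)\tilde\pi(z')$. $\mathbb{H}^n$ is called the Heisenberg group.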
3. Give a definition of $a(x - q, D - p)$, acting on $u(x)$. Show that

     $a(x - q, D - p) = \tilde\pi(0, q, p)\, a(x, D)\, \tilde\pi(0, q, p)^{-1}.$


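4. Assume $a(x, \xi) \in S^m_{\rho,\delta}$ and $b(x, \xi) \in \mathcal{S}(\mathbb{R}^n \times \mathbb{R}^n)$. Show that $c(x, \xi) = (b * a)(x, \xi)$
belongs to $S^m_{\rho,\delta}$ ($*$ being convolution on $\mathbb{R}^{2n}$). Show that

     $c(x, D)u = \int b(y, \eta)\, a(x - y, D - \eta)u\, dy\, d\eta.$

5. Show that the map $\Psi(p, u) = p(x, D)u$ has a unique, continuous, bilinear extension
from $S^m_{\rho,\delta} \times \mathcal{S}(\mathbb{R}^n) \to \mathcal{S}(\mathbb{R}^n)$ to

     $\Psi : \mathcal{S}'(\mathbb{R}^{2n}) \times \mathcal{S}(\mathbb{R}^n) \longrightarrow \mathcal{S}'(\mathbb{R}^n),$

so that $p(x, D)$ is “well defined” for any $p \in \mathcal{S}'(\mathbb{R}^n \times \mathbb{R}^n)$.
6. Let $\chi(\xi) \in C_0^\infty(\mathbb{R}^n)$ be 1 for $|\xi| \le 1$, $\chi_\varepsilon(\xi) = \chi(\varepsilon\xi)$. Given $p(x, \xi) \in S^m_{\rho,\delta}$, let
$p_\varepsilon(x, \xi) = \chi_\varepsilon(\xi)\, p(x, \xi)$. Show that if $\rho, \delta \in [0, 1]$, then

(1.14)     $u \in \mathcal{S}(\mathbb{R}^n) \Longrightarrow p_\varepsilon(x, D)u \to p(x, D)u$ in $\mathcal{S}(\mathbb{R}^n)$.

If also $\delta < 1$, show that

(1.15)     $u \in \mathcal{S}'(\mathbb{R}^n) \Longrightarrow p_\varepsilon(x, D)u \to p(x, D)u$ in $\mathcal{S}'(\mathbb{R}^n)$,

where we give $\mathcal{S}'(\mathbb{R}^n)$ the weak* topology.
7. For $s \in \mathbb{R}$, define $\Lambda^s : \mathcal{S}'(\mathbb{R}^n) \to \mathcal{S}'(\mathbb{R}^n)$ by

(1.16)     $\Lambda^s u(x) = \int \langle\xi\rangle^s\, \hat u(\xi)\, e^{ix\cdot\xi}\, d\xi,$

where $\langle\xi\rangle = (1 + |\xi|^2)^{1/2}$. Show that $\Lambda^s \in OPS^s$.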
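8. Given $p_j(x, \xi) \in S^{m_j}_{\rho,\delta}$, for $j \ge 0$, with $\rho, \delta \in [0,1]$ and $m_j \searrow -\infty$, show that there
exists $p(x, \xi) \in S^{m_0}_{\rho,\delta}$ such that

     $p(x, \xi) \sim \sum_{j \ge 0} p_j(x, \xi),$

in the sense that, for all $k$,

     $p(x, \xi) - \sum_{j=0}^{k-1} p_j(x, \xi) \in S^{m_k}_{\rho,\delta}.$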
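The group law (1.12) and the homomorphism property of (1.13) lend themselves to a quick numerical sanity check. The sketch below works in one space dimension and applies $\tilde\pi(t,q,p)u(x) = e^{it}e^{iqx}u(x+p)$ to a test function; the names `compose`, `rep`, and `gaussian`, and the sample group elements, are illustrative choices made here and are not part of the exercises.

```python
import cmath

def compose(z, w):
    """Group law (1.12) on H^1: (t,q,p) o (t',q',p') = (t + t' + p q', q + q', p + p')."""
    t, q, p = z
    t2, q2, p2 = w
    return (t + t2 + p * q2, q + q2, p + p2)

def rep(z, u):
    """Return the function pi(t,q,p)u : x -> e^{it} e^{iqx} u(x + p), as in (1.13)."""
    t, q, p = z
    return lambda x: cmath.exp(1j * (t + q * x)) * u(x + p)

def gaussian(x):
    """A test function in the Schwartz class (hypothetical choice)."""
    return cmath.exp(-x * x / 2.0 + 0.3j * x)

z1 = (0.4, -1.1, 0.7)
z2 = (-0.2, 0.5, 1.3)
z3 = (1.0, 2.0, -0.6)

# Homomorphism property: pi(z1 o z2) u = pi(z1) (pi(z2) u), sampled at a few points.
lhs = rep(compose(z1, z2), gaussian)
rhs = rep(z1, rep(z2, gaussian))
max_gap = max(abs(lhs(x) - rhs(x)) for x in [-2.0, -0.5, 0.0, 0.8, 3.1])

# Associativity of the group law.
assoc_gap = max(
    abs(a - b)
    for a, b in zip(compose(compose(z1, z2), z3), compose(z1, compose(z2, z3)))
)

print(max_gap, assoc_gap)  # both should be at the level of rounding error
```

The check relies on the commutation relation (1.11): moving $e^{ip\cdot D}$ past $e^{iq'\cdot X}$ produces exactly the extra phase $e^{ip\cdot q'}$ recorded in the first component of (1.12).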


2. Schwartz kernels of pseudodifferential operators 


To an operator $p(x, D) \in OPS^m_{\rho,\delta}$ defined by (1.3) there corresponds a Schwartz
kernel $K \in \mathcal{D}'(\mathbb{R}^n \times \mathbb{R}^n)$, satisfying

(2.1)     $\langle u(x)v(y), K \rangle = \iint u(x)\, p(x, \xi)\, \hat v(\xi)\, e^{ix\cdot\xi}\, d\xi\, dx$
               $= (2\pi)^{-n} \iiint u(x)\, p(x, \xi)\, e^{i(x-y)\cdot\xi}\, v(y)\, dy\, d\xi\, dx.$

Thus, $K$ is given as an “oscillatory integral”

(2.2)     $K = (2\pi)^{-n} \int p(x, \xi)\, e^{i(x-y)\cdot\xi}\, d\xi.$

We have the following basic result.

Proposition 2.1. If $\rho > 0$, then $K$ is $C^\infty$ off the diagonal in $\mathbb{R}^n \times \mathbb{R}^n$.

Proof. For given $\alpha \ge 0$,

(2.3)     $(x - y)^\alpha K = (2\pi)^{-n} \int e^{i(x-y)\cdot\xi}\, (-D_\xi)^\alpha p(x, \xi)\, d\xi.$

This integral is clearly absolutely convergent for $|\alpha|$ so large that $m - \rho|\alpha| < -n$.
Similarly, it is seen that applying $j$ derivatives to (2.3) yields an absolutely
convergent integral provided $m + j - \rho|\alpha| < -n$, so in that case $(x - y)^\alpha K \in
C^j(\mathbb{R}^n \times \mathbb{R}^n)$. This gives the proof.
Generally, if $T$ has the mapping properties

     $T : C_0^\infty(\mathbb{R}^n) \longrightarrow C^\infty(\mathbb{R}^n), \qquad T : \mathcal{E}'(\mathbb{R}^n) \longrightarrow \mathcal{D}'(\mathbb{R}^n),$

and its Schwartz kernel $K$ is $C^\infty$ off the diagonal, it follows easily that

     sing supp $Tu \subset$ sing supp $u$, for $u \in \mathcal{E}'(\mathbb{R}^n)$.

This is called the pseudolocal property. By (1.7)-(1.8) it holds for $T \in OPS^m_{\rho,\delta}$
if $\rho > 0$ and $\delta < 1$.

We remark that the proof of Proposition 2.1 leads to the estimate

(2.4)     $|D_{x,y}^\beta K| \le C |x - y|^{-k},$

where $k \ge 0$ is any integer strictly greater than $(1/\rho)(m + n + |\beta|)$. In fact, this
estimate is rather crude. It is of interest to record a more precise estimate that
holds when $p(x, \xi) \in S^m_{1,\delta}$.


Proposition 2.2. If $p(x, \xi) \in S^m_{1,\delta}$, then the Schwartz kernel $K$ of $p(x, D)$
satisfies estimates

(2.5)     $|D_{x,y}^\beta K| \le C |x - y|^{-n-m-|\beta|},$

provided $m + |\beta| > -n$.

The result is easily reduced to the case $p(x, \xi) = p(\xi)$, satisfying $|D^\alpha p(\xi)| \le
C_\alpha \langle\xi\rangle^{m-|\alpha|}$, for which $p(D)$ has Schwartz kernel $K = \hat p(y - x)$. It suffices to
prove (2.5) for such a case, for $\beta = 0$ and $m > -n$. We make use of the following
simple but important characterization of such symbols.

Assertion. Given $p(\xi) \in C^\infty(\mathbb{R}^n)$, it belongs to $S^m_1$ if and only if

(2.6)     $p_r(\xi) = r^{-m} p(r\xi)$ is bounded in $C^\infty(1 \le |\xi| \le 2)$, for $r \in [1, \infty)$.


Using this, we establish the following. 


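Lemma 2.3. Given $p(\xi) \in S^m_1(\mathbb{R}^n)$, we can write

     $p(\xi) = p_0(\xi) + \int_0^\infty q_\tau(e^{-\tau}\xi)\, d\tau,$

with $p_0(\xi) \in C_0^\infty(\mathbb{R}^n)$ and

     $\{ e^{-m\tau} q_\tau(\xi) : \tau \in [0, \infty) \}$ bounded in $\mathcal{S}(\mathbb{R}^n)$.

Proof. We describe the following decomposition of a symbol $p(\xi) \in S^m_1(\mathbb{R}^n)$.
Start with

     $A \in C_0^\infty(\mathbb{R}^n)$, radial, supported in $\{ \xi \in \mathbb{R}^n : 1 \le |\xi| \le 2 \}$.

Set

     $B(\xi) = \int_{-\infty}^{\infty} A(e^{-\tau}\xi)\, d\tau,$

a radial function, satisfying

     $B(r\xi) = B(\xi),$

for $r > 0$. Hence $B(\xi)$ is constant. Scaling, we can arrange that $B(\xi) = 1$. Write

     $1 = B(\xi) = B_0(\xi) + B_1(\xi),$

with

     $B_1(\xi) = \int_0^\infty A(e^{-\tau}\xi)\, d\tau, \qquad B_0(\xi) = \int_{-\infty}^0 A(e^{-\tau}\xi)\, d\tau.$

We have

     $B_1(\xi) = 0$ for $|\xi| \le 1$, $\qquad B_0(\xi) = 0$ for $|\xi| \ge 2$,

hence

     $B_0 \in C_0^\infty(\mathbb{R}^n).$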


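Now write

     $p(\xi) = B_0(\xi)p(\xi) + B_1(\xi)p(\xi)$
          $= p_0(\xi) + \int_0^\infty p(\xi)\, A(e^{-\tau}\xi)\, d\tau$
          $= p_0(\xi) + \int_0^\infty q_\tau(e^{-\tau}\xi)\, d\tau,$

where

     $q_\tau(\xi) = p(e^{\tau}\xi)\, A(\xi).$

For each $\tau$, $q_\tau$ is smooth and supported in $\{ \xi \in \mathbb{R}^n : 1 \le |\xi| \le 2 \}$. Furthermore,
by (2.6),

     $p(\xi) \in S^m_1(\mathbb{R}^n) \Longrightarrow \{ e^{-m\tau} q_\tau : \tau \in [0, \infty) \}$ is bounded in $\mathcal{S}(\mathbb{R}^n)$.

This proves Lemma 2.3.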
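The key step in the decomposition above is that $B(\xi) = \int_{-\infty}^{\infty} A(e^{-\tau}\xi)\,d\tau$ does not depend on $\xi \ne 0$. A minimal numerical sketch of this scaling invariance follows; the particular bump profile and the midpoint quadrature are assumptions made here for illustration, and any smooth radial $A$ supported in $1 \le |\xi| \le 2$ would serve.

```python
import math

def bump(s: float) -> float:
    """A smooth radial profile supported in 1 <= s <= 2 (a hypothetical choice of A)."""
    if s <= 1.0 or s >= 2.0:
        return 0.0
    return math.exp(-1.0 / ((s - 1.0) * (2.0 - s)))

def B(r: float, steps: int = 20000) -> float:
    """Midpoint-rule approximation of the integral of A(e^{-tau} r) over tau, for |xi| = r > 0."""
    # A(e^{-tau} r) is nonzero only when log(r) - log(2) < tau < log(r).
    lo, hi = math.log(r) - math.log(2.0), math.log(r)
    h = (hi - lo) / steps
    return h * sum(bump(math.exp(-(lo + (k + 0.5) * h)) * r) for k in range(steps))

# The value is the same for every radius, up to rounding.
for r in (0.3, 1.0, 57.0):
    print(r, B(r))
```

The invariance is just the substitution $\tau \mapsto \tau + \log r$; dividing $A$ by the common value of $B$ gives the normalization $B \equiv 1$ used in the proof.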
To proceed with the proof of Proposition 2.2, in case $p(x,\xi) = p(\xi)$, we have
from Lemma 2.3 that

     $\{ e^{-m\tau} \hat q_\tau(z) : \tau \in [0, \infty) \}$ is bounded in $\mathcal{S}(\mathbb{R}^n)$.

In particular, $e^{-m\tau} |\hat q_\tau(z)| \le C_N \langle z \rangle^{-N}$, so

     $|\hat p(z)| \le |\hat p_0(z)| + C_N \int_0^\infty e^{(n+m)\tau} \bigl(1 + |e^{\tau} z|\bigr)^{-N}\, d\tau$

(2.7)          $\le C + C_N |z|^{-n-m} \int_{\log|z|}^\infty e^{(n+m)\tau} (1 + e^{\tau})^{-N}\, d\tau,$

which implies (2.5), for $\beta = 0$, $m > -n$.

We also see that in the case $m + |\beta| = -n$, we obtain a result upon replacing
the right side of (2.5) by $C \log |x - y|^{-1}$ (provided $|x - y| < 1/2$).

We can get a complete characterization of $\hat P(x) \in \mathcal{S}'(\mathbb{R}^n)$, given $P(\xi) \in
S^m_1(\mathbb{R}^n)$, provided $-n < m < 0$.


Proposition 2.4. Assume $-n < m < 0$. Let $q \in \mathcal{S}'(\mathbb{R}^n)$ be smooth outside
the origin and rapidly decreasing as $|x| \to \infty$. Then $q = \hat P$ for some $P(\xi) \in
S^m_1(\mathbb{R}^n)$ if and only if $q \in L^1_{loc}(\mathbb{R}^n)$ and, for $x \ne 0$,

(2.8)     $|D_x^\beta q(x)| \le C_\beta |x|^{-n-m-|\beta|}.$

Proof. That $P \in S^m_1(\mathbb{R}^n)$ implies (2.8) has been established above. For the
converse, write $q = q_0(x) + \sum_{j \ge 0} \psi_j(x) q(x)$, where $\psi_0 \in C_0^\infty(\mathbb{R}^n)$ is
supported in $1/2 < |x| < 2$, $\psi_j(x) = \psi_0(2^j x)$, $\sum_{j \ge 0} \psi_j(x) = 1$ on $|x| \le 1$. Since
$|q(x)| \le C|x|^{-n-m}$, $m < 0$, it follows that $\sum \psi_j(x) q(x)$ converges in $L^1$-norm.
Then $q_0 \in \mathcal{S}(\mathbb{R}^n)$. The hypothesis (2.8) implies that $2^{-nj-mj} \psi_j(2^{-j}x)\, q(2^{-j}x)$
is bounded in $\mathcal{S}(\mathbb{R}^n)$, and an argument similar to that used for Proposition 2.2
implies $\hat q_0(\xi) + \sum_{j=0}^\infty (\psi_j q)\hat{\ }(\xi) \in S^m_1(\mathbb{R}^n)$.


We will deal further with the space of elements of $\mathcal{S}'(\mathbb{R}^n)$ that are smooth
outside the origin and rapidly decreasing (with all their derivatives) at infinity. We
will denote this space by $\mathcal{S}'_0(\mathbb{R}^n)$.

If $m \le -n$, the argument above extends to show that (2.8) is a sufficient
condition for $q = \hat P$ with $P \in S^m_1(\mathbb{R}^n)$, but, as noted above, there exist symbols
$P \in S^m_1(\mathbb{R}^n)$ for which $q = \hat P$ does not satisfy (2.8). Now, given that $q \in
\mathcal{S}'_0(\mathbb{R}^n)$, it is easy to see that

(2.9)     $\nabla q \in \mathcal{F}(S^{m+1}_1(\mathbb{R}^n)) \Longleftrightarrow q \in \mathcal{F}(S^m_1(\mathbb{R}^n)).$

Thus, if $-n - 1 < m \le -n$, then Proposition 2.4 is almost applicable to $\nabla q$, for
$n \ge 2$.


Proposition 2.5. Assume $n \ge 2$ and $-n - 1 < m \le -n$. If $q \in \mathcal{S}'_0(\mathbb{R}^n) \cap L^1_{loc}$,
then $q = \hat P$ for some $P \in S^m_1(\mathbb{R}^n)$ if and only if (2.8) holds for $|\beta| \ge 1$.

Proof. First note that the hypotheses imply $q \in L^1(\mathbb{R}^n)$; thus $\hat q(\xi)$ is continuous
and vanishes as $|\xi| \to \infty$. In the proposition, we need to prove the “if” part. To use
the reasoning behind Proposition 2.4, we need only deal with the fact that $\nabla q$ is
not assumed to be in $L^1_{loc}$. The sum $\sum \psi_j(x) \nabla q(x)$ still converges in $L^1(\mathbb{R}^n)$, and
so $\nabla q - \sum \psi_j(x) \nabla q$ is a sum of an element of $\mathcal{S}(\mathbb{R}^n)$ and possibly a distribution
(call it $\nu$) supported at 0. Thus $\hat\nu(\xi)$ is a polynomial. But as noted, $\hat q(\xi)$ is bounded,
so $\hat\nu(\xi)$ can have at most linear growth. Hence

     $\xi_j \hat q(\xi) = P_j(\xi) + \ell_j(\xi),$

where $P_j \in S^{m+1}_1(\mathbb{R}^n)$ and $\ell_j(\xi)$ is a first-order polynomial in $\xi$. Since $\hat q(\xi) \to 0$
as $|\xi| \to \infty$ and $m + 1 \le -n + 1 < 0$, we deduce that $\ell_j(\xi) = c_j$, a constant,
that is,

(2.10)     $\xi_j \hat q(\xi) = P_j(\xi) + c_j, \qquad P_j \in S^{m+1}_1(\mathbb{R}^n), \quad m + 1 < 0.$

Now the left side vanishes on the hyperplane $\xi_j = 0$, which is unbounded if $n \ge 2$.
This forces $c_j = 0$, and the proof of the proposition is then easily completed.


If we take $n = 1$ and assume $-2 < m < -1$, the rest of the hypotheses of
Proposition 2.5 still yield (2.10), so

     $\dfrac{dq}{dx} = \hat P_1 + c_1 \delta.$

If we also assume $q$ is continuous on $\mathbb{R}$, then $c_1 = 0$ and we again conclude that
$q = \hat P$ with $P \in S^m_1(\mathbb{R})$. But if $q$ has a simple jump at $x = 0$, then this conclusion
fails.


Proposition 2.4 can be given other extensions, which we leave to the reader.
We give a few examples that indicate ways in which the result does not extend,
making use of results from §8 of Chap. 3. As shown in (8.31) of that chapter, on
$\mathbb{R}^n$,

(2.11)     $v = \mathrm{PF}\, |x|^{-n} \Longrightarrow \hat v(\xi) = C_n \log |\xi|.$

Now $v$ is not rapidly decreasing at infinity, but if $\varphi(x)$ is a cut-off, belonging
to $C_0^\infty(\mathbb{R}^n)$ and equal to 1 near $x = 0$, then $f = \varphi v$ belongs to $\mathcal{S}'_0(\mathbb{R}^n)$ and
$\hat f = c\, \hat\varphi * \hat v$ behaves like $\log |\xi|$ as $|\xi| \to \infty$. One can then deduce that, for $n = 1$,

(2.12)     $f(x) = \varphi(x) \log |x|\ \mathrm{sgn}\, x \Longrightarrow \hat f(\xi) \sim C\, \xi^{-1} \log |\xi|, \quad |\xi| \to \infty.$

Thus Proposition 2.5 does not extend to the case $n = 1$, $m = -1$. However, we
note that, in this case, $\hat f$ belongs to $S^{-1+\varepsilon}_1(\mathbb{R})$, for all $\varepsilon > 0$. In contrast to (2.12),
note that, again for $n = 1$,

(2.13)     $g(x) = \varphi(x) \log |x| \Longrightarrow \hat g(\xi) \sim C |\xi|^{-1}, \quad |\xi| \to \infty.$

In this case, $(d/dx) \log |x| = \mathrm{PV}(1/x)$.

Of considerable utility is the classification of $\mathcal{F}(S^m_{cl}(\mathbb{R}^n))$. When $m = -j$ is a
negative integer, this was effectively solved in §§8 and 9 of Chap. 3. The following
result is what follows from the proof of Proposition 9.2 in Chap. 3.


Proposition 2.6. Assume $q \in \mathcal{S}'_0(\mathbb{R}^n) \cap L^1_{loc}(\mathbb{R}^n)$. Let $j = 1, 2, 3, \ldots$. Then
$q = \hat P$ for some $P \in S^{-j}_{cl}(\mathbb{R}^n)$ if and only if

(2.14)     $q \sim \sum_{\ell \ge 0} \bigl( q_\ell + p_\ell(x) \log |x| \bigr),$

where

(2.15)     $q_\ell \in \mathcal{H}^\#_{j+\ell-n}(\mathbb{R}^n),$

and $p_\ell(x)$ is a polynomial homogeneous of degree $j + \ell - n$; these log coefficients
appear only for $\ell \ge n - j$.

We recall that $\mathcal{H}^\#_\mu(\mathbb{R}^n)$ is the space of distributions on $\mathbb{R}^n$, homogeneous of
degree $\mu$, which are smooth on $\mathbb{R}^n \setminus 0$. For $\mu > -n$, $\mathcal{H}^\#_\mu(\mathbb{R}^n) \subset L^1_{loc}(\mathbb{R}^n)$. The
meaning of the expansion (2.14) is that, for any $k \in \mathbb{Z}^+$, there is an $N < \infty$ such
that the difference between $q$ and the sum over $\ell < N$ belongs to $C^k(\mathbb{R}^n)$. Note
that, for $n = 1$, the function $g(x)$ in (2.13) is of the form (2.14), but the function
$f(x)$ in (2.12) is not.

To go from the proof of Proposition 9.2 of Chap. 3 to the result stated above, it
suffices to note explicitly that

(2.16)     $\varphi(x)\, x^\alpha \log |x| \in \mathcal{F}(S^{-n-|\alpha|}_1(\mathbb{R}^n)),$

where $\varphi$ is the cut-off used before. Since $\mathcal{F}$ intertwines $D_\xi^\alpha$ and multiplication by
$x^\alpha$, it suffices to verify the case $\alpha = 0$, and this follows from the formula (2.11),
with $x$ and $\xi$ interchanged.

We can also classify Schwartz kernels of operators in $OPS^m_{1,0}$ and $OPS^m_{cl}$, if
we write the kernel $K$ of (2.2) in the form

(2.17)     $K(x, y) = L(x, x - y),$

with

(2.18)     $L(x, z) = (2\pi)^{-n} \int p(x, \xi)\, e^{iz\cdot\xi}\, d\xi.$

The following two results follow from the arguments given above.

Proposition 2.7. Assume $-n < m < 0$. Let $L \in \mathcal{S}'(\mathbb{R}^n \times \mathbb{R}^n)$ be a smooth
function of $x$ with values in $\mathcal{S}'_0(\mathbb{R}^n) \cap L^1(\mathbb{R}^n)$. Then (2.17) defines the Schwartz
kernel of an operator in $OPS^m_{1,0}$ if and only if, for $z \ne 0$,

(2.19)     $|D_x^\beta D_z^\gamma L(x, z)| \le C_{\beta\gamma} |z|^{-n-m-|\gamma|}.$

Proposition 2.8. Assume $L \in \mathcal{S}'(\mathbb{R}^n \times \mathbb{R}^n)$ is a smooth function of $x$ with values
in $\mathcal{S}'_0(\mathbb{R}^n) \cap L^1(\mathbb{R}^n)$. Let $j = 1, 2, 3, \ldots$. Then (2.17) defines the Schwartz kernel
of an operator in $OPS^{-j}_{cl}$ if and only if

(2.20)     $L(x, z) \sim \sum_{\ell \ge 0} \bigl( q_\ell(x, z) + p_\ell(x, z) \log |z| \bigr),$

where each $D_x^\beta q_\ell(x, \cdot)$ is a bounded continuous function of $x$ with values in
$\mathcal{H}^\#_{j+\ell-n}$, and $p_\ell(x, z)$ is a polynomial homogeneous of degree $j + \ell - n$ in $z$,
with coefficients that are bounded, together with all their $x$-derivatives.


Exercises 


1. Using the proof of Proposition 2.2, show that, given $p(x, \xi)$ defined on $\mathbb{R}^n \times \mathbb{R}^n$, then

     $|D_x^\beta D_\xi^\alpha p(x, \xi)| \le C \langle\xi\rangle^{-|\alpha|}$, for $|\beta| \le 1$, $|\alpha| \le n + 1 + |\beta|$,

implies

     $|K(x, y)| \le C|x - y|^{-n}$ and $|\nabla_{x,y} K(x, y)| \le C|x - y|^{-n-1}$.

2. If the map $\kappa$ is given by (2.2) (i.e., $\kappa(p) = K$) show that we get an isomorphism
$\kappa : \mathcal{S}'(\mathbb{R}^{2n}) \to \mathcal{S}'(\mathbb{R}^{2n})$. Reconsider Exercise 3 of §1.

3. Show that $\kappa$, defined in Exercise 2, gives an isomorphism (isometric up to a scalar
factor) $\kappa : L^2(\mathbb{R}^{2n}) \to L^2(\mathbb{R}^{2n})$. Deduce that $p(x, D)$ is a Hilbert-Schmidt operator
on $L^2(\mathbb{R}^n)$, precisely when $p(x, \xi) \in L^2(\mathbb{R}^{2n})$.


3. Adjoints and products 


Given $p(x,\xi) \in S^m_{\rho,\delta}$, we obtain readily from the definition that the adjoint is
given by

(3.1)     $p(x, D)^* v = (2\pi)^{-n} \int p(y, \xi)^*\, e^{i(x-y)\cdot\xi}\, v(y)\, dy\, d\xi.$

This is not quite in the form (1.3), as the amplitude $p(y, \xi)^*$ is not a function of
$(x, \xi)$. We need to transform (3.1) into such a form.

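Before continuing the analysis of (3.1), we are motivated to look at a general
class of operators

(3.2)     $Au(x) = (2\pi)^{-n} \int a(x, y, \xi)\, e^{i(x-y)\cdot\xi}\, u(y)\, dy\, d\xi.$

We assume

(3.3)     $|D_y^\gamma D_x^\beta D_\xi^\alpha a(x, y, \xi)| \le C_{\alpha\beta\gamma} \langle\xi\rangle^{m - \rho|\alpha| + \delta_1|\beta| + \delta_2|\gamma|}$

and then say $a(x, y, \xi) \in S^m_{\rho,\delta_1,\delta_2}$. A brief calculation transforms (3.2) into

(3.4)     $(2\pi)^{-n} \int q(x, \xi)\, e^{i(x-y)\cdot\xi}\, u(y)\, dy\, d\xi,$

with

(3.5)     $q(x, \xi) = (2\pi)^{-n} \int a(x, y, \eta)\, e^{i(x-y)\cdot(\eta-\xi)}\, dy\, d\eta$
               $= e^{iD_\xi \cdot D_y} a(x, y, \xi) \big|_{y=x}.$

Note that a formal expansion $e^{iD_\xi \cdot D_y} = I + iD_\xi \cdot D_y - (1/2)(D_\xi \cdot D_y)^2 + \cdots$
gives

(3.6)     $q(x, \xi) \sim \sum_{\alpha \ge 0} \dfrac{i^{|\alpha|}}{\alpha!}\, D_\xi^\alpha D_y^\alpha a(x, y, \xi) \big|_{y=x}.$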


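If $a(x, y, \xi) \in S^m_{\rho,\delta_1,\delta_2}$, with $0 \le \delta_2 < \rho \le 1$, then the general term in (3.6)
belongs to $S^{m-(\rho-\delta_2)|\alpha|}_{\rho,\delta}$, where $\delta = \max(\delta_1, \delta_2)$, so the sum on the right is
formally asymptotic. This suggests the following result:

Proposition 3.1. If $a(x, y, \xi) \in S^m_{\rho,\delta_1,\delta_2}$, with $0 \le \delta_2 < \rho \le 1$, then (3.2) defines
an operator

     $A \in OPS^m_{\rho,\delta}, \qquad \delta = \max(\delta_1, \delta_2).$

Furthermore, $A = q(x, D)$, where $q(x,\xi)$ has the asymptotic expansion (3.6), in
the sense that

     $q(x, \xi) - \sum_{|\alpha| < N} \dfrac{i^{|\alpha|}}{\alpha!}\, D_\xi^\alpha D_y^\alpha a(x, y, \xi) \big|_{y=x} \in S^{m-N(\rho-\delta_2)}_{\rho,\delta}.$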


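To prove this proposition, one can first show that the Schwartz kernel

     $K(x, y) = (2\pi)^{-n} \int a(x, y, \xi)\, e^{i(x-y)\cdot\xi}\, d\xi$

satisfies the same estimates as established in Proposition 2.1, and hence, altering
$A$ only by an operator in $OPS^{-\infty}$, we can assume $a(x, y, \xi)$ is supported on
$|x - y| \le 1$. Let

(3.7)     $\hat b(x, \eta, \xi) = (2\pi)^{-n} \int a(x, x + y, \xi)\, e^{-iy\cdot\eta}\, dy,$

so

(3.8)     $q(x, \xi) = \int \hat b(x, \eta, \xi + \eta)\, d\eta.$

The hypotheses on $a(x, y, \xi)$ imply

(3.9)     $|D_x^\beta D_\xi^\alpha \hat b(x, \eta, \xi)| \le C_{\nu\alpha\beta} \langle\xi\rangle^{m + \delta|\beta| + \delta_2\nu - \rho|\alpha|} \langle\eta\rangle^{-\nu},$

where $\delta = \max(\delta_1, \delta_2)$. Since $\delta_2 < 1$, it follows that $q(x,\xi)$ and any of its
derivatives can be bounded by some power of $\langle\xi\rangle$.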

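Now a power-series expansion of $\hat b(x, \eta, \xi + \eta)$ in the last argument about $\xi$
gives

(3.10)     $\Bigl| \hat b(x, \eta, \xi + \eta) - \sum_{|\alpha| < N} \dfrac{1}{\alpha!} (iD_\xi)^\alpha \hat b(x, \eta, \xi)\, \eta^\alpha \Bigr|$
               $\le C_\nu |\eta|^N \langle\eta\rangle^{-\nu} \sup_{0 \le t \le 1} \langle \xi + t\eta \rangle^{m + \delta_2\nu - \rho N}.$

Taking $\nu = N$, we get a bound on the left side of (3.10) by

(3.11)     $C \langle\xi\rangle^{m - (\rho - \delta_2)N}$ if $|\eta| \le \frac{1}{2}|\xi|$,

while taking $\nu$ large, we get a bound by any power of $\langle\eta\rangle^{-1}$ for $|\xi| \le 2|\eta|$. Hence

(3.12)     $\Bigl| q(x, \xi) - \sum_{|\alpha| < N} \dfrac{1}{\alpha!} (iD_\xi)^\alpha D_y^\alpha a(x, x + y, \xi) \big|_{y=0} \Bigr| \le C \langle\xi\rangle^{m + n - (\rho - \delta_2)N}.$

The proposition follows from this, plus similar estimates on the difference when
derivatives are applied.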
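If we apply Proposition 3.1 to (3.1), we obtain:

Proposition 3.2. If $p(x, D) \in OPS^m_{\rho,\delta}$, $0 \le \delta < \rho \le 1$, then

(3.13)     $p(x, D)^* = p^*(x, D) \in OPS^m_{\rho,\delta},$

with

(3.14)     $p^*(x, \xi) \sim \sum_{\alpha \ge 0} \dfrac{i^{|\alpha|}}{\alpha!}\, D_\xi^\alpha D_x^\alpha p(x, \xi)^*.$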


The result for products of pseudodifferential operators is the following.

Proposition 3.3. Given $p_j(x, D) \in OPS^{m_j}_{\rho_j,\delta_j}$, suppose

(3.15)     $0 \le \delta_2 < \rho \le 1$, with $\rho = \min(\rho_1, \rho_2)$.

Then

(3.16)     $p_1(x, D)\, p_2(x, D) = q(x, D) \in OPS^{m_1+m_2}_{\rho,\delta},$

with $\delta = \max(\delta_1, \delta_2)$, and

(3.17)     $q(x, \xi) \sim \sum_{\alpha \ge 0} \dfrac{i^{|\alpha|}}{\alpha!}\, D_\xi^\alpha p_1(x, \xi)\, D_x^\alpha p_2(x, \xi).$

This can be proved by writing

(3.18)     $p_1(x, D)\, p_2(x, D)u = p_1(x, D)\, p_2^*(x, D)^* u = Au,$

for $A$ as in (3.2), with

(3.19)     $a(x, y, \xi) = p_1(x, \xi)\, p_2^*(y, \xi)^*,$

and then applying Propositions 3.1 and 3.2, to obtain (3.16), with

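(3.20)     $q(x, \xi) \sim \sum_{\gamma, \sigma \ge 0} \dfrac{i^{|\sigma| - |\gamma|}}{\sigma!\, \gamma!}\, D_\xi^\sigma \bigl( p_1(x, \xi)\, D_\xi^\gamma D_x^{\gamma+\sigma} p_2(x, \xi) \bigr).$

The general term in this sum is equal to

     $\dfrac{i^{|\sigma| - |\gamma|}}{\sigma!\, \gamma!}\, D_\xi^\sigma \bigl( p_1(x, \xi)\, D_\xi^\gamma D_x^{\gamma+\sigma} p_2(x, \xi) \bigr).$

Evaluating this by the product rule

     $D_\xi^\sigma (uv) = \sum_{\alpha + \beta = \sigma} \binom{\sigma}{\alpha}\, D_\xi^\alpha u\, D_\xi^\beta v$

gives

(3.21)     $q(x, \xi) \sim \sum_{\alpha} \dfrac{i^{|\alpha|}}{\alpha!}\, D_\xi^\alpha p_1(x, \xi) \sum_{\beta, \gamma} \dfrac{i^{|\beta| - |\gamma|}}{\beta!\, \gamma!}\, D_\xi^{\beta+\gamma} D_x^{\alpha+\beta+\gamma} p_2(x, \xi).$

That this yields (3.17) follows from the fact that, whenever $|\mu| > 0$,

(3.22)     $\sum_{\beta + \gamma = \mu} \dfrac{i^{|\beta| - |\gamma|}}{\beta!\, \gamma!}\, D_\xi^{\beta+\gamma} D_x^{\alpha+\beta+\gamma} p_2(x, \xi) = 0,$

an identity we leave as an exercise.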
An alternative approach to a proof of Proposition 3.3 is to compute directly
that $p_1(x, D)\, p_2(x, D) = q(x, D)$, with

(3.23)     $q(x, \xi) = (2\pi)^{-n} \int p_1(x, \eta)\, p_2(y, \xi)\, e^{i(x-y)\cdot(\eta-\xi)}\, d\eta\, dy$
                $= e^{iD_\eta \cdot D_y}\, p_1(x, \eta)\, p_2(y, \xi) \big|_{y=x,\ \eta=\xi},$

and then apply an analysis such as used to prove Proposition 3.1. Carrying out this
latter approach has the advantage that the hypothesis (3.15) can be weakened to

     $0 \le \delta_2 < \rho_1 \le 1,$

which is quite natural since the right side of (3.17) is formally asymptotic under
such a hypothesis. Also, the symbol expansion (3.17) is more easily seen from
(3.23).

Note that if $P_j = p_j(x, D) \in OPS^{m_j}_{\rho,\delta}$ are scalar, and $0 \le \delta < \rho \le 1$, then the
leading terms in the expansions of the symbols of $P_1 P_2$ and $P_2 P_1$ agree. It follows
that the commutator

     $[P_1, P_2] = P_1 P_2 - P_2 P_1$

has order lower than $m_1 + m_2$. In fact, the symbol expansion (3.17) implies

(3.24)     $P_j \in OPS^{m_j}_{\rho,\delta}$ scalar $\Longrightarrow [P_1, P_2] \in OPS^{m_1+m_2-(\rho-\delta)}_{\rho,\delta}.$

Also, looking at the sum over $|\alpha| = 1$ in (3.17), we see that the leading term in
the expansion of the symbol of $[P_1, P_2]$ is given in terms of the Poisson bracket:

(3.25)     $[P_1, P_2] = q(x, D), \qquad q(x, \xi) = \dfrac{1}{i} \{p_1, p_2\}(x, \xi) \mod S^{m_1+m_2-2(\rho-\delta)}_{\rho,\delta}.$

The Poisson bracket $\{p_1, p_2\}$ is defined by

(3.26)     $\{p_1, p_2\}(x, \xi) = \sum_j \Bigl( \dfrac{\partial p_1}{\partial \xi_j} \dfrac{\partial p_2}{\partial x_j} - \dfrac{\partial p_1}{\partial x_j} \dfrac{\partial p_2}{\partial \xi_j} \Bigr),$

as in §10 of Chap. 1.
The result (3.25) plays an important role in the treatment of Egorov's theorem,
in §8.
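For polynomial symbols the expansion (3.17) terminates, so the product formula and the commutator statement (3.25) can be checked exactly. The sketch below is an illustration only: it works in one variable, stores a symbol $\sum c\, x^a \xi^b$ as a dictionary `{(a, b): c}` (a representation chosen here, not taken from the text), and uses $i^k D_\xi^k D_x^k = (-i)^k \partial_\xi^k \partial_x^k$.

```python
from math import factorial

def d_dx(sym):
    """Partial derivative in x of a polynomial symbol {(a, b): c} = sum of c x^a xi^b."""
    return {(a - 1, b): a * c for (a, b), c in sym.items() if a > 0}

def d_dxi(sym):
    """Partial derivative in xi of a polynomial symbol."""
    return {(a, b - 1): b * c for (a, b), c in sym.items() if b > 0}

def multiply(s1, s2):
    """Pointwise product of two polynomial symbols."""
    out = {}
    for (a1, b1), c1 in s1.items():
        for (a2, b2), c2 in s2.items():
            key = (a1 + a2, b1 + b2)
            out[key] = out.get(key, 0) + c1 * c2
    return out

def compose(p1, p2):
    """Symbol of p1(x,D) p2(x,D) from (3.17); the sum is finite for polynomial symbols."""
    total = {}
    left, right = dict(p1), dict(p2)
    k = 0
    while left and right:
        coeff = (-1j) ** k / factorial(k)       # i^k D_xi^k D_x^k = (-i)^k d_xi^k d_x^k
        for key, c in multiply(left, right).items():
            total[key] = total.get(key, 0) + coeff * c
        left, right = d_dxi(left), d_dx(right)  # next term: one more derivative on each factor
        k += 1
    return {key: c for key, c in total.items() if abs(c) > 1e-12}

def commutator(p1, p2):
    """Symbol of [p1(x,D), p2(x,D)]."""
    out = dict(compose(p1, p2))
    for key, c in compose(p2, p1).items():
        out[key] = out.get(key, 0) - c
    return {key: c for key, c in out.items() if abs(c) > 1e-12}

xi_symbol = {(0, 1): 1}   # the operator D
x_symbol = {(1, 0): 1}    # multiplication by x
print(compose(xi_symbol, x_symbol))      # symbol of D x: x*xi - i
print(commutator(xi_symbol, x_symbol))   # constant symbol -i = (1/i){xi, x}
```

Here $\{\xi, x\} = 1$, so the output agrees with (3.25); for these symbols there are no lower-order corrections.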


Exercises 


1. Writing $a_j(x, D)$ in the form (1.10), that is,

(3.27)     $a_j(x, D) = \int \hat a_j(q, p)\, e^{iq\cdot X} e^{ip\cdot D}\, dq\, dp,$

use the formula (1.11) for $e^{ip\cdot D} e^{iq'\cdot X}$ to express $a_1(x, D)\, a_2(x, D)$ as a $4n$-fold
integral. Show that it gives (3.20).
2. If $Q(x, x)$ is any nondegenerate, symmetric, bilinear form on $\mathbb{R}^n$, calculate the kernel
$K_Q(x, y, t)$ for which

(3.28)     $e^{itQ(D,D)} u(x) = \int_{\mathbb{R}^n} K_Q(x, y, t)\, u(y)\, dy.$

In case $x \in \mathbb{R}^n$ is replaced by $(x, \xi) \in \mathbb{R}^{2n}$, use this to verify (3.5).
(Hint: Diagonalize $Q$ and recall the treatment of $e^{it\Delta}$ in (6.42) of Chap. 3, giving

     $e^{it\Delta} \delta(x) = (4\pi i t)^{-n/2}\, e^{i|x|^2/4t}, \quad x \in \mathbb{R}^n.$

Compare the treatment of the stationary phase method in Appendix B of Chap. 6.)

3. Establish the identity (3.22), used in the proof of Proposition 3.3.
(Hint: The left side of (3.22) is equal to

     $\Bigl( \sum_{\beta + \gamma = \mu} \dfrac{i^{|\beta| - |\gamma|}}{\beta!\, \gamma!} \Bigr) D_\xi^\mu D_x^{\alpha+\mu} p_2(x, \xi),$

so one needs to show that the quantity in parentheses here vanishes if $|\mu| > 0$. To see
this, make an expansion of $(z + w)^\mu$, and set $z = (i, \ldots, i)$, $w = (-i, \ldots, -i)$.)
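The scalar identity behind (3.22) is easy to test numerically. The following sketch evaluates $\sum_{\beta+\gamma=\mu} i^{\beta-\gamma}/(\beta!\,\gamma!)$ for a single variable, where $\mu$ is a nonnegative integer; the function name is a choice made here. It is a check of the statement, not a substitute for the argument requested in the exercise.

```python
from math import factorial

def bracket_sum(mu: int) -> complex:
    """Sum over beta + gamma = mu of i^(beta - gamma) / (beta! gamma!), in one variable."""
    return sum(
        1j ** (beta - (mu - beta)) / (factorial(beta) * factorial(mu - beta))
        for beta in range(mu + 1)
    )

for mu in range(0, 7):
    # Expect 1 for mu = 0 and (numerically) 0 for every mu >= 1.
    print(mu, abs(bracket_sum(mu)))
```

In one variable the sum equals $(i + i^{-1})^\mu/\mu!$, which vanishes for $\mu \ge 1$ since $i + i^{-1} = 0$; for a multi-index the sum factors into a product of such one-variable sums, so it vanishes as soon as one component of $\mu$ is positive.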


4. Elliptic operators and parametrices

We say $p(x, D) \in OPS^m_{\rho,\delta}$ is elliptic if, for some $r < \infty$,

(4.1)     $|p(x, \xi)^{-1}| \le C \langle\xi\rangle^{-m}$, for $|\xi| \ge r$.

Thus, if $\psi(\xi) \in C^\infty(\mathbb{R}^n)$ is equal to 0 for $|\xi| \le r$, 1 for $|\xi| \ge 2r$, it follows easily
from the chain rule that

(4.2)     $\psi(\xi)\, p(x, \xi)^{-1} = q_0(x, \xi) \in S^{-m}_{\rho,\delta}.$

As long as $0 \le \delta < \rho \le 1$, we can apply Proposition 3.3 to obtain

(4.3)     $q_0(x, D)\, p(x, D) = I + r_0(x, D),$
          $p(x, D)\, q_0(x, D) = I + \tilde r_0(x, D),$

with

(4.4)     $r_0(x, \xi),\ \tilde r_0(x, \xi) \in S^{-(\rho-\delta)}_{\rho,\delta}.$

Using the formal expansion

(4.5)     $I - r_0(x, D) + r_0(x, D)^2 - \cdots \sim I + s(x, D) \in OPS^0_{\rho,\delta},$

and setting $q(x, D) = (I + s(x, D))\, q_0(x, D) \in OPS^{-m}_{\rho,\delta}$, we have

(4.6)     $q(x, D)\, p(x, D) = I + r(x, D), \qquad r(x, \xi) \in S^{-\infty}.$

Similarly, we obtain $\tilde q(x, D) \in OPS^{-m}_{\rho,\delta}$ satisfying

(4.7)     $p(x, D)\, \tilde q(x, D) = I + \tilde r(x, D), \qquad \tilde r(x, \xi) \in S^{-\infty}.$

But evaluating

(4.8)     $\bigl( q(x, D)\, p(x, D) \bigr)\, \tilde q(x, D) = q(x, D)\, \bigl( p(x, D)\, \tilde q(x, D) \bigr)$

yields $q(x, D) = \tilde q(x, D)$ mod $OPS^{-\infty}$, so in fact

(4.9)     $q(x, D)\, p(x, D) = I \mod OPS^{-\infty},$
          $p(x, D)\, q(x, D) = I \mod OPS^{-\infty}.$

We say that $q(x, D)$ is a two-sided parametrix for $p(x, D)$.
The parametrix can establish the local regularity of a solution to

(4.10)     $p(x, D)u = f.$

Suppose $u, f \in \mathcal{S}'(\mathbb{R}^n)$ and $p(x, D) \in OPS^m_{\rho,\delta}$ is elliptic, with $0 \le \delta < \rho \le 1$.
Constructing $q(x, D) \in OPS^{-m}_{\rho,\delta}$ as in (4.6), we have

(4.11)     $u = q(x, D)f - r(x, D)u.$

Now a simple analysis parallel to (1.7) implies that

(4.12)     $R \in OPS^{-\infty} \Longrightarrow R : \mathcal{E}' \longrightarrow \mathcal{S}.$

By duality, since taking adjoints preserves $OPS^{-\infty}$,

(4.13)     $R \in OPS^{-\infty} \Longrightarrow R : \mathcal{S}' \longrightarrow C^\infty.$

Thus (4.11) implies

(4.14)     $u = q(x, D)f \mod C^\infty.$

Applying the pseudolocal property to (4.10) and (4.14), we have the following
elliptic regularity result.

Proposition 4.1. If $p(x, D) \in OPS^m_{\rho,\delta}$ is elliptic and $0 \le \delta < \rho \le 1$, then, for
any $u \in \mathcal{S}'(\mathbb{R}^n)$,

(4.15)     sing supp $p(x, D)u$ = sing supp $u$.

More refined elliptic regularity involves keeping track of Sobolev space
regularity. As we have the parametrix, this will follow simply from mapping properties
of pseudodifferential operators, to be established in subsequent sections.


Exercises 


1. Give the details of the implication (4.1) $\Rightarrow$ (4.2) when $p(x,\xi) \in S^m_{\rho,\delta}$, $0 \le \delta < \rho \le 1$.
Include the case where $p(x,\xi)$ is a $k \times k$ matrix-valued function, using such identities
as

$\frac{\partial}{\partial x_j}\, p(x,\xi)^{-1} = -p(x,\xi)^{-1}\, \frac{\partial p}{\partial x_j}\, p(x,\xi)^{-1}$.

2. On $\mathbb{R} \times \mathbb{R}^n$, consider the operator $P = \partial/\partial t - L(x,D_x)$, where

$L(x,D_x)u = \sum a_{jk}(x)\,\partial_j\partial_k u + \sum b_j(x)\,\partial_j u + c(x)u$.

Assume that the coefficients are smooth and bounded, with all their derivatives, and that
$L$ satisfies the strong ellipticity condition

$-L_2(x,\xi) = \sum a_{jk}(x)\xi_j\xi_k \ge C|\xi|^2$, $C > 0$.

Show that

$(i\tau - L_2(x,\xi) + 1)^{-1} = E(t,x,\tau,\xi) \in S^{-1}_{1/2,0}$.

Show that $E(t,x,D)P = A_1(t,x,D)$ and $PE(t,x,D) = A_2(t,x,D)$, where
$A_j \in OPS^0_{1/2,0}$ are elliptic. Then, using Proposition 4.1, construct a parametrix for
$P$, belonging to $OPS^{-1}_{1/2,0}$.

3. Assume $-n < m < 0$, and suppose $P = p(x,D) \in OPS^m$ has Schwartz kernel
$K(x,y) = L(x, x-y)$. Suppose that, at $x_0 \in \mathbb{R}^n$,

$L(x_0,z) \sim a|z|^{-m-n} + \cdots$, $z \to 0$,

with $a \ne 0$, the remainder terms being progressively smoother. Show that

$p_m(x_0,\xi) = b|\xi|^m$, $b \ne 0$,

and hence that $P$ is elliptic near $x_0$.

4. Let $P = (P_{jk})$ be a $K \times K$ matrix of operators in $OPS^*$. It is said to be "elliptic in
the sense of Douglis and Nirenberg" if there are numbers $a_j, b_j$, $1 \le j \le K$, such that
$P_{jk} \in OPS^{a_j+b_k}$ and the matrix of principal symbols has nonvanishing determinant
(homogeneous of order $\sum(a_j + b_j)$), for $\xi \ne 0$. If $\Lambda^s$ is as in (1.17), let $A$ be a $K \times K$
diagonal matrix with diagonal entries $\Lambda^{-a_j}$, and let $B$ be diagonal, with entries $\Lambda^{-b_j}$.
Show that this "DN-ellipticity" of $P$ is equivalent to the ellipticity of $APB$ in $OPS^0$.


5. $L^2$-estimates


Here we want to obtain $L^2$-estimates for pseudodifferential operators. The following
simple basic estimate will get us started.


Proposition 5.1. Let $(X,\mu)$ be a measure space. Suppose $k(x,y)$ is measurable
on $X \times X$ and

(5.1) $\int_X |k(x,y)|\,d\mu(x) \le C_1$, $\int_X |k(x,y)|\,d\mu(y) \le C_2$,

for all $y$ and $x$, respectively. Then

(5.2) $Tu(x) = \int_X k(x,y)u(y)\,d\mu(y)$

satisfies

(5.3) $\|Tu\|_{L^p} \le C_1^{1/p} C_2^{1/q} \|u\|_{L^p}$,

for $p \in [1,\infty]$, with

(5.4) $\frac{1}{p} + \frac{1}{q} = 1$.


This is proved in Appendix A on functional analysis; see Proposition 5.1 there. 
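As an informal check of Proposition 5.1, one can take $X$ to be a finite set with counting measure, so that $k$ is a matrix and (5.1) reads as bounds on column and row sums. The sketch below assumes NumPy; the function name `schur_bound` and the random test matrix are illustrative choices, not part of the text.

```python
import numpy as np

def schur_bound(K, p):
    """Right side of (5.3) for a matrix kernel K with counting measure.

    Rows are indexed by x and columns by y, so (Tu)(x) = sum_y K[x, y] u[y].
    """
    absK = np.abs(K)
    C1 = absK.sum(axis=0).max()   # sup over y of sum_x |k(x, y)|
    C2 = absK.sum(axis=1).max()   # sup over x of sum_y |k(x, y)|
    if np.isinf(p):
        return C2                 # 1/p = 0 and 1/q = 1
    return C1 ** (1.0 / p) * C2 ** (1.0 - 1.0 / p)

rng = np.random.default_rng(0)
K = rng.standard_normal((40, 60))

op_norm_2 = np.linalg.norm(K, 2)         # largest singular value
op_norm_1 = np.linalg.norm(K, 1)         # maximum absolute column sum
op_norm_inf = np.linalg.norm(K, np.inf)  # maximum absolute row sum
bound_2 = schur_bound(K, 2.0)
```

In this finite setting the endpoint cases $p = 1$ and $p = \infty$ hold with equality, while for $p = 2$ the bound $\sqrt{C_1 C_2}$ is in general strictly larger than the operator norm.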


To apply this result when $X = \mathbb{R}^n$ and $k = K$ is the Schwartz kernel of $p(x,D) \in OPS^m_{\rho,\delta}$,
note from the proof of Proposition 2.1 that

(5.5) $|K(x,y)| \le C_N |x-y|^{-N}$, for $|x-y| \ge 1$

as long as $\rho > 0$, while

(5.6) $|K(x,y)| \le C|x-y|^{-(n-1)}$, for $|x-y| \le 1$

as long as $m < -n + \rho(n-1)$. (Recall that this last estimate is actually rather
crude.) Hence we have the following preliminary result.


Lemma 5.2. If $p(x,D) \in OPS^m_{\rho,\delta}$, $\rho > 0$, and $m < -n + \rho(n-1)$, then

(5.7) $p(x,D) : L^p(\mathbb{R}^n) \to L^p(\mathbb{R}^n)$, $1 \le p \le \infty$.

If $p(x,D) \in OPS^m_{1,\delta}$, then (5.7) holds for $m < 0$.


The last observation follows from the improvement of (5.6) given in (2.5).
Our main goal in this section is to prove the following. 


Theorem 5.3. If $p(x,D) \in OPS^0_{\rho,\delta}$ and $0 \le \delta < \rho \le 1$, then

(5.8) $p(x,D) : L^2(\mathbb{R}^n) \to L^2(\mathbb{R}^n)$.

The proof we give, following [Ho5], begins with the following result.

Lemma 5.4. If $p(x,D) \in OPS^{-a}_{\rho,\delta}$, $0 \le \delta < \rho \le 1$, and $a > 0$, then (5.8) holds.

Proof. Since $\|Pu\|^2_{L^2} = (P^*Pu, u)$, it suffices to prove that some power of
$p(x,D)^*p(x,D) = Q$ is bounded on $L^2$. But $Q^k \in OPS^{-2ka}_{\rho,\delta}$, so for $k$ large
enough this follows from Lemma 5.2.
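The reduction to a power of $Q$ is stated briefly. One way to fill it in, assuming $u$ is a Schwartz function so that every pairing below is defined, and using that $Q = P^*P$ is symmetric, is the following sketch (the constant $B$ and the index $j$ are notation introduced here):

```latex
\begin{align*}
\|Q^{2^{j-1}}u\|_{L^2}^2 &= (Q^{2^{j}}u, u) \le \|Q^{2^{j}}u\|_{L^2}\,\|u\|_{L^2}, \\
\|Q^{2^{j}}\|_{\mathcal{L}(L^2)} \le B
  \;&\Longrightarrow\; \|Q^{2^{j-1}}\|_{\mathcal{L}(L^2)} \le B^{1/2}
  \;\Longrightarrow\; \cdots
  \;\Longrightarrow\; \|Q\|_{\mathcal{L}(L^2)} \le B^{2^{-j}}, \\
\|Pu\|_{L^2}^2 &= (Qu, u) \le B^{2^{-j}}\,\|u\|_{L^2}^2 .
\end{align*}
```

Here $j$ is chosen so large that $Q^{2^j} \in OPS^{-2^{j+1}a}_{\rho,\delta}$ satisfies the order condition of Lemma 5.2.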


To proceed with the proof of Theorem 5.3, set $q(x,D) = p(x,D)^*p(x,D)
\in OPS^0_{\rho,\delta}$, and suppose $|q(x,\xi)| \le M - b$, $b > 0$, so

(5.9) $M - \mathrm{Re}\, q(x,\xi) \ge b > 0$.

In the matrix case, take $\mathrm{Re}\, q(x,\xi) = (1/2)\bigl(q(x,\xi) + q(x,\xi)^*\bigr)$. It follows that

(5.10) $A(x,\xi) = \bigl(M - \mathrm{Re}\, q(x,\xi)\bigr)^{1/2} \in S^0_{\rho,\delta}$

and

(5.11) $A(x,D)^*A(x,D) = M - q(x,D) + r(x,D)$, $r(x,D) \in OPS^{-(\rho-\delta)}_{\rho,\delta}$.

Applying Lemma 5.4 to $r(x,D)$, we have

(5.12) $M\|u\|^2_{L^2} - \|p(x,D)u\|^2_{L^2} = \|A(x,D)u\|^2_{L^2} - (r(x,D)u, u) \ge -C\|u\|^2_{L^2}$,

or

(5.13) $\|p(x,D)u\|^2_{L^2} \le (M + C)\|u\|^2_{L^2}$,

finishing the proof.

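From these $L^2$-estimates easily follow $L^2$-Sobolev space estimates. Recall
from Chap. 4 that the Sobolev space $H^s(\mathbb{R}^n)$ is defined as

(5.14) $H^s(\mathbb{R}^n) = \{u \in \mathcal{S}'(\mathbb{R}^n) : \langle\xi\rangle^s \hat u(\xi) \in L^2(\mathbb{R}^n)\}$.

Equivalently, with

(5.15) $\Lambda^s u = \int \langle\xi\rangle^s \hat u(\xi)\, e^{ix\cdot\xi}\, d\xi$; $\Lambda^s \in OPS^s$,

we have

(5.16) $H^s(\mathbb{R}^n) = \Lambda^{-s}L^2(\mathbb{R}^n)$.

The operator calculus easily gives the next proposition:

Proposition 5.5. If $p(x,D) \in OPS^m_{\rho,\delta}$, $0 \le \delta < \rho \le 1$, $m, s \in \mathbb{R}$, then

(5.17) $p(x,D) : H^s(\mathbb{R}^n) \to H^{s-m}(\mathbb{R}^n)$.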
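A quick numerical illustration of (5.14)-(5.17) is possible in the simplest situation of symbols independent of $x$. The sketch below is an assumption-laden toy model: it works on a periodic grid (the circle rather than $\mathbb{R}^n$), uses NumPy's FFT, and the helper names `sobolev_norm` and `apply_multiplier` are invented for this illustration.

```python
import numpy as np

def frequencies(n):
    """Integer Fourier frequencies for an n-point periodic grid."""
    return np.fft.fftfreq(n, d=1.0 / n)

def sobolev_norm(u, s):
    """Discrete analogue of (5.14): the l^2 norm of <k>^s * u_hat(k)."""
    k = frequencies(u.size)
    u_hat = np.fft.fft(u) / u.size
    return np.sqrt(np.sum((1.0 + k ** 2) ** s * np.abs(u_hat) ** 2))

def apply_multiplier(u, symbol):
    """Apply p(D) for a symbol p(xi) that does not depend on x."""
    k = frequencies(u.size)
    return np.fft.ifft(symbol(k) * np.fft.fft(u))

rng = np.random.default_rng(0)
u = rng.standard_normal(128)
s = 1.5

# D = -i d/dx has symbol xi, so it has order m = 1.
Du = apply_multiplier(u, lambda k: k)
# Lambda^{-s} has symbol <xi>^{-s} = (1 + xi^2)^{-s/2}.
Lu = apply_multiplier(u, lambda k: (1.0 + k ** 2) ** (-s / 2))
```

In this model $\|Du\|_{H^{s-1}} \le \|u\|_{H^s}$ because $|k|\langle k\rangle^{s-1} \le \langle k\rangle^s$, and $\|\Lambda^{-s}u\|_{H^s} = \|u\|_{L^2}$, which mirrors (5.16). The content of Proposition 5.5 is that the same mapping property survives for $x$-dependent symbols.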


Given Proposition 5.5, one easily obtains the Sobolev regularity of solutions to 
the elliptic equations studied in §4. 

Calderon and Vaillancourt sharpened Theorem 5.3, showing that

(5.18) $p(x,\xi) \in S^0_{\rho,\rho}$, $0 \le \rho < 1 \Longrightarrow p(x,D) : L^2(\mathbb{R}^n) \to L^2(\mathbb{R}^n)$.

This result, particularly for $\rho = 1/2$, has played an important role in linear PDE,
especially in the study of subelliptic operators, but it will not be used in this book.
The case $\rho = 0$ is treated in the exercises below.

Another important extension of Theorem 5.3 is that $p(x,D)$ is bounded on
$L^p(\mathbb{R}^n)$, for $1 < p < \infty$, when $p(x,\xi) \in S^0_{1,\delta}$. Similarly, Proposition 5.5 extends
to a result on $L^p$-Sobolev spaces, in the case $\rho = 1$. This is important for applications
to nonlinear PDE, and will be proved in Chap. 13.


Exercises


Exercises 1-7 present an approach to a proof of the Calderon-Vaillancourt theorem,
(5.18), in the case $\rho = 0$. This approach is due to H. O. Cordes [Cor]; see also T. Kato
[K] and R. Howe [How]. In these exercises, we assume that $U(y)$ is a (measurable)
unitary, operator-valued function on a measure space $Y$, operating on a Hilbert space $H$.
Assume that, for $f, g \in V$, a dense subset of $H$,

(5.19) $\int_Y |(U(y)f, g)|^2\, dm(y) = C_0 \|f\|^2 \|g\|^2$.


1. Let $\varphi_0 \in H$ be a unit vector, and set $\varphi_y = U(y)\varphi_0$. Show that, for any $T \in \mathcal{L}(H)$,

(5.20) $C_0^2\,(Tf_1, f_2) = \iint L_T(y,y')\,(f_1, \varphi_{y'})\,(\varphi_y, f_2)\, dm(y)\, dm(y')$,

where

(5.21) $L_T(y,y') = (T\varphi_{y'}, \varphi_y)$.

(Hint: Start by showing that $\int (f_1, \varphi_y)(\varphi_y, f_2)\, dm(y) = C_0 (f_1, f_2)$.)
A statement equivalent to (5.20) is

(5.22) $C_0^2\, T = \iint L_T(y,y')\, U(y)\Phi_0 U(y')^*\, dm(y)\, dm(y')$,

where $\Phi_0$ is the orthogonal projection of $H$ onto the span of $\varphi_0$.
2. For a partial converse, suppose $L$ is measurable on $Y \times Y$ and

(5.23) $\int |L(y,y')|\, dm(y) \le C_1$, $\int |L(y,y')|\, dm(y') \le C_1$.

Define

(5.24) $T_L = \iint L(y,y')\, U(y)\Phi_0 U(y')^*\, dm(y)\, dm(y')$.

Show that the operator norm of $T_L$ on $H$ has the estimate

$\|T_L\| \le C_0 C_1$.


3. If $G$ is a trace class operator, and we set

(5.25) $T_{L,G} = \iint L(y,y')\, U(y)GU(y')^*\, dm(y)\, dm(y')$,

show that

(5.26) $\|T_{L,G}\| \le C_0 C_1 \|G\|_{TR}$.

(Hint: In case $G = G^*$, diagonalize $G$ and use Exercise 2.)

4. Suppose $b \in L^\infty(Y)$ and we set

(5.27) $T_b = \int b(y)\, U(y)GU(y)^*\, dm(y)$.

Show that

(5.28) $\|T_b\| \le C_0 \|b\|_{L^\infty} \|G\|_{TR}$.

5. Let $Y = \mathbb{R}^{2n}$, with Lebesgue measure, $y = (q,p)$. Set $U(y) = e^{iq\cdot X} e^{ip\cdot D} =
\tilde\pi(0,q,p)$, as in Exercises 1 and 2 of §1. Show that the identity (5.19) holds, for
$f, g \in L^2(\mathbb{R}^n) = H$, with $C_0 = (2\pi)^{-n}$. (Hint: Make use of the Plancherel theorem.)

6. Deduce that if $a(x,D)$ is a trace class operator,

(5.29) $\|(b * a)(x,D)\|_{\mathcal{L}(L^2)} \le C \|b\|_{L^\infty} \|a(x,D)\|_{TR}$.

(Hint: Look at Exercises 3-4 of §1.)

7. Suppose $p(x,\xi) \in S^0_{0,0}$. Set

(5.30) $a(x,\xi) = \psi(x)\psi(\xi)$, $b(x,\xi) = (1 - \Delta_x)^k (1 - \Delta_\xi)^k p(x,\xi)$,

where $k$ is a positive integer, $\hat\psi(\xi) = \langle\xi\rangle^{-2k}$. Show that if $k$ is chosen large enough,
then $a(x,D)$ is trace class. Note that, for all $k \in \mathbb{Z}^+$, $b \in L^\infty(\mathbb{R}^{2n})$, provided $p \in S^0_{0,0}$.
Show that

(5.31) $p(x,D) = (b * a)(x,D)$,

and deduce the $\rho = 0$ case of the Calderon-Vaillancourt estimate (5.18).

8. Sharpen the results of problems 3-4 above, showing that

(5.32) $\|T_{L,G}\|_{\mathcal{L}(H)} \le C_0 \|L\|_{\mathcal{L}(L^2(Y))} \|G\|_{TR}$.

This is stronger than (5.26) in view of Proposition 5.1.


6. Gårding's inequality


In this section we establish a fundamental estimate, first obtained by L. Gårding
in the case of differential operators.


Theorem 6.1. Assume $p(x,D) \in OPS^m_{\rho,\delta}$, $0 \le \delta < \rho \le 1$, and

(6.1) $\mathrm{Re}\, p(x,\xi) \ge C|\xi|^m$, for $|\xi|$ large.

Then, for any $s \in \mathbb{R}$, there are $C_0, C_1$ such that, for $u \in H^{m/2}(\mathbb{R}^n)$,

(6.2) $\mathrm{Re}\,(p(x,D)u, u) \ge C_0\|u\|^2_{H^{m/2}} - C_1\|u\|^2_{H^s}$.


Proof. Replacing $p(x,D)$ by $\Lambda^{-m/2}p(x,D)\Lambda^{-m/2}$, we can suppose without loss
of generality that $m = 0$. Then, as in the proof of Theorem 5.3, take

(6.3) $A(x,\xi) = \bigl(\mathrm{Re}\, p(x,\xi) - \tfrac{1}{2}C\bigr)^{1/2} \in S^0_{\rho,\delta}$,

so

(6.4) $A(x,D)^*A(x,D) = \mathrm{Re}\, p(x,D) - \tfrac{1}{2}C + r(x,D)$,
      $r(x,D) \in OPS^{-(\rho-\delta)}_{\rho,\delta}$.

This gives

(6.5) $\mathrm{Re}\,(p(x,D)u, u) = \|A(x,D)u\|^2_{L^2} + \tfrac{1}{2}C\|u\|^2_{L^2} - (r(x,D)u, u)$
      $\ge \tfrac{1}{2}C\|u\|^2_{L^2} - C_1\|u\|^2_{H^s}$

with $s = -(\rho-\delta)/2$, so (6.2) holds in this case. If $s < -(\rho-\delta)/2 = s_0$, use the
simple estimate

(6.6) $\|u\|^2_{H^{s_0}} \le \varepsilon\|u\|^2_{L^2} + C(\varepsilon)\|u\|^2_{H^s}$

to obtain the desired result in this case.
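The estimate (6.6) reduces to a pointwise inequality for the weights $\langle\xi\rangle^{2s_0}$ and $\langle\xi\rangle^{2s}$. A sketch of that step, assuming $s < s_0 < 0$ and $\varepsilon > 0$ (the explicit value of $C(\varepsilon)$ is one admissible choice, not taken from the text):

```latex
\begin{align*}
\langle\xi\rangle^{2s_0} \le \varepsilon
  \quad &\text{if } \langle\xi\rangle \ge \varepsilon^{1/(2s_0)}, \\
\langle\xi\rangle^{2s_0} = \langle\xi\rangle^{2(s_0-s)}\,\langle\xi\rangle^{2s}
  \le \varepsilon^{(s_0-s)/s_0}\,\langle\xi\rangle^{2s}
  \quad &\text{if } \langle\xi\rangle \le \varepsilon^{1/(2s_0)}, \\
\text{hence}\quad \langle\xi\rangle^{2s_0} \le \varepsilon + C(\varepsilon)\,\langle\xi\rangle^{2s},
  \quad & C(\varepsilon) = \varepsilon^{(s_0-s)/s_0}.
\end{align*}
```

Multiplying by $|\hat u(\xi)|^2$ and integrating in $\xi$ gives (6.6). Note that the exponent $(s_0-s)/s_0$ is negative, so $C(\varepsilon)$ grows as $\varepsilon \to 0$.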


This Gårding inequality has been improved to a sharp Gårding inequality, of
the form

(6.7) $\mathrm{Re}\,(p(x,D)u, u) \ge -C\|u\|^2_{L^2}$ when $\mathrm{Re}\, p(x,\xi) \ge 0$,

first for scalar $p(x,\xi) \in S^1_{1,0}$, by Hörmander, then for matrix-valued symbols,
with $\mathrm{Re}\, p(x,\xi)$ standing for $(1/2)\bigl(p(x,\xi) + p(x,\xi)^*\bigr)$, by P. Lax and L. Nirenberg.
Proofs and some implications can be found in Vol. 3 of [Ho5], and in [T1] and
[Tre]. A very strong improvement due to C. Fefferman and D. Phong [FP] is that
(6.7) holds for scalar $p(x,\xi) \in S^2_{1,0}$. See also [Ho5] and [F] for further discussion,
and [T6] for an application to wave propagation.


Exercises 


1. Suppose $m > 0$ and $p(x,D) \in OPS^m_{1,0}$ has a symbol satisfying (6.1). Examine the
solvability of

$\frac{\partial u}{\partial t} = p(x,D)u$,

for $u = u(t,x)$, $u(0,x) = f \in H^s(\mathbb{R}^n)$.
(Hint: Look ahead at §7 for some useful techniques. Solve

$\frac{\partial u_\varepsilon}{\partial t} = J_\varepsilon p(x,D) J_\varepsilon u_\varepsilon$

and estimate $(d/dt)\|\Lambda^s u_\varepsilon(t)\|^2_{L^2}$, making use of Gårding's inequality.)


7. Hyperbolic evolution equations 


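In this section we examine first-order systems of the form

(7.1) $\frac{\partial u}{\partial t} = L(t,x,D_x)u + g(t,x)$, $u(0) = f$.

We assume $L(t,x,\xi) \in S^1_{1,0}$, with smooth dependence on $t$, so

(7.2) $|D_t^j D_x^\beta D_\xi^\alpha L(t,x,\xi)| \le C_{j\alpha\beta}\langle\xi\rangle^{1-|\alpha|}$.

Here $L(t,x,\xi)$ is a $K \times K$ matrix-valued function, and we make the hypothesis
of symmetric hyperbolicity:

(7.3) $L(t,x,\xi)^* + L(t,x,\xi) \in S^0_{1,0}$.

We suppose $f \in H^s(\mathbb{R}^n)$, $s \in \mathbb{R}$, $g \in C(\mathbb{R}, H^s(\mathbb{R}^n))$.
Our strategy will be to obtain a solution to (7.1) as a limit of solutions $u_\varepsilon$ to

(7.4) $\frac{\partial u_\varepsilon}{\partial t} = J_\varepsilon L J_\varepsilon u_\varepsilon + g$, $u_\varepsilon(0) = f$,

where

(7.5) $J_\varepsilon = \varphi(\varepsilon D_x)$,

for some $\varphi(\xi) \in \mathcal{S}(\mathbb{R}^n)$, $\varphi(0) = 1$. The family of operators $J_\varepsilon$ is called a
Friedrichs mollifier. Note that, for any $\varepsilon > 0$, $J_\varepsilon \in OPS^{-\infty}$, while, for $\varepsilon \in (0,1]$,
$J_\varepsilon$ is bounded in $OPS^0_{1,0}$.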
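The two properties of $J_\varepsilon$ used later, uniform boundedness in $\varepsilon$ and convergence to the identity as $\varepsilon \to 0$, are easy to observe numerically. The following sketch makes several assumptions that are not in the text: it works on a periodic grid instead of $\mathbb{R}^n$, picks the particular choice $\varphi(\xi) = e^{-\xi^2}$, and uses a hypothetical helper name `friedrichs_mollifier`.

```python
import numpy as np

def friedrichs_mollifier(f, eps):
    """Apply J_eps = phi(eps * D) with phi(xi) = exp(-xi^2) on a periodic grid."""
    n = f.size
    k = np.fft.fftfreq(n, d=1.0 / n)     # integer frequencies
    phi = np.exp(-(eps * k) ** 2)        # phi(0) = 1 and 0 < phi <= 1
    return np.fft.ifft(phi * np.fft.fft(f)).real

def l2(v):
    """Discrete L^2 norm on the grid (root mean square)."""
    return np.sqrt(np.mean(v ** 2))

n = 256
x = 2 * np.pi * np.arange(n) / n
f = np.sin(3 * x) + 0.5 * np.cos(7 * x)

eps_values = (0.5, 0.1, 0.02)
errors = [l2(friedrichs_mollifier(f, eps) - f) for eps in eps_values]
norms = [l2(friedrichs_mollifier(f, eps)) for eps in eps_values]
```

Because $0 < \varphi \le 1$, the mollified function never has larger $L^2$ norm than $f$, and the error $\|J_\varepsilon f - f\|_{L^2}$ shrinks as $\varepsilon$ decreases.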

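For any $\varepsilon > 0$, $J_\varepsilon L J_\varepsilon$ is a bounded linear operator on each $H^s$, and solvability
of (7.4) is elementary. Our next task is to obtain estimates on $u_\varepsilon$, independent of
$\varepsilon \in (0,1]$. Use the norm $\|u\|_{H^s} = \|\Lambda^s u\|_{L^2}$. We derive an estimate for

(7.6) $\frac{d}{dt}\|\Lambda^s u_\varepsilon(t)\|^2_{L^2} = 2\,\mathrm{Re}\,(\Lambda^s J_\varepsilon L J_\varepsilon u_\varepsilon, \Lambda^s u_\varepsilon) + 2\,\mathrm{Re}\,(\Lambda^s g, \Lambda^s u_\varepsilon)$.

Write the first term on the right as the real part of

(7.7) $2(L\Lambda^s J_\varepsilon u_\varepsilon, \Lambda^s J_\varepsilon u_\varepsilon) + 2([\Lambda^s, L] J_\varepsilon u_\varepsilon, \Lambda^s J_\varepsilon u_\varepsilon)$.

By (7.3), $L + L^* = B(t,x,D) \in OPS^0_{1,0}$, so the real part of the first term in (7.7) is equal to

(7.8) $(B(t,x,D)\Lambda^s J_\varepsilon u_\varepsilon, \Lambda^s J_\varepsilon u_\varepsilon) \le C\|J_\varepsilon u_\varepsilon\|^2_{H^s}$.

Meanwhile, $[\Lambda^s, L] \in OPS^s_{1,0}$, so the second term in (7.7) is also bounded by the
right side of (7.8). Applying Cauchy's inequality to $2(\Lambda^s g, \Lambda^s u_\varepsilon)$, we obtain

(7.9) $\frac{d}{dt}\|\Lambda^s u_\varepsilon(t)\|^2_{L^2} \le C\|\Lambda^s u_\varepsilon(t)\|^2_{L^2} + C\|g(t)\|^2_{H^s}$.

Thus Gronwall's inequality yields an estimate

(7.10) $\|u_\varepsilon(t)\|^2_{H^s} \le C(t)\bigl[\|f\|^2_{H^s} + \|g\|^2_{C([0,t],H^s)}\bigr]$,

independent of $\varepsilon \in (0,1]$. We are now prepared to establish the following existence
result.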


Proposition 7.1. If (7.1) is symmetric hyperbolic and

$f \in H^s(\mathbb{R}^n)$, $g \in C(\mathbb{R}, H^s(\mathbb{R}^n))$, $s \in \mathbb{R}$,

then there is a solution $u$ to (7.1), satisfying

(7.11) $u \in L^\infty_{loc}(\mathbb{R}, H^s(\mathbb{R}^n)) \cap \mathrm{Lip}(\mathbb{R}, H^{s-1}(\mathbb{R}^n))$.

Proof. Take $I = [-T, T]$. The bounded family

$u_\varepsilon \in C(I, H^s) \cap C^1(I, H^{s-1})$

will have a weak limit point $u$ satisfying (7.11), and it is easy to verify that such $u$
solves (7.1). As for the bound on $[-T, 0]$, this follows from the invariance of the
class of hyperbolic equations under time reversal.
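The Gronwall step leading from (7.9) to (7.10), which supplies the uniform bound used in this proof, is the standard integrating-factor argument. A sketch for $t \ge 0$, writing $y(t) = \|\Lambda^s u_\varepsilon(t)\|^2_{L^2}$ (a shorthand introduced here) and $C$ for the constant in (7.9):

```latex
\begin{align*}
\frac{d}{dt}\bigl(e^{-Ct}y(t)\bigr) &= e^{-Ct}\bigl(y'(t) - Cy(t)\bigr)
  \le C e^{-Ct}\,\|g(t)\|_{H^s}^2, \\
y(t) &\le e^{Ct}y(0) + C\int_0^t e^{C(t-\tau)}\,\|g(\tau)\|_{H^s}^2\,d\tau \\
  &\le e^{Ct}\Bigl[\|f\|_{H^s}^2 + Ct \sup_{0\le\tau\le t}\|g(\tau)\|_{H^s}^2\Bigr].
\end{align*}
```

This gives (7.10) with, for instance, $C(t) = e^{Ct}\max(1, Ct)$, and the constant does not depend on $\varepsilon$ because the constant in (7.9) does not.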


Analogous energy estimates can establish the uniqueness of such a solution $u$
and rates of convergence of $u_\varepsilon \to u$ as $\varepsilon \to 0$. Also, (7.11) can be improved to

(7.12) $u \in C(\mathbb{R}, H^s(\mathbb{R}^n)) \cap C^1(\mathbb{R}, H^{s-1}(\mathbb{R}^n))$.

To see this, let $f_j \in H^{s+1}$, $f_j \to f$ in $H^s$, and let $u_j$ solve (7.1) with $u_j(0) =
f_j$. Then each $u_j$ belongs to $L^\infty_{loc}(\mathbb{R}, H^{s+1}) \cap \mathrm{Lip}(\mathbb{R}, H^s)$, so in particular each
$u_j \in C(\mathbb{R}, H^s)$. Now $v_j = u - u_j$ solves (7.1) (with $g$ replaced by 0) with $v_j(0) = f - f_j$, and
$\|f - f_j\|_{H^s} \to 0$ as $j \to \infty$, so estimates arising in the proof of Proposition 7.1
imply that $\|v_j(t)\|_{H^s} \to 0$ locally uniformly in $t$, giving $u \in C(\mathbb{R}, H^s)$.

There are other notions of hyperbolicity. In particular, (7.1) is said to be symmetrizable
hyperbolic if there is a $K \times K$ matrix-valued $S(t,x,\xi) \in S^0_{1,0}$ that
is positive-definite and such that $S(t,x,\xi)L(t,x,\xi) = \tilde L(t,x,\xi)$ satisfies (7.3).
Proposition 7.1 extends to the case of symmetrizable hyperbolic systems. Again,
one obtains $u$ as a limit of solutions $u_\varepsilon$ to (7.4). There is one extra ingredient in
the energy estimates. In this case, construct $S(t) \in OPS^0_{1,0}$, positive-definite,
with symbol equal to $S(t,x,\xi)$ mod $S^{-1}_{1,0}$. For the energy estimates, replace the
left side of (7.6) by

(7.13) $\frac{d}{dt}\bigl(\Lambda^s u_\varepsilon(t), S(t)\Lambda^s u_\varepsilon(t)\bigr)$,

which can be estimated in a fashion similar to (7.7)-(7.9).

A $K \times K$ system of the form (7.1) with $L(t,x,\xi) \in S^1_{cl}$ is said to be strictly
hyperbolic if its principal symbol $L_1(t,x,\xi)$, homogeneous of degree 1 in $\xi$, has
$K$ distinct, purely imaginary eigenvalues, for each $x$ and each $\xi \ne 0$. The results
above apply in this case, in view of:


Proposition 7.2. Whenever (7.1) is strictly hyperbolic, it is symmetrizable. 


Proof. If we denote the eigenvalues of $L_1(t,x,\xi)$ by $i\lambda_\nu(t,x,\xi)$, ordered so
that $\lambda_1(t,x,\xi) < \cdots < \lambda_K(t,x,\xi)$, then $\lambda_\nu$ are well-defined $C^\infty$-functions of
$(t,x,\xi)$, homogeneous of degree 1 in $\xi$. If $P_\nu(t,x,\xi)$ are the projections onto the
$i\lambda_\nu$-eigenspaces of $L_1$,

(7.14) $P_\nu(t,x,\xi) = \frac{1}{2\pi i}\int_{\gamma_\nu} \bigl(\zeta - L_1(t,x,\xi)\bigr)^{-1}\, d\zeta$,

where $\gamma_\nu$ is a small circle about $i\lambda_\nu(t,x,\xi)$, then $P_\nu$ is smooth and homogeneous
of degree 0 in $\xi$. Then

(7.15) $S(t,x,\xi) = \sum_j P_j(t,x,\xi)^* P_j(t,x,\xi)$

gives the desired symmetrizer.
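At a fixed point $(t,x,\xi)$ the construction (7.14)-(7.15) is pure linear algebra, and it can be checked numerically. In the sketch below the spectral projections are computed from an eigendecomposition rather than from the contour integral (7.14); the two agree when the eigenvalues are distinct. The matrix `L1`, the values `lam`, and the function name `symmetrizer` are made-up test data, and NumPy is assumed.

```python
import numpy as np

def symmetrizer(L1):
    """Return S = sum_j P_j^* P_j, with P_j the spectral projections of L1.

    Assumes L1 has distinct eigenvalues, so each P_j has rank one.
    """
    _, V = np.linalg.eig(L1)
    W = np.linalg.inv(V)                  # rows of W are dual to the columns of V
    S = np.zeros(L1.shape, dtype=complex)
    for j in range(L1.shape[0]):
        P = np.outer(V[:, j], W[j, :])    # projection onto the j-th eigenspace
        S += P.conj().T @ P
    return S

rng = np.random.default_rng(1)
lam = np.array([-1.0, 0.5, 2.0])          # distinct real lambda_j
V0 = rng.standard_normal((3, 3)) + 1j * rng.standard_normal((3, 3))
L1 = V0 @ np.diag(1j * lam) @ np.linalg.inv(V0)   # eigenvalues are i * lambda_j

S = symmetrizer(L1)
SL = S @ L1
```

Since $P_jP_k = \delta_{jk}P_j$ and $L_1 = \sum_j i\lambda_j P_j$, one finds $SL_1 = \sum_j i\lambda_j P_j^*P_j$, which is skew-adjoint, while $S$ itself is positive-definite. This is the pointwise form of condition (7.3) for $\tilde L = SL$.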


Higher-order, strictly hyperbolic PDE can be reduced to strictly hyperbolic, 
first-order systems of this nature. Thus one has an analysis of solutions to such 
higher-order hyperbolic equations. 


Exercises 


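1. Carry out the reduction of a strictly hyperbolic PDE of order $m$ to a first-order system
of the form (7.1). Starting with

$Lu = \frac{\partial^m u}{\partial y^m} + \sum_{j=0}^{m-1} A_j(y,x,D_x)\,\frac{\partial^j u}{\partial y^j}$,

where $A_j(y,x,D)$ has order $\le m - j$, form $v = (v_1, \ldots, v_m)^t$ with

$v_1 = \Lambda^{m-1}u, \ \ldots, \ v_j = \partial_y^{j-1}\Lambda^{m-j}u, \ \ldots, \ v_m = \partial_y^{m-1}u$,

to pass from $Lu = f$ to

$\frac{\partial v}{\partial y} = K(y,x,D_x)v + F$,

with $F = (0, \ldots, 0, f)^t$. Give an appropriate definition of strict hyperbolicity in this
context, and show that this first-order system is strictly hyperbolic provided $L$ is.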
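2. Fix $r > 0$. Let $\gamma_r \in \mathcal{E}'(\mathbb{R}^2)$ denote the unit mass density on the circle of radius $r$:

$\langle u, \gamma_r\rangle = \frac{1}{2\pi}\int_{-\pi}^{\pi} u(r\cos\theta, r\sin\theta)\, d\theta$.

Let $\Gamma_r u = \gamma_r * u$. Show that there exist $A_r(\xi) \in S^{-1/2}(\mathbb{R}^2)$ and $B_r(\xi) \in S^{1/2}(\mathbb{R}^2)$,
such that

(7.16) $\Gamma_r = A_r(D)\cos r\sqrt{-\Delta} + B_r(D)\,\frac{\sin r\sqrt{-\Delta}}{\sqrt{-\Delta}}$.

(Hint: See Exercise 1 in §7 of Chap. 6.)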


8. Egorov’s theorem 


We want to examine the behavior of operators obtained by conjugating a pseudodifferential
operator $P_0 \in OPS^m_{1,0}$ by the solution operator to a scalar hyperbolic
equation of the form

(8.1) $\frac{\partial u}{\partial t} = iA(t,x,D_x)u$,

where we assume $A = A_1 + A_0$ with

(8.2) $A_1(t,x,\xi) \in S^1_{cl}$ real, $A_0(t,x,\xi) \in S^0_{cl}$.

We suppose $A_1(t,x,\xi)$ is homogeneous in $\xi$, for $|\xi| \ge 1$. Denote by $S(t,s)$ the
solution operator to (8.1), taking $u(s)$ to $u(t)$. This is a bounded operator on each
Sobolev space $H^\sigma$, with inverse $S(s,t)$. Set

(8.3) $P(t) = S(t,0)P_0S(0,t)$.

We aim to prove the following result of Y. Egorov.
We aim to prove the following result of Y. Egorov. 


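Theorem 8.1. If $P_0 = p_0(x,D) \in OPS^m_{1,0}$, then for each $t$, $P(t) \in OPS^m_{1,0}$,
modulo a smoothing operator. The principal symbol of $P(t)$ (mod $S^{m-1}_{1,0}$) at a
point $(x_0,\xi_0)$ is equal to $p_0(y_0,\eta_0)$, where $(y_0,\eta_0)$ is obtained from $(x_0,\xi_0)$ by
following the flow $C(t)$ generated by the (time-dependent) Hamiltonian vector
field

(8.4) $H_{A_1(t,x,\xi)} = \sum_{j=1}^{n} \left(\frac{\partial A_1}{\partial \xi_j}\frac{\partial}{\partial x_j} - \frac{\partial A_1}{\partial x_j}\frac{\partial}{\partial \xi_j}\right)$.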
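The flow generated by (8.4) can be explored numerically. The sketch below integrates Hamilton's equations for one illustrative, time-independent symbol $A_1(x,\xi) = a(x)|\xi|$ on $\mathbb{R}^2$ with $a(x) = 1 + \tfrac12\sin x_1$; this choice, the RK4 integrator, and all function names are assumptions made for the example. It checks two structural facts: $A_1$ is constant along the flow, and the flow commutes with dilations in $\xi$ because $A_1$ is homogeneous of degree 1.

```python
import numpy as np

def a(x):
    return 1.0 + 0.5 * np.sin(x[0])

def grad_a(x):
    return np.array([0.5 * np.cos(x[0]), 0.0])

def A1(x, xi):
    """Illustrative symbol A_1(x, xi) = a(x)|xi|, homogeneous of degree 1 in xi."""
    return a(x) * np.linalg.norm(xi)

def hamilton_field(x, xi):
    """Components of H_{A_1} as in (8.4): (dA_1/dxi, -dA_1/dx)."""
    r = np.linalg.norm(xi)
    return a(x) * xi / r, -grad_a(x) * r

def flow(x0, xi0, t, steps=1000):
    """Integrate the Hamiltonian flow for time t with the classical RK4 scheme."""
    x, xi = np.array(x0, dtype=float), np.array(xi0, dtype=float)
    h = t / steps
    for _ in range(steps):
        k1x, k1s = hamilton_field(x, xi)
        k2x, k2s = hamilton_field(x + 0.5 * h * k1x, xi + 0.5 * h * k1s)
        k3x, k3s = hamilton_field(x + 0.5 * h * k2x, xi + 0.5 * h * k2s)
        k4x, k4s = hamilton_field(x + h * k3x, xi + h * k3s)
        x = x + h / 6.0 * (k1x + 2 * k2x + 2 * k3x + k4x)
        xi = xi + h / 6.0 * (k1s + 2 * k2s + 2 * k3s + k4s)
    return x, xi

x0, xi0 = [0.3, -0.2], [1.0, 2.0]
y, eta = flow(x0, xi0, 1.0)
y_scaled, eta_scaled = flow(x0, [3.0, 6.0], 1.0)   # same x0, xi0 scaled by 3
```

The dilation invariance is what makes the flow act on conic sets, which is the form in which it is applied to symbols and, later, to wave front sets.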


To start the proof, differentiating (8.3) with respect to $t$ yields

(8.5) $P'(t) = i[A(t,x,D), P(t)]$, $P(0) = P_0$.

We will construct an approximate solution $Q(t)$ to (8.5) and then show that $Q(t) -
P(t)$ is a smoothing operator.


So we are looking for $Q(t) = q(t,x,D) \in OPS^m_{1,0}$, solving

(8.6) $Q'(t) = i[A(t,x,D), Q(t)] + R(t)$, $Q(0) = P_0$,

where $R(t)$ is a smooth family of operators in $OPS^{-\infty}$. We do this by constructing
the symbol $q(t,x,\xi)$ in the form

(8.7) $q(t,x,\xi) \sim q_0(t,x,\xi) + q_1(t,x,\xi) + \cdots$.

Now the symbol of $i[A, Q(t)]$ is of the form

(8.8) $H_{A_1}q + \{A_0, q\} + i\sum_{|\alpha| \ge 2} \frac{i^{|\alpha|}}{\alpha!}\bigl(A^{(\alpha)}q_{(\alpha)} - q^{(\alpha)}A_{(\alpha)}\bigr)$,

where $A^{(\alpha)} = D_\xi^\alpha A$, $A_{(\alpha)} = D_x^\alpha A$, and so on. Since we want the difference
between this and $\partial q/\partial t$ to have order $-\infty$, this suggests defining $q_0(t,x,\xi)$ by

(8.9) $\Bigl(\frac{\partial}{\partial t} - H_{A_1}\Bigr)q_0(t,x,\xi) = 0$, $q_0(0,x,\xi) = p_0(x,\xi)$.

Thus $q_0(t,x_0,\xi_0) = p_0(y_0,\eta_0)$, as in the statement of the theorem; we have
$q_0(t,x,\xi) \in S^m_{1,0}$. Equation (8.9) is called a transport equation. Recursively, we
obtain transport equations

(8.10) $\Bigl(\frac{\partial}{\partial t} - H_{A_1}\Bigr)q_j(t,x,\xi) = b_j(t,x,\xi)$, $q_j(0,x,\xi) = 0$,

for $j \ge 1$, with solutions in $S^{m-j}_{1,0}$, leading to a solution to (8.6).
Finally, we show that $P(t) - Q(t)$ is a smoothing operator. Equivalently, we
show that, for any $f \in H^\sigma(\mathbb{R}^n)$,

(8.11) $v(t) - w(t) = S(t,0)P_0f - Q(t)S(t,0)f \in H^\infty(\mathbb{R}^n)$,

where $H^\infty(\mathbb{R}^n) = \bigcap_s H^s(\mathbb{R}^n)$. Note that

(8.12) $\frac{\partial v}{\partial t} = iA(t,x,D)v$, $v(0) = P_0f$,

while use of (8.6) gives

(8.13) $\frac{\partial w}{\partial t} = iA(t,x,D)w + g$, $w(0) = P_0f$,

where

(8.14) $g = R(t)S(t,0)f \in C^\infty(\mathbb{R}, H^\infty(\mathbb{R}^n))$.

Hence

(8.15) $\frac{\partial}{\partial t}(v - w) = iA(t,x,D)(v - w) - g$, $v(0) - w(0) = 0$.

Thus energy estimates for hyperbolic equations yield $v(t) - w(t) \in H^\infty$, for any
$f \in H^\sigma(\mathbb{R}^n)$, completing the proof.
A check of the proof shows that

(8.16) $P_0 \in OPS^m \Longrightarrow P(t) \in OPS^m$.

Also, the proof readily extends to yield the following:

Proposition 8.2. With $A(t,x,D)$ as before,

(8.17) $P_0 \in OPS^m_{\rho,\delta} \Longrightarrow P(t) \in OPS^m_{\rho,\delta}$,

provided

(8.18) $\rho > \frac{1}{2}$, $\delta = 1 - \rho$.

One needs $\delta = 1 - \rho$ to ensure that $p(C(t)(x,\xi)) \in S^m_{\rho,\delta}$, and one needs $\rho > \delta$
to ensure that the transport equations generate $q_j(t,x,\xi)$ of progressively lower
order.


Exercises 


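1. Let $\chi : \mathbb{R}^n \to \mathbb{R}^n$ be a diffeomorphism that is a linear map outside some compact set.
Define $\chi^* : C^\infty(\mathbb{R}^n) \to C^\infty(\mathbb{R}^n)$ by $\chi^* f(x) = f(\chi(x))$. Show that

(8.19) $P \in OPS^m_{1,0} \Longrightarrow (\chi^*)^{-1} P \chi^* \in OPS^m_{1,0}$.

(Hint: Reduce to the case where $\chi$ is homotopic to a linear map through diffeomorphisms,
and show that the result in that case is a special case of Theorem 8.1, where
$A(t,x,D)$ is a $t$-dependent family of real vector fields on $\mathbb{R}^n$.)

2. Let $a \in C_0^\infty(\mathbb{R}^n)$, $\varphi \in C^\infty(\mathbb{R}^n)$ be real-valued, and $\nabla\varphi \ne 0$ on supp $a$. If $P \in
OPS^m$, show that

(8.20) $P(a e^{i\lambda\varphi}) = b(x,\lambda)\, e^{i\lambda\varphi(x)}$,

where

(8.21) $b(x,\lambda) \sim \lambda^m \bigl[b_0^\pm(x) + b_1^\pm(x)\lambda^{-1} + \cdots\bigr]$, $\lambda \to \pm\infty$.

(Hint: Using a partition of unity and Exercise 1, reduce to the case $\varphi(x) = x \cdot \xi$, for
some $\xi \in \mathbb{R}^n \setminus 0$.)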

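3. If $a$ and $\varphi$ are as in Exercise 2 above and $\Gamma_r$ is as in Exercise 2 of §7, show that, mod
$O(\lambda^{-\infty})$,

(8.22) $\Gamma_r(a e^{i\lambda\varphi}) = \cos r\sqrt{-\Delta}\,\bigl(A_r(x,\lambda)e^{i\lambda\varphi}\bigr) + \frac{\sin r\sqrt{-\Delta}}{\sqrt{-\Delta}}\,\bigl(B_r(x,\lambda)e^{i\lambda\varphi}\bigr)$,

where

$A_r(x,\lambda) \sim \lambda^{-1/2}\bigl[a_{0r}^\pm(x) + a_{1r}^\pm(x)\lambda^{-1} + \cdots\bigr]$,
$B_r(x,\lambda) \sim \lambda^{1/2}\bigl[b_{0r}^\pm(x) + b_{1r}^\pm(x)\lambda^{-1} + \cdots\bigr]$,

as $\lambda \to \pm\infty$.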


9. Microlocal regularity 


We define the notion of wave front set of a distribution $u \in H^{-\infty}(\mathbb{R}^n) =
\bigcup_s H^s(\mathbb{R}^n)$, which refines the notion of singular support. If $p(x,\xi) \in S^m$ has
principal symbol $p_m(x,\xi)$, homogeneous in $\xi$, then the characteristic set of
$P = p(x,D)$ is given by

(9.1) Char $P = \{(x,\xi) \in \mathbb{R}^n \times (\mathbb{R}^n \setminus 0) : p_m(x,\xi) = 0\}$.

If $p_m(x,\xi)$ is a $K \times K$ matrix, take the determinant. Equivalently, $(x_0,\xi_0)$ is
noncharacteristic for $P$, or $P$ is elliptic at $(x_0,\xi_0)$, if $|p(x,\xi)^{-1}| \le C|\xi|^{-m}$, for
$(x,\xi)$ in a small conic neighborhood of $(x_0,\xi_0)$ and $|\xi|$ large. By definition, a
conic set is invariant under the dilations $(x,\xi) \mapsto (x, r\xi)$, $r \in (0,\infty)$. The wave
front set is defined by

(9.2) $WF(u) = \bigcap\,\{\mathrm{Char}\, P : P \in OPS^0,\ Pu \in C^\infty\}$.

Clearly, $WF(u)$ is a closed conic subset of $\mathbb{R}^n \times (\mathbb{R}^n \setminus 0)$.


Proposition 9.1. If $\pi$ is the projection $(x,\xi) \mapsto x$, then

$\pi(WF(u))$ = sing supp $u$.


Proof. If $x_0 \notin$ sing supp $u$, there is a $\varphi \in C_0^\infty(\mathbb{R}^n)$, $\varphi = 1$ near $x_0$, such that
$\varphi u \in C_0^\infty(\mathbb{R}^n)$. Clearly, $(x_0,\xi) \notin$ Char $\varphi$ for any $\xi \ne 0$, so $\pi(WF(u)) \subset$ sing
supp $u$.

Conversely, if $x_0 \notin \pi(WF(u))$, then for any $\xi \ne 0$ there is a $Q \in OPS^0$ such
that $(x_0,\xi) \notin$ Char $Q$ and $Qu \in C^\infty$. Thus we can construct finitely many $Q_j \in
OPS^0$ such that $Q_ju \in C^\infty$ and each $(x_0,\xi)$ (with $|\xi| = 1$) is noncharacteristic
for some $Q_j$. Let $Q = \sum Q_j^*Q_j \in OPS^0$. Then $Q$ is elliptic near $x_0$ and $Qu \in
C^\infty$, so $u$ is $C^\infty$ near $x_0$.


We define the associated notion of $ES(P)$ for a pseudodifferential operator. Let
$U$ be an open conic subset of $\mathbb{R}^n \times (\mathbb{R}^n \setminus 0)$. We say that $p(x,\xi) \in S^m_{\rho,\delta}$ has order
$-\infty$ on $U$ if for each closed conic set $V$ of $U$ we have estimates, for each $N$,

(9.3) $|D_x^\beta D_\xi^\alpha p(x,\xi)| \le C_{\alpha\beta NV}\langle\xi\rangle^{-N}$, $(x,\xi) \in V$.

If $P = p(x,D) \in OPS^m_{\rho,\delta}$, we define the essential support of $P$ (and of $p(x,\xi)$)
to be the smallest closed conic set on the complement of which $p(x,\xi)$ has order
$-\infty$. We denote this set by $ES(P)$.

From the symbol calculus of §3, it follows easily that

(9.4) $ES(P_1P_2) \subset ES(P_1) \cap ES(P_2)$

provided $P_j \in OPS^{m_j}_{\rho_j,\delta_j}$, and $\rho_1 > \delta_2$. To relate $WF(Pu)$ to $WF(u)$ and $ES(P)$,
we begin with the following.


Lemma 9.2. Let $u \in H^{-\infty}(\mathbb{R}^n)$, and suppose that $U$ is a conic open set
satisfying

$WF(u) \cap U = \emptyset$.

If $P \in OPS^m_{\rho,\delta}$, $\rho > 0$, $\delta < 1$, and $ES(P) \subset U$, then $Pu \in C^\infty$.


Proof. Taking $P_0 \in OPS^0$ with symbol identically 1 on a conic neighborhood
of $ES(P)$, so $P = PP_0$ mod $OPS^{-\infty}$, it suffices to conclude that $P_0u \in C^\infty$, so
we can specialize the hypothesis to $P \in OPS^0$.

By hypothesis, we can find $Q_j \in OPS^0$ such that $Q_ju \in C^\infty$ and each
$(x,\xi) \in ES(P)$ is noncharacteristic for some $Q_j$, and if $Q = \sum Q_j^*Q_j$, then
$Qu \in C^\infty$ and Char $Q \cap ES(P) = \emptyset$. We claim there exists an operator $A \in
OPS^0$ such that $AQ = P$ mod $OPS^{-\infty}$. Indeed, let $\tilde Q$ be an elliptic operator
whose symbol equals that of $Q$ on a conic neighborhood of $ES(P)$, and let $\tilde Q^{-1}$
denote a parametrix for $\tilde Q$. Now simply set $A = P\tilde Q^{-1}$. Consequently, (mod $C^\infty$)
$Pu = AQu \in C^\infty$, so the lemma is proved.


We are ready for the basic result on the preservation of wave front sets by a 
pseudodifferential operator. 


Proposition 9.3. If $u \in H^{-\infty}$ and $P \in OPS^m_{\rho,\delta}$, with $\rho > 0$, $\delta < 1$, then

(9.5) $WF(Pu) \subset WF(u) \cap ES(P)$.


Proof. First we show $WF(Pu) \subset ES(P)$. Indeed, if $(x_0,\xi_0) \notin ES(P)$, choose
$Q = q(x,D) \in OPS^0$ such that $q(x,\xi) = 1$ on a conic neighborhood of
$(x_0,\xi_0)$ and $ES(Q) \cap ES(P) = \emptyset$. Thus $QP \in OPS^{-\infty}$, so $QPu \in C^\infty$. Hence
$(x_0,\xi_0) \notin WF(Pu)$.

In order to show that $WF(Pu) \subset WF(u)$, let $\Gamma$ be any conic neighborhood of
$WF(u)$, and write $P = P_1 + P_2$, $P_j \in OPS^m_{\rho,\delta}$, with $ES(P_1) \subset \Gamma$ and $ES(P_2) \cap
WF(u) = \emptyset$. By Lemma 9.2, $P_2u \in C^\infty$. Thus $WF(Pu) = WF(P_1u) \subset \Gamma$, which
shows $WF(Pu) \subset WF(u)$.


One says that a pseudodifferential operator of type $(\rho,\delta)$, with $\rho > 0$ and
$\delta < 1$, is microlocal. As a corollary, we have the following sharper form of local
regularity for elliptic operators, called microlocal regularity.


Corollary 9.4. If $P \in OPS^m_{\rho,\delta}$ is elliptic, $0 \le \delta < \rho \le 1$, then

(9.6) $WF(Pu) = WF(u)$.


Proof. We have seen that $WF(Pu) \subset WF(u)$. On the other hand, if $E \in
OPS^{-m}_{\rho,\delta}$ is a parametrix for $P$, we see that $WF(u) = WF(EPu) \subset WF(Pu)$. In
fact, by an argument close to the proof of Lemma 9.2, we have for general $P$ that

(9.7) $WF(u) \subset WF(Pu) \cup \mathrm{Char}\, P$.


We next discuss how the solution operator $e^{itA}$ to a scalar hyperbolic equation
$\partial u/\partial t = iA(x,D)u$ propagates the wave front set. We assume $A(x,\xi) \in S^1_{cl}$,
with real principal symbol. Suppose $WF(u) = \Sigma$. Then there is a countable family
of operators $p_j(x,D) \in OPS^0$, each of whose complete symbols vanishes in a
neighborhood of $\Sigma$, but such that

(9.8) $\Sigma = \bigcap_j \{(x,\xi) : p_j(x,\xi) = 0\}$.

We know that $p_j(x,D)u \in C^\infty$ for each $j$. Using Egorov's theorem, we want
to construct a family of pseudodifferential operators $q_j(x,D) \in OPS^0$ such that
$q_j(x,D)e^{itA}u \in C^\infty$, this family being rich enough to describe the wave front
set of $e^{itA}u$.

Indeed, let $q_j(x,D) = e^{itA}p_j(x,D)e^{-itA}$. Egorov's theorem implies that
$q_j(x,D) \in OPS^0$ (modulo a smoothing operator) and gives the principal symbol
of $q_j(x,D)$. Since $p_j(x,D)u \in C^\infty$, we have $e^{itA}p_j(x,D)u \in C^\infty$, which in
turn implies $q_j(x,D)e^{itA}u \in C^\infty$. From this it follows that $WF(e^{itA}u)$ is contained
in the intersection of the characteristics of the $q_j(x,D)$, which is precisely
$C(t)\Sigma$, the image of $\Sigma$ under the canonical transformation $C(t)$, generated by
$H_{A_1}$. In other words,

$WF(e^{itA}u) \subset C(t)WF(u)$.

However, our argument is reversible; $u = e^{-itA}(e^{itA}u)$. Consequently, we have
the following result:


Proposition 9.5. If $A = A(x,D) \in OPS^1$ is scalar with real principal symbol,
then, for $u \in H^{-\infty}$,

(9.9) $WF(e^{itA}u) = C(t)WF(u)$.


The same argument works for the solution operator $S(t,0)$ to a time-dependent,
scalar, hyperbolic equation.


Exercises


1. If $a \in C_0^\infty(\mathbb{R}^n)$, $\varphi \in C^\infty(\mathbb{R}^n)$ is real-valued, $\nabla\varphi \ne 0$ on supp $a$, as in Exercise 2 of
§8, and $P = p(x,D) \in OPS^m$, so

$P(a e^{i\lambda\varphi}) = b(x,\lambda)\, e^{i\lambda\varphi(x)}$,

as in (8.20), show that, mod $O(|\lambda|^{-\infty})$, $b(x,\lambda)$ depends only on the behavior of $p(x,\xi)$
on an arbitrarily small conic neighborhood of

$C_\varphi = \{(x, \lambda\, d\varphi(x)) : x \in \mathrm{supp}\, a,\ \lambda \ne 0\}$.

If $C_\varphi^+$ is the subset of $C_\varphi$ on which $\lambda > 0$, show that the asymptotic behavior of
$b(x,\lambda)$ as $\lambda \to +\infty$ depends only on the behavior of $p(x,\xi)$ on an arbitrarily small
conic neighborhood of $C_\varphi^+$.

2. If $\Gamma_r$ is as in (8.22), show that, given $r > 0$,

(9.10) $(\cos r\sqrt{-\Delta})(a e^{i\lambda\varphi}) = \Gamma_r Q_r(a e^{i\lambda\varphi})$, mod $O(\lambda^{-\infty})$, $\lambda > 0$,

for some $Q_r \in OPS^{1/2}$. Consequently, analyze the behavior of the left side of (9.10),
as $\lambda \to +\infty$, in terms of the behavior of $\Gamma_r$, analyzed in §7 of Chap. 6.


10. Operators on manifolds 


Let $M$ be a smooth manifold. It would be natural to say that a continuous linear
operator $P : C_0^\infty(M) \to \mathcal{D}'(M)$ is a pseudodifferential operator in $OPS^m_{\rho,\delta}(M)$
provided its Schwartz kernel is $C^\infty$ off the diagonal in $M \times M$, and there exists
an open cover $\Omega_j$ of $M$, a subordinate partition of unity $\varphi_j$, and diffeomorphisms
$F_j : \Omega_j \to \mathcal{O}_j \subset \mathbb{R}^n$ that transform the operators $\varphi_k P \varphi_j : C^\infty(\Omega_j) \to \mathcal{E}'(\Omega_k)$
into pseudodifferential operators in $OPS^m_{\rho,\delta}$, as defined in §1.

This is a rather "liberal" definition of $OPS^m_{\rho,\delta}(M)$. For example, it poses no
growth restrictions on the Schwartz kernel $K \in \mathcal{D}'(M \times M)$ at infinity. Consequently,
if $M$ happens to be $\mathbb{R}^n$, the class of operators in $OPS^m_{\rho,\delta}(M)$ as defined
above is a bit larger than the class $OPS^m_{\rho,\delta}$ defined in §1. One negative consequence
of this definition is that pseudodifferential operators cannot always be
composed. One drastic step to fix this would be to insist that the kernel be properly
supported, so $P : C_0^\infty(M) \to C_0^\infty(M)$. If $M$ is compact, these problems do
not arise. If $M$ is noncompact, it is often of interest to place specific restrictions
on $K$ near infinity, but we won't go further into this point here.

Another way in which the definition of $OPS^m_{\rho,\delta}(M)$ given above is liberal is
that it requires $P$ to be locally transformed to pseudodifferential operators on $\mathbb{R}^n$
by some coordinate cover. One might ask if then $P$ is necessarily so transformed
by every coordinate cover. This comes down to asking if the class $OPS^m_{\rho,\delta}$ defined
in §1 is invariant under a diffeomorphism $F : \mathbb{R}^n \to \mathbb{R}^n$. It would suffice to
establish this for the case where $F$ is the identity outside a compact set.

In case $\rho \in (1/2, 1]$ and $\delta = 1 - \rho$, this invariance is a special case of the
Egorov theorem established in §8. Indeed, one can find a time-dependent vector
field $X(t)$ whose flow at $t = 1$ coincides with $F$ and apply Theorem 8.1 to
$iA(t,x,D) = X(t)$. Note that the formula for the principal symbol of the conjugated
operator given there implies

(10.1) $p(1, F(x), \xi) = p_0(x, F'(x)^t\xi)$,

so that the principal symbol is well defined on the cotangent bundle of $M$.

We will therefore generally insist that $\rho \in (1/2, 1]$ and $\delta = 1 - \rho$ when
talking about $OPS^m_{\rho,\delta}(M)$ for a manifold $M$, without a distinguished coordinate
chart. In special situations, it might be natural to use coordinate charts with special
structure. For instance, for a Cartesian product $M = \mathbb{R} \times \Omega$, one can stick to
product coordinate systems. In such a case, we can construct a parametrix $E$ for
the hypoelliptic operator $\partial/\partial t - \Delta_x$, $t \in \mathbb{R}$, $x \in \Omega$, and unambiguously regard $E$
as an operator in $OPS^{-1}_{1/2,0}(\mathbb{R} \times \Omega)$.

We make the following comments on the principal symbol of an operator $P \in
OPS^m_{\rho,\delta}(M)$, when $\rho \in (1/2, 1]$, $\delta = 1 - \rho$. By the arguments in §8, the principal
symbol is well defined, if it is regarded as an element of the quotient space:

(10.2) $p(x,\xi) \in S^m_{\rho,\delta}(T^*M) / S^{m-(\rho-\delta)}_{\rho,\delta}(T^*M)$.

In particular, by Theorem 8.1, in case $P \in OPS^m_{1,0}(M)$, we have

(10.3) $p(x,\xi) \in S^m_{1,0}(T^*M) / S^{m-1}_{1,0}(T^*M)$.

If $P \in OPS^m(M)$, then the principal symbol can be taken to be homogeneous in $\xi$
of degree $m$, by (8.16). Note that the characterizations of the Schwartz kernels of
operators in $OPS^m_{1,0}$ and in $OPS^m$ given in §2 also make clear the invariance of
these classes under coordinate transformations.

We now discuss some properties of an elliptic operator $A \in OPS^m_{1,0}(M)$,
when $M$ is a compact Riemannian manifold. Denote by $B$ a parametrix, so we
have, for each $s \in \mathbb{R}$,

(10.4) $A : H^{s+m}(M) \to H^s(M)$, $B : H^s(M) \to H^{s+m}(M)$,

and $AB = I + K_1$, $BA = I + K_2$, where $K_j : \mathcal{D}'(M) \to C^\infty(M)$. Thus
$K_j$ is compact on each Sobolev space $H^s(M)$, so $B$ is a two-sided Fredholm
inverse of $A$ in (10.4). In particular, $A$ is a Fredholm operator; ker $A = \mathcal{K}_{s+m} \subset
H^{s+m}(M)$ is finite-dimensional, and $A(H^{s+m}(M)) \subset H^s(M)$ is closed, of
finite codimension, so

$\mathcal{C}_s = \{v \in H^{-s}(M) : (Au, v) = 0 \text{ for all } u \in H^{s+m}(M)\}$

is finite-dimensional. Note that $\mathcal{C}_s$ is the null space of

(10.5) $A^* : H^{-s}(M) \to H^{-s-m}(M)$,

which is also an elliptic operator in $OPS^m_{1,0}(M)$. Elliptic regularity yields, for
all $s$,

(10.6) $\mathcal{K}_{s+m} = \{u \in C^\infty(M) : Au = 0\}$, $\mathcal{C}_s = \{v \in C^\infty(M) : A^*v = 0\}$.

Thus these spaces are independent of $s$.
Suppose now that $m > 0$. We will consider $A$ as an unbounded operator on the
Hilbert space $L^2(M)$, with domain

(10.7) $\mathcal{D}(A) = \{u \in L^2(M) : Au \in L^2(M)\}$.

It is easy to see that $A$ is closed. Also, elliptic regularity implies

(10.8) $\mathcal{D}(A) = H^m(M)$.

Since $A$ is closed and densely defined, its Hilbert space adjoint is defined, also as
a closed, unbounded operator on $L^2(M)$, with a dense domain. The symbol $A^*$ is
also our preferred notation for the Hilbert space adjoint. To avoid confusion, we
will temporarily use $A^t$ to denote the adjoint on $\mathcal{D}'(M)$, so $A^t \in OPS^m_{1,0}(M)$,
$A^t : H^{s+m}(M) \to H^s(M)$, for all $s$. Now the unbounded operator $A^*$ has
domain

(10.9) $\mathcal{D}(A^*) = \{u \in L^2(M) : |(u, Av)| \le c(u)\|v\|_{L^2},\ \forall\, v \in \mathcal{D}(A)\}$,

and then $A^*u$ is the unique element of $L^2(M)$ such that

(10.10) $(A^*u, v) = (u, Av)$, for all $v \in \mathcal{D}(A)$.


Recall that $\mathcal{D}(A) = H^m(M)$. Since, for any $u \in H^m(M)$, $v \in H^m(M)$, we have
$(A^tu, v) = (u, Av)$, we see that $\mathcal{D}(A^*) \supset H^m(M)$ and $A^* = A^t$ on $H^m(M)$.
On the other hand, $(u, Av) = (A^tu, v)$ holds for all $v \in H^m(M)$, $u \in L^2(M)$,
the latter inner product being given by the duality of $H^{-m}(M)$ and $H^m(M)$.
Thus it follows that

$u \in \mathcal{D}(A^*) \Longrightarrow A^*u = A^tu \in L^2(M)$.

But elliptic regularity for $A^t \in OPS^m_{1,0}(M)$ then implies $u \in H^m(M)$. Thus

(10.11) $\mathcal{D}(A^*) = H^m(M)$, $A^* = A^t\big|_{H^m(M)}$.

In particular, if $A$ is elliptic in $OPS^m_{1,0}(M)$, $m > 0$, and also symmetric (i.e.,
$A = A^t$), then the Hilbert space operator is self-adjoint; $A = A^*$. For any $\lambda \in
\mathbb{C} \setminus \mathbb{R}$, $(\lambda I - A)^{-1} : L^2(M) \to \mathcal{D}(A) = H^m(M)$, so $A$ has compact resolvent.
Thus $L^2(M)$ has an orthonormal basis of eigenfunctions of $A$, $Au_j = \lambda_ju_j$,
$|\lambda_j| \to \infty$, and, by elliptic regularity, each $u_j$ belongs to $C^\infty(M)$.


11. The method of layer potentials 37 


Exercises 


In the following exercises, assume that $M$ is a smooth, compact, Riemannian manifold. Let $A \in OPS^m(M)$ be elliptic, positive, and self-adjoint, with $m > 0$. Let $u_j$ be an orthonormal basis of $L^2(M)$ consisting of eigenfunctions of $A$, $Au_j = \lambda_j u_j$. Given $f \in \mathcal{D}'(M)$, form "Fourier coefficients" $\hat{f}(j) = (f, u_j)$. Thus $f \in L^2(M)$ implies

(10.12)    $f = \sum_j \hat{f}(j)\, u_j,$

with convergence in $L^2$-norm.

1. Given $s \in \mathbb{R}$, show that $f \in H^s(M)$ if and only if $\sum_j |\hat{f}(j)|^2 \langle\lambda_j\rangle^{2s/m} < \infty$.

2. Show that, for any $s \in \mathbb{R}$, $f \in H^s(M)$, (10.12) holds, with convergence in $H^s$-norm. Conclude that if $s > n/2$ and $f \in H^s(M)$, the series converges uniformly to $f$.

3. If $s > n/2$ and $f \in H^s(M)$, show that (10.12) converges absolutely. (Hint: Fix $x_0 \in M$ and pick $c_j \in \mathbb{C}$, $|c_j| = 1$, such that $c_j \hat{f}(j) u_j(x_0) \ge 0$. Now consider $\sum_j c_j \hat{f}(j) u_j$.)

4. Let $-L$ be a second-order, elliptic, positive, self-adjoint differential operator on a compact Riemannian manifold $M$. Suppose $A \in OPS^1(M)$ is positive, self-adjoint, and $A^2 = -L + R$, where $R : \mathcal{D}'(M) \to C^\infty(M)$. Show that $A - \sqrt{-L} : \mathcal{D}'(M) \to C^\infty(M)$.

One approach to Exercise 4 is the following. 

5. Given $f \in H^s(M)$, form

$u(y,x) = e^{-y\sqrt{-L}} f(x), \quad v(y,x) = e^{-yA} f(x),$

for $(y,x) \in [0,\infty) \times M$. Note that

$\Bigl(\frac{\partial^2}{\partial y^2} + L\Bigr) u = 0, \quad \Bigl(\frac{\partial^2}{\partial y^2} + L\Bigr) v = Rv(y,x).$

Use estimates and regularity for the Dirichlet problem for $\partial^2/\partial y^2 + L$ on $[0,\infty) \times M$ to show that $u - v \in C^\infty([0,\infty) \times M)$. Conclude that $\partial u/\partial y - \partial v/\partial y\big|_{y=0} = (A - \sqrt{-L}) f \in C^\infty(M)$.

6. With $L$ as above, use the symbol calculus of §4 to construct a self-adjoint $A \in OPS^1(M)$, with positive principal symbol, such that $A^2 + L \in OPS^{-\infty}(M)$. Conclude that Exercise 4 applies to $A$.

7. Show that $OPS^m_{1,0}(M)$ has a natural Fréchet space structure.


11. The method of layer potentials 


We discuss, in the light of the theory of pseudodifferential operators, the use of "single- and double-layer potentials" to study the Dirichlet and Neumann boundary problems for the Laplace equation. Material developed here will be useful in §7 of Chap. 9, which treats the use of integral equations in scattering theory.

Let $\overline{\Omega}$ be a connected, compact Riemannian manifold with nonempty boundary; $n = \dim \Omega$. Suppose $\overline{\Omega} \subset M$, a Riemannian manifold of dimension $n$ without boundary, on which there is a fundamental solution $E(x,y)$ to the Laplace equation:

(11.1)    $\Delta_x E(x,y) = \delta_y(x),$

where $E(x,y)$ is the Schwartz kernel of an operator $E(x,D) \in OPS^{-2}(M)$; we have

(11.2)    $E(x,y) \sim c_n\, \mathrm{dist}(x,y)^{2-n} + \cdots$

as $x \to y$, if $n \ge 3$, while

(11.3)    $E(x,y) \sim c_2 \log \mathrm{dist}(x,y) + \cdots$

if $n = 2$. Here, $c_n = -[(n-2)\,\mathrm{Area}(S^{n-1})]^{-1}$ for $n \ge 3$, and $c_2 = 1/2\pi$. The single- and double-layer potentials of a function $f$ on $\partial\Omega$ are defined by

(11.4)    $\mathcal{S}\ell\, f(x) = \int_{\partial\Omega} f(y)\, E(x,y)\, dS(y)$

and

(11.5)    $\mathcal{D}\ell\, f(x) = \int_{\partial\Omega} f(y)\, \frac{\partial E}{\partial \nu_y}(x,y)\, dS(y),$

for $x \in M \setminus \partial\Omega$. Given a function $v$ on $M \setminus \partial\Omega$, for $x \in \partial\Omega$, let $v_+(x)$ and $v_-(x)$ denote the limits of $v(z)$ as $z \to x$, from $z \in \Omega$ and $z \in M \setminus \overline{\Omega} = \mathcal{O}$, respectively, when these limits exist. The following are fundamental properties of these layer potentials.


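Proposition 11.1. For $x \in \partial\Omega$, we have

(11.6)    $\mathcal{S}\ell\, f_+(x) = \mathcal{S}\ell\, f_-(x) = Sf(x)$

and

(11.7)    $\mathcal{D}\ell\, f_\pm(x) = \pm\frac{1}{2} f(x) + \frac{1}{2} Nf(x),$

where, for $x \in \partial\Omega$,

(11.8)    $Sf(x) = \int_{\partial\Omega} f(y)\, E(x,y)\, dS(y)$

and

(11.9)    $Nf(x) = 2\int_{\partial\Omega} f(y)\, \frac{\partial E}{\partial \nu_y}(x,y)\, dS(y).$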


Note that $E(x,\cdot)\big|_{\partial\Omega}$ is integrable, uniformly in $x$, and that the conclusion in (11.6) is elementary, at least for $f$ continuous; the conclusion in (11.7) is a bit more mysterious. To see what is behind such results, let us look at the more general situation of

(11.10)    $v = p(x,D)(f\sigma),$

where $\sigma \in \mathcal{E}'(M)$ is surface measure on a hypersurface (here $\partial\Omega$), $f \in \mathcal{D}'(\partial\Omega)$, so $f\sigma \in \mathcal{E}'(M)$. Assume that $p(x,D) \in OPS^m(M)$. Make a local coordinate change, straightening out the surface to $\{x_n = 0\}$. Then, in this coordinate system,

(11.11)    $v(x', x_n) = \int \hat{f}(\xi')\, e^{ix'\cdot\xi'}\, p(x,\xi',\xi_n)\, e^{ix_n\xi_n}\, d\xi_n\, d\xi' = q(x_n, x', D_{x'}) f,$

for $x_n \ne 0$, where

(11.12)    $q(x_n, x', \xi') = \int p(x,\xi',\xi_n)\, e^{ix_n\xi_n}\, d\xi_n.$

If $p(x,\xi)$ is homogeneous of degree $m$ in $\xi$, for $|\xi| \ge 1$, then for $|\xi'| \ge 1$ we have

(11.13)    $q(x_n, x', \xi') = |\xi'|^{m+1}\, \tilde{p}(x, \omega', x_n|\xi'|),$

where $\omega' = \xi'/|\xi'|$ and

$\tilde{p}(x,\omega',\tau) = \int p(x,\omega',\zeta)\, e^{i\zeta\tau}\, d\zeta.$


Now, if $m < -1$, the integral in (11.12) is absolutely convergent and $q(x_n, x', \xi')$ is continuous in all arguments, even across $x_n = 0$. On the other hand, if $m = -1$, then, temporarily neglecting all the arguments of $p$ but the last, we are looking at the Fourier transform of a smooth function of one variable whose asymptotic behavior as $\xi_n \to \pm\infty$ is of the form $C_1^\pm \xi_n^{-1} + C_2^\pm \xi_n^{-2} + \cdots$. From the results of Chap. 3 we know that the Fourier transform is smooth except at $x_n = 0$, and if $C_1^+ = C_1^-$, then the Fourier transform has a jump across $x_n = 0$; otherwise there may be a logarithmic singularity.

It follows that if $p(x,D) \in OPS^m(M)$ and $m < -1$, then (11.10) has a limit on $\partial\Omega$, given by

(11.14)    $v\big|_{\partial\Omega} = Qf, \quad Q \in OPS^{m+1}(\partial\Omega).$


On the other hand, if $m = -1$ and the symbol of $p(x,D)$ has the behavior that, for $x \in \partial\Omega$, $\nu_x$ normal to $\partial\Omega$ at $x$,

(11.15)    $p(x, \xi + \tau\nu_x) = \pm C(x,\xi)\,\tau^{-1} + O(\tau^{-2}), \quad \tau \to \pm\infty,$

then (11.10) has a limit from each side of $\partial\Omega$, and

(11.16)    $v_\pm = Q_\pm f, \quad Q_\pm \in OPS^0(\partial\Omega).$


To specialize these results to the setting of Proposition 11.1, note that

(11.17)    $\mathcal{S}\ell\, f = E(x,D)(f\sigma)$

and

(11.18)    $\mathcal{D}\ell\, f = E(x,D)X^*(f\sigma),$

where $X$ is any vector field on $M$ equal to $\partial/\partial\nu$ on $\partial\Omega$, with formal adjoint $X^*$, given by

(11.19)    $X^*v = -Xv - (\operatorname{div} X)v.$


The analysis of (11.10) applies directly to (11.17), with $m = -2$. That the boundary value is given by (11.8) is elementary for $f \in C(\partial\Omega)$, as noted before. Given (11.14), it then follows for more general $f$.

Now (11.18) is also of the form (11.10), with $p(x,D) = E(x,D)X^* \in OPS^{-1}(M)$. Note that the principal symbol at $x \in \partial\Omega$ is given by

(11.20)    $p_0(x,\xi) = -|\xi|^{-2}\langle\nu(x), \xi\rangle,$

which satisfies the condition (11.15), so the conclusion (11.16) applies. Note that

$p_0(x, \xi + \tau\nu_x) = -|\xi + \tau\nu_x|^{-2}\langle\nu_x, \xi + \tau\nu_x\rangle,$

so in this case (11.15) holds with $C(x,\xi) = 1$. Thus the operators $Q_\pm$ in (11.16) have principal symbols $\pm$ const. That the constant is as given in (11.7) follows from keeping careful track of the constants in the calculations (11.11)-(11.13) (cf. Exercise 9 below).

Let us take a closer look at the behavior of $(\partial/\partial\nu_y)E(x,y)$. Note that, for $x$ close to $y$, if $V_{x,y}$ denotes the unit vector at $y$ in the direction of the geodesic from $x$ to $y$, then (for $n \ge 3$)

(11.21)    $\nabla_y E(x,y) \sim (2-n)\,c_n\, \mathrm{dist}(x,y)^{1-n}\, V_{x,y} + \cdots.$

If $y \in \partial\Omega$ and $\nu_y$ is the unit normal to $\partial\Omega$ at $y$, then

(11.22)    $\frac{\partial}{\partial\nu_y} E(x,y) \sim (2-n)\,c_n\, \mathrm{dist}(x,y)^{1-n}\, \langle V_{x,y}, \nu_y\rangle + \cdots.$

Note that, with $c_n$ as defined after (11.3), $(2-n)c_n = 1/\mathrm{Area}(S^{n-1})$. Clearly, the inner product $\langle V_{x,y}, \nu_y\rangle = \alpha(x,y)$ restricted to $(x,y) \in \partial\Omega \times \partial\Omega$ is Lipschitz and vanishes on the diagonal $x = y$. This vanishing makes $(\partial E/\partial\nu_y)(x,y)$ integrable on $\partial\Omega \times \partial\Omega$. It is clear that in the case (11.7), $Q_\pm$ have Schwartz kernels equal to $(\partial/\partial\nu_y)E(x,y)$ on the complement of the diagonal in $\partial\Omega \times \partial\Omega$. In light of our analysis above of the principal symbol of $Q_\pm$, the proof of (11.7) is complete.

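As a check on the evaluation of the constant $c$ in $\mathcal{D}\ell\, f_\pm = \pm cf + (1/2)Nf$, $c = 1/2$, note that applying Green's formula to $\int_\Omega (\Delta 1) \cdot E(x,y)\, dy$ readily gives

$\int_{\partial\Omega} \frac{\partial E}{\partial\nu_y}(x,y)\, dS(y) = \begin{cases} 1, & \text{for } x \in \Omega, \\ 0, & \text{for } x \in \mathcal{O}, \end{cases}$

as the value of $\mathcal{D}\ell\, f$ for $f = 1$. Since $\mathcal{D}\ell\, f_+ - \mathcal{D}\ell\, f_- = 2cf$, this forces $c = 1/2$.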
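The check with $f = 1$ can be tried numerically. The following is a minimal sketch, assuming the simplest planar case (not taken from the text): $\Omega$ is the unit disk in $\mathbb{R}^2$, $E(x,y) = (1/2\pi)\log|x-y|$, and the boundary integral is approximated by the trapezoid rule. The function name `double_layer_of_one` and the sample points are illustrative choices.

```python
import numpy as np

def double_layer_of_one(x, n_nodes=400):
    """Trapezoid-rule value of the integral of dE/dnu_y(x, y) over the unit
    circle, for E(x, y) = (1/(2*pi)) * log|x - y| and x off the circle."""
    theta = 2.0 * np.pi * np.arange(n_nodes) / n_nodes
    y = np.stack([np.cos(theta), np.sin(theta)], axis=1)  # boundary points
    nu = y                       # outward unit normal on the unit circle
    diff = y - np.asarray(x, dtype=float)                 # y - x
    r2 = np.sum(diff**2, axis=1)                          # |x - y|^2
    # grad_y E = (y - x) / (2*pi*|x - y|^2); take its normal component.
    kernel = np.sum(diff * nu, axis=1) / (2.0 * np.pi * r2)
    return kernel.sum() * (2.0 * np.pi / n_nodes)

inside = double_layer_of_one([0.3, 0.2])     # a point of the disk
outside = double_layer_of_one([1.5, -0.7])   # a point of the exterior
jump = inside - outside                      # plays the role of 2c for f = 1
```

Under these assumptions the interior and exterior values should approximate 1 and 0, so the jump across the circle approximates $2c = 1$.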


The way in which $\pm(1/2) f(x)$ arises in (11.7) is captured well by the model case of $\partial\Omega$ a hyperplane in $\mathbb{R}^n$, and

$E((x', x_n), (y', 0)) = c_n\bigl[(x' - y')^2 + x_n^2\bigr]^{(2-n)/2},$

when (11.22) becomes

$\frac{\partial E}{\partial\nu_y}\bigl((x', x_n), (y', 0)\bigr) = (2-n)\,c_n\, x_n \bigl[(x' - y')^2 + x_n^2\bigr]^{-n/2},$

though in this example $N = 0$.

The following properties of the operators $S$ and $N$ are fundamental.

Proposition 11.2. Given that $\partial\Omega$ is smooth, we have

(11.23)    $S, N \in OPS^{-1}(\partial\Omega), \quad S \text{ elliptic}.$

Proof. That $S$ has this behavior follows immediately from (11.2) and (11.3). The ellipticity at $x$ follows from taking normal coordinates at $x$ and using Exercise 3 of §4, for $n \ge 3$; for $n = 2$, the reader can supply an analogous argument. That $N$ also satisfies (11.23) follows from (11.22) and the vanishing of $\alpha(x,y) = \langle V_{x,y}, \nu_y\rangle$ on the diagonal.


An important result complementary to Proposition 11.1 is the following, on the behavior of the normal derivative at $\partial\Omega$ of single-layer potentials.

Proposition 11.3. For $x \in \partial\Omega$, we have

(11.24)    $\frac{\partial}{\partial\nu}\mathcal{S}\ell\, f_\pm(x) = \frac{1}{2}\bigl(\mp f + N^\# f\bigr),$

where $N^\# \in OPS^{-1}(\partial\Omega)$ is given by

(11.25)    $N^\# f(x) = 2\int_{\partial\Omega} f(y)\, \frac{\partial E}{\partial\nu_x}(x,y)\, dS(y).$

In case $E(x,y)$ is real valued and $E(x,y) = E(y,x)$,

(11.26)    $N^\# = N^*.$

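Proof. The proof of (11.24) is directly parallel to that of (11.7). To see on general principles why this should be so, use (11.17) to write $(\partial/\partial\nu)\mathcal{S}\ell\, f$ as the restriction to $\partial\Omega$ of

(11.27)    $X\mathcal{S}\ell\, f = XE(x,D)(f\sigma).$

Using (11.18) and (11.19), we see that

(11.28)    $\mathcal{D}\ell\, f + X\mathcal{S}\ell\, f = [X, E(x,D)](f\sigma) - E(x,D)(\operatorname{div} X)(f\sigma) = A(x,D)(f\sigma),$

with $A(x,D) \in OPS^{-2}(M)$, the same class as $E(x,D)$. Thus the extension of $A(x,D)(f\sigma)$ to $\partial\Omega$ is straightforward, and we have

(11.29)    $\frac{\partial}{\partial\nu}\mathcal{S}\ell\, f_\pm = -\mathcal{D}\ell\, f_\pm + A(x,D)(f\sigma)\big|_{\partial\Omega}.$

In particular, the jumps across $\partial\Omega$ are related by

(11.30)    $\frac{\partial}{\partial\nu}\mathcal{S}\ell\, f_+ - \frac{\partial}{\partial\nu}\mathcal{S}\ell\, f_- = \mathcal{D}\ell\, f_- - \mathcal{D}\ell\, f_+,$

consistent with the result implied by formulas (11.7) and (11.24).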

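It is also useful to understand the boundary behavior of $(\partial/\partial\nu)\mathcal{D}\ell\, f$. This is a bit harder since $\partial^2 E/\partial\nu_x\partial\nu_y$ is more highly singular. From here on, assume $E(x,y) = E(y,x)$, so also $\Delta_y E(x,y) = \delta_x(y)$. We define the Neumann operator

(11.31)    $\mathcal{N} : C^\infty(\partial\Omega) \longrightarrow C^\infty(\partial\Omega)$

as follows. Given $f \in C^\infty(\partial\Omega)$, let $u \in C^\infty(\overline{\Omega})$ be the unique solution to

(11.32)    $\Delta u = 0 \text{ on } \Omega, \quad u = f \text{ on } \partial\Omega,$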


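and let

(11.33)    $\mathcal{N}f = \frac{\partial u}{\partial\nu}\Big|_{\partial\Omega},$

the limit taken from within $\Omega$. It is a simple consequence of Green's formula that if we form

(11.34)    $\int_{\partial\Omega} \Bigl[f(y)\,\frac{\partial E}{\partial\nu_y}(x,y) - \mathcal{N}f(y)\,E(x,y)\Bigr]\, dS(y) = \mathcal{D}\ell\, f(x) - \mathcal{S}\ell\, \mathcal{N}f(x),$

for $x \in M \setminus \partial\Omega$, then

(11.35)    $\mathcal{D}\ell\, f(x) - \mathcal{S}\ell\, \mathcal{N}f(x) = \begin{cases} u(x), & x \in \Omega, \\ 0, & x \in M \setminus \overline{\Omega}, \end{cases}$

where $u$ is given by (11.32). Note that taking the limit of (11.35) from within $\Omega$, using (11.6) and (11.7), gives $f = (1/2)f + (1/2)Nf - S\mathcal{N}f$, which implies the identity

(11.36)    $S\mathcal{N} = -\frac{1}{2}(I - N).$

Taking the limit in (11.35) from $M \setminus \overline{\Omega}$ gives the same identity. In view of the behavior (11.23), in particular the ellipticity of $S$, we conclude that

(11.37)    $\mathcal{N} \in OPS^1(\partial\Omega), \quad \text{elliptic}.$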


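Now we apply $\partial/\partial\nu$ to the identity (11.35), evaluating on $\partial\Omega$ from both sides. Evaluating from $\Omega$ gives

(11.38)    $\frac{\partial}{\partial\nu}\mathcal{D}\ell\, f_+ - \frac{\partial}{\partial\nu}\mathcal{S}\ell\, \mathcal{N}f_+ = \mathcal{N}f,$

while evaluating from $M \setminus \overline{\Omega}$ gives

(11.39)    $\frac{\partial}{\partial\nu}\mathcal{D}\ell\, f_- - \frac{\partial}{\partial\nu}\mathcal{S}\ell\, \mathcal{N}f_- = 0.$

In particular, applying $\partial/\partial\nu$ to (11.35) shows that $(\partial/\partial\nu)\mathcal{D}\ell\, f_\pm$ exists, by Proposition 11.3. Furthermore, applying (11.24) to $(\partial/\partial\nu)\mathcal{S}\ell\, \mathcal{N}f_\pm$, we have a proof of the following.

Proposition 11.4. For $x \in \partial\Omega$, we have

(11.40)    $\frac{\partial}{\partial\nu}\mathcal{D}\ell\, f_\pm(x) = \frac{1}{2}(I + N^\#)\mathcal{N}f.$

In particular, there is no jump across $\partial\Omega$ of $(\partial/\partial\nu)\mathcal{D}\ell\, f$.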


We have now developed the layer potentials far enough to apply them to the study of the Dirichlet problem. We want an approximate formula for the Poisson integral $u = \mathrm{PI}\, f$, the unique solution to

(11.41)    $\Delta u = 0 \text{ in } \Omega, \quad u\big|_{\partial\Omega} = f.$

Motivated by the Poisson integral formula on $\mathbb{R}^n$, we look for a solution of the form

(11.42)    $u(x) = \mathcal{D}\ell\, g(x), \quad x \in \Omega,$

and try to relate $g$ to $f$. In view of Proposition 11.1, letting $x \to z \in \partial\Omega$ in (11.42) yields

(11.43)    $u(z) = \frac{1}{2}(g + Ng), \quad \text{for } z \in \partial\Omega.$

Thus if we define $u$ by (11.42), then (11.41) is equivalent to

(11.44)    $f = \frac{1}{2}(I + N)g.$

Alternatively, we can try to solve (11.41) in terms of a single-layer potential:

(11.45)    $u(x) = \mathcal{S}\ell\, h(x), \quad x \in \Omega.$

If $u$ is defined by (11.45), then (11.41) is equivalent to

(11.46)    $f = Sh.$

Note that, by (11.23), the operator $(1/2)(I + N)$ in (11.44) is Fredholm, of index zero, on each space $H^s(\partial\Omega)$. It is not hard to verify that $S$ is elliptic of order $-1$, with real principal symbol, so for each $s$,

$S : H^{s-1}(\partial\Omega) \longrightarrow H^s(\partial\Omega)$

is Fredholm, of index zero.

One basic case when (11.44) and (11.46) can both be solved is the case of bounded $\Omega$ in $M = \mathbb{R}^n$, with the standard flat Laplacian.

Proposition 11.5. If $\Omega$ is a smooth, bounded subdomain of $\mathbb{R}^n$, with connected complement, then, for all $s$,

(11.47)    $I + N : H^s(\partial\Omega) \longrightarrow H^s(\partial\Omega) \quad \text{and} \quad S : H^{s-1}(\partial\Omega) \longrightarrow H^s(\partial\Omega)$

are isomorphisms.

Proof. It suffices to show that $I + N$ and $S$ are injective on $C^\infty(\partial\Omega)$. First, if $g \in C^\infty(\partial\Omega)$ belongs to the null space of $I + N$, then, by (11.43) and the maximum principle, we have $\mathcal{D}\ell\, g = 0$ in $\Omega$. By (11.7), the jump of $\mathcal{D}\ell\, g$ across $\partial\Omega$ is $g$, so we have for $v = \mathcal{D}\ell\, g\big|_{\mathcal{O}}$, where $\mathcal{O} = \mathbb{R}^n \setminus \overline{\Omega}$,

(11.48)    $\Delta v = 0 \text{ on } \mathcal{O}, \quad v\big|_{\partial\Omega} = -g.$

Also, $v$ clearly vanishes at infinity. Now, by (11.40), $(\partial/\partial\nu)\mathcal{D}\ell\, g$ does not jump across $\partial\Omega$, so we have $\partial v/\partial\nu = 0$ on $\partial\Omega$. But at a point on $\partial\Omega$ where $-g$ is maximal, this contradicts Zaremba's principle, unless $g = 0$. This proves that $I + N$ is an isomorphism in this case.

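Next, suppose $h \in C^\infty(\partial\Omega)$ belongs to the null space of $S$. Then, by (11.46) and the maximum principle, we have $\mathcal{S}\ell\, h = 0$ on $\Omega$. By (11.24), the jump of $(\partial/\partial\nu)\mathcal{S}\ell\, h$ across $\partial\Omega$ is $-h$, so we have for $w = \mathcal{S}\ell\, h\big|_{\mathcal{O}}$ that

(11.49)    $\Delta w = 0 \text{ on } \mathcal{O}, \quad \frac{\partial w}{\partial\nu}\Big|_{\partial\Omega} = h,$

and $w$ vanishes at infinity. This time, $\mathcal{S}\ell\, h$ does not jump across $\partial\Omega$, so we also have $w = 0$ on $\partial\Omega$. The maximum principle forces $w = 0$ on $\mathcal{O}$, so $h = 0$. This proves that $S$ is an isomorphism in this case.

In view of (11.6), we see that (11.45) and (11.46) also give a solution to $\Delta u = 0$ on the exterior region $\mathbb{R}^n \setminus \overline{\Omega}$, satisfying $u = f$ on $\partial\Omega$ and $u(x) \to 0$ as $|x| \to \infty$, if $n \ge 3$. This solution is unique, by the maximum principle.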


The topological restriction on $\Omega$ in Proposition 11.5 can be circumvented by the device of altering the operator $\Delta$ outside $\overline{\Omega}$. For such an improvement, we return to the following setting,

(11.50)    $\overline{\Omega} \subset M$, $\partial\Omega$ smooth, $M$ compact, connected Riemannian manifold,

and set

(11.51)    $L = \Delta - V$, $V \in C^\infty(M)$, $V \ge 0$ on $M$, $V = 0$ on $\overline{\Omega}$, $V > 0$ somewhere on each component of $\mathcal{O} = M \setminus \overline{\Omega}$.

Then let $E = L^{-1}$, with integral kernel $E(x,y)$, and define $\mathcal{S}\ell$ and $\mathcal{D}\ell$ as before, using this integral kernel. We have the following.

Proposition 11.6. In the setting of (11.50)-(11.51), we continue to have that the operators $I + N$ and $S$ in (11.47) are isomorphisms.


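Proof. As in Proposition 11.5, it suffices to show that $I + N$ and $S$ are injective on $C^\infty(\partial\Omega)$. We tackle $I + N$. If $g \in C^\infty(\partial\Omega)$ and $(I + N)g = 0$, as in the proof of Proposition 11.5 we have $\mathcal{D}\ell\, g = 0$ on $\Omega$. Again, by (11.7), the jump of $\mathcal{D}\ell\, g$ across $\partial\Omega$ is $g$, so we have for $v = \mathcal{D}\ell\, g\big|_{\mathcal{O}}$,

(11.52)    $(\Delta - V)v = 0 \text{ on } \mathcal{O}, \quad v\big|_{\partial\Omega} = -g.$

Again, (11.40) implies $\partial_\nu \mathcal{D}\ell\, g$ does not jump across $\partial\Omega$, so $\partial_\nu v = 0$ on $\partial\Omega$. Now (11.52) plus Green's formula gives

(11.53)    $\int_{\mathcal{O}} |\nabla v|^2\, dV = -\int_{\mathcal{O}} (\Delta v)\,\overline{v}\, dV = -\int_{\mathcal{O}} V|v|^2\, dV,$

hence $v = \text{const} = 0$ on $\mathcal{O}$, so $g = 0$.

The proof that $S$ is injective involves a modification of the proof in Proposition 11.5, and we leave this to the reader.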


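Let us now consider the Neumann problem

(11.54)    $\Delta u = 0 \text{ on } \Omega, \quad \frac{\partial u}{\partial\nu} = \varphi \text{ on } \partial\Omega.$

We can relate (11.54) to (11.41) via the Neumann operator:

(11.55)    $\varphi = \mathcal{N}f.$

Let us assume that $\Omega$ is connected; then

(11.56)    $\operatorname{Ker} \mathcal{N} = \{f = \text{const. on } \partial\Omega\},$

so $\dim \operatorname{Ker} \mathcal{N} = 1$. Note that, by Green's theorem,

(11.57)    $(\mathcal{N}f, g)_{L^2(\partial\Omega)} = -(du, dv)_{L^2(\Omega)} = (f, \mathcal{N}g)_{L^2(\partial\Omega)},$

where $u = \mathrm{PI}\, f$, $v = \mathrm{PI}\, g$, so $\mathcal{N}$ is symmetric. In particular,

(11.58)    $(\mathcal{N}f, f)_{L^2(\partial\Omega)} = -\|du\|^2_{L^2(\Omega)},$

so $\mathcal{N}$ is negative-semidefinite. The symmetry of $\mathcal{N}$ together with its ellipticity implies that, for each $s$,

(11.59)    $\mathcal{N} : H^{s+1}(\partial\Omega) \longrightarrow H^s(\partial\Omega)$

is Fredholm, of index zero, with both $\operatorname{Ker} \mathcal{N}$ and $R(\mathcal{N})^\perp$ of dimension 1, and so

(11.60)    $R(\mathcal{N}) = \Bigl\{\varphi \in H^s(\partial\Omega) : \int_{\partial\Omega} \varphi\, dS = 0\Bigr\},$

this integral interpreted in the obvious distributional sense when $s < 0$.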
By (11.36), whenever $S$ is an isomorphism in (11.47), we can say that (11.55) is equivalent to

(11.61)    $(I - N)f = -2S\varphi.$

We can also represent a solution to (11.54) as a single-layer potential, of the form (11.45). Using (11.24), we see that this works provided $h$ satisfies

(11.62)    $(I - N^\#)h = -2\varphi.$

In view of the fact that (11.45) solves the Dirichlet problem (11.41) with $f = Sh$, we deduce the identity $\varphi = \mathcal{N}Sh$, or

(11.63)    $\mathcal{N}S = -\frac{1}{2}(I - N^\#),$

complementing (11.36). Comparing these identities, representing $S\mathcal{N}S$ in two ways, we obtain the intertwining relation

(11.64)    $SN^\# = NS.$

Recall that, under the reality and symmetry hypotheses on $E(x,y)$, we have $N^\# = N^*$, as noted in (11.26).

The method of layer potentials is applicable to other boundary problems. An application to the "Stokes system" will be given in Chap. 17, §A.


Layer potentials on non-smooth domains 


The material above dealt with domains $\Omega$ with $C^\infty$ boundary. There has been much work on the behavior of layer potentials on rougher domains, and implications for boundary problems in PDE, such as the Dirichlet problem and Neumann problem for the Laplace operator. There are fairly straightforward extensions in the cases where $\partial\Omega$ is a $C^{1+r}$ hypersurface in $\mathbb{R}^n$ (or more generally $M$). In such a case, operators like $N$ and $S$ can be analyzed in spaces of pseudodifferential operators of limited regularity, such as we will introduce in Chapter 13. Some such results are given in §3.4 of [T5]. It is shown there that if $\partial\Omega$ is of class $C^{1+r}$, $r \in (0,1)$, then

(11.65)    $I + N : H^s(\partial\Omega) \longrightarrow H^s(\partial\Omega), \quad S : H^s(\partial\Omega) \longrightarrow H^{1+s}(\partial\Omega), \quad |s| < r.$


Furthermore, if $\partial\Omega$ is of class $C^1$, with a modulus of continuity $\omega$ satisfying the Dini condition

(11.66)    $\int_0^1 \omega(t)\, t^{-1}\, dt < \infty,$

then

(11.67)    $I + N : L^2(\partial\Omega) \longrightarrow L^2(\partial\Omega), \quad S : L^2(\partial\Omega) \longrightarrow H^1(\partial\Omega).$

In fact, as seen in [T5], one has an extension from $L^2(\partial\Omega)$ to $L^p(\partial\Omega)$, and from $H^s(\partial\Omega)$ to the $L^p$-Sobolev space $H^{s,p}(\partial\Omega)$, for $1 < p < \infty$. (In this work, $L^p$-Sobolev spaces are introduced in Chapter 13.) For example, for $C^1$ boundaries satisfying (11.66), we have

(11.68)    $I + N : L^p(\partial\Omega) \longrightarrow L^p(\partial\Omega), \quad S : L^p(\partial\Omega) \longrightarrow H^{1,p}(\partial\Omega),$

for $1 < p < \infty$. In (11.65), (11.67), and (11.68), $N$ is compact, as it is in (11.47), so the Fredholmness of $I + N$ is automatic.

For rougher boundaries, matters become more difficult, starting with showing that (11.9), which we rewrite as

(11.69)    $Nf(x) = 2\, \mathrm{PV} \int_{\partial\Omega} f(y)\, \partial_{\nu_y} E(x,y)\, dS(y),$

is bounded on $L^2(\partial\Omega)$ and other important function spaces. Breakthroughs initiated in [Ca3] and [CMM] led to such boundedness results when $\Omega \subset \mathbb{R}^n$ is a Lipschitz domain, whose boundary is locally the graph of a Lipschitz function. In such a case, one has

(11.70)    $N : L^p(\partial\Omega) \longrightarrow L^p(\partial\Omega), \quad 1 < p < \infty.$

When $\partial\Omega$ is $C^1$, $N$ is even compact, though compactness here is much deeper than in the setting of $C^1$ boundary with (11.66) holding. This led to work on the Dirichlet problem on $C^1$ domains in $\mathbb{R}^n$ in [FJR].

For general Lipschitz domains, $N$ is not compact, and further techniques enter the picture. Isomorphism as in (11.67) was established in [Ver], making use of Rellich identities. (See Exercises 6-7 below.) This allowed for a treatment of the Dirichlet problem (11.41) on Lipschitz domains in $\mathbb{R}^n$. See [DK] for the Neumann problem and [JK] for nonhomogeneous extensions. The theory of layer potentials and applications to such boundary problems was developed in the setting of Lipschitz domains in Riemannian manifolds in [MT1] and [MT2]. To mention just one result, we have the following.


Proposition 11.7. If $\Omega$ is a Lipschitz domain in a compact Riemannian manifold $M$, then

(11.71)    $\mathrm{PI} : L^2(\partial\Omega) \longrightarrow H^{1/2}(\Omega).$

An exposition of some of this work is given in Chapter 4 of [T5].

A further breakthrough, initiated in [D] and [DS], extends the boundedness theory of layer potentials on $L^p(\partial\Omega)$ from Lipschitz domains to a class of domains known as uniformly rectifiable (UR) domains, defined as follows. Suppose $\Omega$ is an open set in a compact, $n$-dimensional Riemannian manifold $M$, with nonempty boundary $\partial\Omega$. We assume

(11.72)    $\mathcal{H}^{n-1}(\partial\Omega) < \infty, \quad \mathcal{H}^{n-1}(\partial\Omega \setminus \partial_*\Omega) = 0,$

where $\mathcal{H}^{n-1}$ denotes $(n-1)$-dimensional Hausdorff measure and $\partial_*\Omega$ is the measure-theoretic boundary, consisting of $p \in \partial\Omega$ such that the densities of both $\Omega$ and $M \setminus \Omega$ at $p$ are positive. We next assume that there exist $C_j \in (0,\infty)$ such that

(11.73)    $C_1 r^{n-1} \le \mathcal{H}^{n-1}(B_r(q) \cap \partial\Omega) \le C_2 r^{n-1}, \quad \forall\, q \in \partial\Omega,\ r \in (0,1),$

where $B_r(q)$ is the ball in $M$, centered at $q$, of radius $r$. (When (11.73) holds, we say $\partial\Omega$ is Ahlfors regular.) Finally, we assume $\partial\Omega$ has "big pieces of Lipschitz surfaces." In detail, there exist $\theta, L \in (0,\infty)$ such that for each $x \in \partial\Omega$, $R \in (0,1]$, there is a Lipschitz map

(11.74)    $\varphi : B_R^{n-1} \longrightarrow M, \quad B_R^{n-1} = \{x \in \mathbb{R}^{n-1} : |x| < R\},$

such that

(11.75)    $\|\nabla\varphi\|_{L^\infty} \le L, \quad \mathcal{H}^{n-1}\bigl(\partial\Omega \cap B_R(x) \cap \varphi(B_R^{n-1})\bigr) \ge \theta R^{n-1}.$

For one class of examples, if $\Omega \subset M$ and if $\partial\Omega$ is locally the graph of a function

(11.76)    $A : \mathbb{R}^{n-1} \longrightarrow \mathbb{R}, \quad \nabla A \in \mathrm{bmo}(\mathbb{R}^{n-1}),$

the space bmo denoting the local John-Nirenberg space (we say $\Omega$ is a $\mathrm{bmo}_1$ domain), then $\Omega$ is a UR domain. Results of [D] yield the $L^p$-bounds (11.70) when $\Omega$ is a UR domain in $\mathbb{R}^n$ and $\Delta$ the flat Laplacian. Such results in the setting that $\Omega$ is a UR domain in a compact Riemannian manifold are established in [HMT], which also proves the boundary trace result (11.7) in that setting.

The class of UR domains is a natural setting for another class of layer potentials, called Cauchy transforms. In [MMT] there is a study of Cauchy transforms and Calderón projectors on UR domains, with applications to Toeplitz operators acting on functions on the boundaries of such domains. This is developed further in [MMT2], into a study of multidimensional Riemann-Hilbert problems on domains with uniformly rectifiable interfaces.

An important subclass of $\mathrm{bmo}_1$ domains consists of $\mathrm{vmo}_1$ domains, whose boundaries are locally graphs of

(11.77)    $A : \mathbb{R}^{n-1} \longrightarrow \mathbb{R}, \quad \nabla A \in \mathrm{vmo}(\mathbb{R}^{n-1}),$

where $\mathrm{vmo}(\mathbb{R}^{n-1})$ is the closure of the class of uniformly continuous functions in $\mathrm{bmo}(\mathbb{R}^{n-1})$. It was shown in [Hof] that, if $\Omega \subset \mathbb{R}^n$ is a $\mathrm{vmo}_1$ domain and $\Delta$ the flat Laplacian, then $N$ in (11.70) is compact, for $1 < p < \infty$. The paper [HMT] extends this compactness result to a larger class of UR domains, called there regular SKT domains, named after seminal work of [Sem] and [KT]. The paper [HMT] applied layer potential methods to a number of boundary problems for the Laplace operator and other elliptic operators on regular SKT domains. Further work on this, including natural boundary problems for the Hodge Laplacian, is covered in [M3T]. See also the treatise [MMM] for an expansive treatment of analytical issues related to the theory of layer potentials.


Exercises 


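1. Let $M$ be a compact, connected Riemannian manifold, with Laplace operator $L$, and let $\overline{\Omega} = [0,1] \times M$, with Laplace operator $\Delta = \partial^2/\partial y^2 + L$, $y \in [0,1]$. Show that the Dirichlet problem

$\Delta u = 0 \text{ on } \Omega, \quad u(0,x) = f_0(x), \quad u(1,x) = f_1(x)$

has the solution

$u(y,x) = e^{-y\sqrt{-L}}\varphi_0 + e^{-(1-y)\sqrt{-L}}\varphi_1 + \kappa y,$

where $\kappa$ is the constant $\kappa = (\operatorname{vol} M)^{-1}\int_M (f_1 - f_0)\, dV$, and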

yo =(1-e VY") "(fo—e Y fi — fi); 

plas) =e fa), 


the operator $(1 - e^{-2\sqrt{-L}})^{-1}$ being well defined on $(\ker L)^\perp$.

2. If $\mathcal{N}f_0(x) = (\partial u/\partial y)(0,x)$, where $u$ is as above, with $f_1 = 0$, show that

$\mathcal{N}f_0 = -\sqrt{-L}\, f_0 + Rf_0,$

where $R$ is a smoothing operator, $R : \mathcal{D}'(M) \to C^\infty(M)$. Using (11.37), deduce that these calculations imply

$\sqrt{-L} \in OPS^1(M).$

Compare Exercises 4-6 of §10.
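3. If $\mathrm{PI} : C^\infty(\partial\Omega) \to C^\infty(\overline{\Omega})$ is the Poisson integral operator solving (11.41), show that, for $x \in \Omega$,

$\mathrm{PI}\, f(x) = \int_{\partial\Omega} k(x,y)\, f(y)\, dS(y),$

with

$|k(x,y)| \le C\bigl(d(x,y)^2 + \rho(x)^2\bigr)^{-n/2},$

where $n = \dim \Omega$, $d(x,y)$ is the distance from $x$ to $y$, and $\rho(x)$ is the distance from $x$ to $\partial\Omega$.

4. If $M$ is an $(n-1)$-dimensional surface with boundary in $\overline{\Omega}$, intersecting $\partial\Omega$ transversally, with $\partial M \subset \partial\Omega$, and $\rho : C^\infty(\overline{\Omega}) \to C^\infty(M)$ is restriction to $M$, show that

$\rho \circ \mathrm{PI} : L^2(\partial\Omega) \longrightarrow L^2(M).$

(Hint: Look at Exercise 2 in §5 of Appendix A on functional analysis.)

5. Given $y \in \Omega$, let $G_y$ be the "Green function," satisfying

$\Delta G_y = \delta_y, \quad G_y = 0 \text{ on } \partial\Omega.$

Show that, for $f \in C^\infty(\partial\Omega)$,

$\mathrm{PI}\, f(y) = \int_{\partial\Omega} f(x)\, \partial_\nu G_y(x)\, dS(x).$

(Hint: Apply Green's formula to $(\mathrm{PI}\, f, \Delta G_y) = (\mathrm{PI}\, f, \Delta G_y) - (\Delta\, \mathrm{PI}\, f, G_y)$.)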
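6. Assume $u$ is scalar, $\Delta u = f$, and $w$ is a vector field on $\overline{\Omega}$. Show that

(11.78)    $\int_{\partial\Omega} \langle\nu, w\rangle |\nabla u|^2\, dS = 2\int_{\partial\Omega} (\nabla_w u)(\partial_\nu u)\, dS - 2\int_\Omega (\nabla_w u) f\, dV + \int_\Omega (\operatorname{div} w)|\nabla u|^2\, dV - 2\int_\Omega (\mathcal{L}_w g)(\nabla u, \nabla u)\, dV,$

where $g$ is the metric tensor on $\Omega$. This identity is a "Rellich formula."
(Hint: Compute $\operatorname{div}(\langle\nabla u, \nabla u\rangle w)$ and $2\operatorname{div}(\nabla_w u \cdot \nabla u)$, and apply the divergence theorem to the difference.)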

7. In the setting of Exercise 6, assume $w$ is a unit vector field and that $\langle\nu, w\rangle \ge a > 0$ on $\partial\Omega$. Deduce that


$f \wul? as < 2 [au as+ f is? av 
foe} an 2 


+: [{iaiw w| + 2|Def w| + U\[Vul? dV. 
Q 


(11.79) 


When $\Delta u = f = 0$, compare implications of (11.79) with implications of (11.37). See [Ver] for applications of Rellich's formula to analysis on Lipschitz domains $\Omega \subset \mathbb{R}^n$, and [MT1] for Lipschitz domains in a Riemannian manifold.

8. What happens if, in Proposition 11.5, you allow $\mathcal{O} = \mathbb{R}^n \setminus \overline{\Omega}$ to have several connected components? Can you show that one of the operators in (11.47) is still an isomorphism?

9. Calculate $q(x_n, x', \xi')$ in (11.13) when $p(x,\xi) = \xi_n|\xi|^{-2}$. Relate this to the results (11.7) and (11.24) for $\mathcal{D}\ell\, f_\pm$ and $\partial_\nu \mathcal{S}\ell\, f_\pm$. (Hint. The calculation involves $\int_{-\infty}^\infty (1 + \zeta^2)^{-1} e^{i\zeta\tau}\, d\zeta = \pi e^{-|\tau|}$.)


12. Parametrix for regular elliptic boundary problems 


Here we shall complement material on regular boundary problems for elliptic operators developed in §11 of Chap. 5, including in particular results promised after the statement of Proposition 11.16 in that chapter.

Suppose $P$ is an elliptic differential operator of order $m$ on a compact manifold $M$ with boundary, with boundary operators $B_j$ of order $m_j$, $1 \le j \le \ell$, satisfying the regularity conditions given in §11 of Chap. 5. In order to construct a parametrix for the solution to $Pu = f$, $B_j u\big|_{\partial M} = g_j$, we will use pseudodifferential operator calculus to manipulate $P$ in ways that constant-coefficient operators $P(D)$ were manipulated in that section. To start, we choose a collar neighborhood $\mathcal{C}$ of $\partial M$, $\mathcal{C} \approx [0,1] \times \partial M$; use coordinates $(y,x)$, $y \in [0,1]$, $x \in \partial M$; and without loss of generality, consider


(12.1)    $Pu = \frac{\partial^m u}{\partial y^m} + \sum_{j=0}^{m-1} A_j(y,x,D_x)\, \frac{\partial^j u}{\partial y^j},$

the order of $A_j(y,x,D_x)$ being $\le m - j$. We convert $Pu = f$ to a first-order system using $v = (v_1, \dots, v_m)^t$, with

(12.2)    $v_1 = \Lambda^{m-1}u, \ \dots, \ v_j = \partial_y^{j-1}\Lambda^{m-j}u, \ \dots, \ v_m = \partial_y^{m-1}u,$

as in (11.42) of Chap. 5. Here, $\Lambda$ can be taken to be any elliptic, invertible operator in $OPS^1(\partial M)$, with principal symbol $|\xi|$ (with respect to some Riemannian metric put on $\partial M$). Then $Pu = f$ becomes, on $\mathcal{C}$, the system

(12.3)    $\frac{\partial v}{\partial y} = K(y,x,D_x)v + F,$

where $F = (0, \dots, 0, f)^t$ and


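(12.4)    $K = \begin{pmatrix} 0 & \Lambda & & & \\ & 0 & \Lambda & & \\ & & \ddots & \ddots & \\ & & & 0 & \Lambda \\ C_0 & C_1 & C_2 & \cdots & C_{m-1} \end{pmatrix},$

where

(12.5)    $C_j(y,x,D_x) = -A_j(y,x,D_x)\Lambda^{1-(m-j)}$

is a smooth family of operators in $OPS^1(\partial M)$, with $y$ as a parameter. As in Lemma 11.3 of Chap. 5, we have that $P$ is elliptic if and only if, for all $(x,\xi) \in T^*\partial M \setminus 0$, the principal symbol $K_1(y,x,\xi)$ has no purely imaginary eigenvalues.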

We also rewrite the boundary conditions $B_j u = g_j$ at $y = 0$. If

(12.6)    $B_j = \sum_{k \le m_j} b_{jk}(x,D_x)\, \frac{\partial^k}{\partial y^k}$

at $y = 0$, then we have for $v_j$ the boundary conditions

(12.7)    $\sum_{k \le m_j} \tilde{b}_{jk}(x,D_x)\, \Lambda^{k-m_j}\, v_{k+1}(0) = \Lambda^{m-m_j-1} g_j = h_j, \quad 1 \le j \le \ell,$

where $\tilde{b}_{jk}(x,D)$ has the same principal symbol as $b_{jk}(x,D)$. We write this as

(12.8)    $B(x,D_x)v(0) = h, \quad B(x,D_x) \in OPS^0(\partial M).$

We will construct a parametrix for the solution of (12.3), (12.8), with $F = 0$.

Generalizing (11.57) of Chap. 5, we construct $E_0(y,x,\xi)$ for $(x,\xi) \in T^*\partial M \setminus 0$, the projection onto the sum of the generalized eigenspaces of $K_1(y,x,\xi)$ corresponding to eigenvalues of positive real part, annihilating the other generalized eigenspaces, in the form

(12.9)    $E_0(y,x,\xi) = \frac{1}{2\pi i}\int_\gamma \bigl(\zeta - K_1(y,x,\xi)\bigr)^{-1}\, d\zeta,$

where $\gamma = \gamma(y,x,\xi)$ is a curve in the right half-plane of $\mathbb{C}$, encircling all the eigenvalues of $K_1(y,x,\xi)$ of positive real part. Then $E_0(y,x,\xi)$ is homogeneous of degree 0 in $\xi$, so it is the principal symbol of a family of operators in $OPS^0(\partial M)$.
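The contour formula (12.9) can be tried on a fixed matrix. The following is a minimal sketch, assuming a hypothetical $4 \times 4$ matrix `K` standing in for $K_1(y,x,\xi)$ at one point $(y,x,\xi)$, and a circular contour in the right half-plane discretized by the trapezoid rule; the names `spectral_projection`, `K`, and `E0` are illustrative.

```python
import numpy as np

def spectral_projection(K, center, radius, n_nodes=200):
    """Trapezoid-rule approximation of (1/(2*pi*i)) times the integral of
    (zeta - K)^{-1} d zeta over the circle |zeta - center| = radius."""
    m = K.shape[0]
    total = np.zeros((m, m), dtype=complex)
    for t in 2.0 * np.pi * np.arange(n_nodes) / n_nodes:
        zeta = center + radius * np.exp(1j * t)
        dzeta = 1j * radius * np.exp(1j * t) * (2.0 * np.pi / n_nodes)
        total += np.linalg.inv(zeta * np.eye(m) - K) * dzeta
    return total / (2j * np.pi)

# Hypothetical stand-in for K_1: eigenvalues 1, 2 (positive real part)
# and -1, -3 (negative real part), none purely imaginary.
K = np.array([[1.0, 2.0, 0.0, 1.0],
              [0.0, 2.0, 1.0, 0.0],
              [0.0, 0.0, -1.0, 3.0],
              [0.0, 0.0, 0.0, -3.0]])

# Circle centered at 1.5 with radius 1: lies in Re zeta > 0, encloses 1 and 2.
E0 = spectral_projection(K, center=1.5, radius=1.0)
rank_E0 = np.linalg.matrix_rank(E0, tol=1e-8)
```

In this sketch `E0` should be idempotent, commute with `K`, and have rank equal to the number of eigenvalues enclosed by the contour, so its kernel has the complementary dimension.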

Recall the statement of Proposition 11.9 of Chap. 5 on the regularity condition for $(P, B_j, 1 \le j \le \ell)$. One characterization is that, for $(x,\xi) \in T^*\partial M \setminus 0$,

(12.10)    $B_0(x,\xi) : V(x,\xi) \longrightarrow \mathbb{C}^\ell$ isomorphically,

where $V(x,\xi) = \ker E_0(0,x,\xi)$, and $B_0(x,\xi) : \mathbb{C}^m \to \mathbb{C}^\ell$ is the principal symbol of $B(x,D_x)$. Another, equivalent characterization is that, for any $\eta \in \mathbb{C}^\ell$, $(x,\xi) \in T^*\partial M \setminus 0$, there exists a unique bounded solution on $y \in [0,\infty)$ to the ODE

(12.11)    $\frac{\partial\varphi}{\partial y} - K_1(0,x,\xi)\varphi = 0, \quad B_0(x,\xi)\varphi(0) = \eta.$

In that case, of course, $\varphi(0) = \varphi(0,x,\xi)$ belongs to $V(x,\xi)$, so $\varphi(y,x,\xi)$ is actually exponentially decreasing as $y \to +\infty$, for fixed $(x,\xi)$, and it is exponentially decreasing as $|\xi| \to \infty$, for fixed $y > 0$, $x \in \partial M$.

On a conic neighborhood $\Gamma$ of any $(x_0, \xi_0) \in T^*\partial M \setminus 0$, one can construct $U_0(y,x,\xi)$ smooth and homogeneous of degree 0 in $\xi$, so that

(12.12)    $U_0 K_1 U_0^{-1} = \begin{pmatrix} E_1 & 0 \\ 0 & F_1 \end{pmatrix},$

where $E_1(y,x,\xi)$ has eigenvalues all in $\operatorname{Re} \zeta < 0$ and $F_1$ has all its eigenvalues in $\operatorname{Re} \zeta > 0$. If we set $w = U_0(y,x,D)v$, then the equation $\partial v/\partial y = K(y,x,D_x)v$ is transformed to

(12.13)    $\frac{\partial w}{\partial y} = \begin{pmatrix} E & 0 \\ 0 & F \end{pmatrix} w + Aw = Gw + Aw,$

where $E(y,x,D_x)$ and $F(y,x,D_x)$ have $E_1$ and $F_1$ as their principal symbols, respectively, and $A(y,x,D_x)$ is a smooth family of operators in the space $OPS^0(\partial M)$.

We want to decouple this equation more completely into two pieces. The next step is to decouple terms of order zero. Let $w^{(1)} = (I + V_1)w$, with $V_1 \in OPS^{-1}$ to be determined. We have

(12.14)    $\frac{\partial w^{(1)}}{\partial y} = (I + V_1)G(I + V_1)^{-1}w^{(1)} + (I + V_1)A(I + V_1)^{-1}w^{(1)} + \cdots = Gw^{(1)} + (V_1 G - GV_1 + A)w^{(1)} + \cdots,$

where the remainder involves terms of order at most $-1$ operating on $w^{(1)}$. We would like to pick $V_1$ so that the off-diagonal terms of $V_1 G - GV_1 + A$ vanish. We require $V_1$ to be of the form

$V_1 = \begin{pmatrix} 0 & V_{12} \\ V_{21} & 0 \end{pmatrix}.$

If $A$ is put into $2 \times 2$ block form with entries $A_{jk}$, we are led to require that (on the symbol level)

(12.15)    $V_{12}F_1 - E_1 V_{12} = -A_{12}, \quad V_{21}E_1 - F_1 V_{21} = -A_{21}.$

That we have unique solutions $V_{jk}(y,x,\xi)$ (homogeneous of degree $-1$ in $\xi$) is a consequence of the following lemma.


Lemma 12.1. Let $F \in M_{\nu\times\nu}$, the set of $\nu \times \nu$ matrices, and $E \in M_{\mu\times\mu}$. Define $\psi : M_{\mu\times\nu} \to M_{\mu\times\nu}$ by

$\psi(T) = TF - ET.$

Then $\psi$ is bijective, provided $E$ and $F$ have disjoint spectra.

Proof. In fact, if $\{f_j\}$ are the eigenvalues of $F$ and $\{e_k\}$ those of $E$, it is easily seen that the eigenvalues of $\psi$ are $\{f_j - e_k\}$.
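Lemma 12.1 and the first equation in (12.15) can be illustrated numerically. The following is a minimal sketch, assuming hypothetical constant matrices `E`, `F`, `A12` in place of the symbols $E_1$, $F_1$, $A_{12}$ at one point; $\psi$ is represented as a matrix acting on the row-major vectorization of $T$ via Kronecker products. The helper names `psi_matrix` and `solve_offdiagonal` are illustrative.

```python
import numpy as np

def psi_matrix(E, F):
    """Matrix of psi(T) = T F - E T acting on T.reshape(-1) (row-major).
    Uses vec(A X B) = (A kron B^T) vec(X) for row-major vec."""
    mu, nu = E.shape[0], F.shape[0]
    return np.kron(np.eye(mu), F.T) - np.kron(E, np.eye(nu))

def solve_offdiagonal(E, F, A12):
    """Solve T F - E T = -A12 for T, as in the first equation of (12.15)."""
    mu, nu = E.shape[0], F.shape[0]
    t = np.linalg.solve(psi_matrix(E, F), -A12.reshape(-1))
    return t.reshape(mu, nu)

# Hypothetical data: spec E = {-1, -3}, spec F = {2, 5, 9}, disjoint spectra.
E = np.array([[-1.0, 2.0],
              [0.0, -3.0]])
F = np.array([[2.0, 1.0, 0.0],
              [0.0, 5.0, 1.0],
              [0.0, 0.0, 9.0]])
A12 = np.array([[1.0, 0.0, 2.0],
                [3.0, -1.0, 1.0]])

T = solve_offdiagonal(E, F, A12)
psi_eigs = np.sort(np.linalg.eigvals(psi_matrix(E, F)).real)
expected = np.sort([f - e for f in (2.0, 5.0, 9.0) for e in (-1.0, -3.0)])
```

Here `psi_eigs` should agree with the differences $f_j - e_k$, none of which vanish, and `T` plays the role of $V_{12}$ for this choice of data.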


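Thus we obtain solutions $V_{12}$ and $V_{21}$ to (12.15). With such a choice of the symbol of $V_1$, we have

(12.16)    $\frac{\partial w^{(1)}}{\partial y} = Gw^{(1)} + \begin{pmatrix} A_1 & 0 \\ 0 & A_2 \end{pmatrix} w^{(1)} + Bw^{(1)},$

with $B \in OPS^{-1}$. To decouple the part of order $-1$, we try $w^{(2)} = (I + V_2)w^{(1)}$ with $V_2 \in OPS^{-2}$. We get

(12.17)    $\frac{\partial w^{(2)}}{\partial y} = Gw^{(2)} + \begin{pmatrix} A_1 & 0 \\ 0 & A_2 \end{pmatrix} w^{(2)} + (V_2 G - GV_2 + B)w^{(2)} + \cdots,$

so we want to choose $V_2$ so that, on the symbol level, the off-diagonal terms of $V_2 G - GV_2 + B$ vanish. This is the problem solved above, so we are in good shape.

From here we continue, defining $w^{(j)} = (I + V_j)w^{(j-1)}$ with $V_j \in OPS^{-j}$, decoupling further out along the line. Letting $w = (I + V)v$, with

(12.18)    $I + V \sim \cdots (I + V_3)(I + V_2)(I + V_1), \quad V \in OPS^{-1},$

we have

(12.19)    $\frac{\partial w}{\partial y} = \begin{pmatrix} E' & 0 \\ 0 & F' \end{pmatrix} w, \quad \text{mod } C^\infty,$

with $E' = E$, $F' = F$ mod $OPS^0$. The system (12.3) is now completely decoupled.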

We now concentrate on constructing a parametrix for an "elliptic evolution equation"

(12.20)    $\frac{\partial u}{\partial y} = E(y,x,D_x)u, \quad u(0) = f,$

where $E$ is a $k \times k$ system of first-order pseudodifferential operators, whose principal symbol satisfies

(12.21)    $\operatorname{spec} E_1(y,x,\xi) \subset \{\zeta \in \mathbb{C} : \operatorname{Re} \zeta \le -C_0|\xi| < 0\}, \quad \xi \ne 0,$

for some $C_0 > 0$. We look for the parametrix in the form (in local coordinates on $\partial M$)

(12.22)    $u(y) = \int A(y,x,\xi)\, e^{ix\cdot\xi}\, \hat{f}(\xi)\, d\xi,$

with $A(y,x,\xi)$ in the form

(12.23)    $A(y,x,\xi) \sim \sum_{j \ge 0} A_j(y,x,\xi),$

and the $A_j(y,x,\xi)$ constructed inductively. We aim to obtain $A(y,x,\xi)$ bounded in $S^0_{1,0}$, for $y \in [0,1]$, among other things. In such a case,

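(12.24)    $\Bigl(\frac{\partial}{\partial y} - E\Bigr)u = \int \Bigl(\frac{\partial A}{\partial y} - L(y,x,\xi)\Bigr)\, e^{ix\cdot\xi}\, \hat{f}(\xi)\, d\xi,$

where

(12.25)    $L(y,x,\xi) \sim \sum_{\alpha \ge 0} \frac{1}{\alpha!}\, E^{(\alpha)}(y,x,\xi)\, A_{(\alpha)}(y,x,\xi).$

We define $A_0(y,x,\xi)$ by the "transport equation"

(12.26)    $\frac{\partial}{\partial y} A_0(y,x,\xi) = E(y,x,\xi)A_0(y,x,\xi), \quad A_0(0,x,\xi) = I.$

If $E$ is independent of $y$, the solution is

$A_0(y,x,\xi) = e^{yE(x,\xi)}.$

In general, $A_0(y,x,\xi)$ shares with this example the following important properties.

Lemma 12.2. For $y \in [0,1]$, $k, \ell = 0, 1, 2, \dots$, we have

(12.27)    $y^k D_y^\ell A_0(y,x,\xi)$ bounded in $S^{-k+\ell}_{1,0}$.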


Proof. We can take $C_2 \in (0, C_0)$ and $M$ large, so that $E(y, x, \xi)$ has spectrum in
the half-space $\operatorname{Re}\zeta \le -C_2|\xi|$, for $|\xi| \ge M$. Fixing $K \in (0, C_2)$, if $S(y, \sigma, x, \xi)$
is the solution operator to $\partial B/\partial y = E(y, x, \xi)B$, taking $B(\sigma, x, \xi)$ to $B(y, x, \xi)$,
then, for $y > \sigma$,

(12.28)   $|S(y, \sigma, x, \xi)B| \le C e^{-K(y-\sigma)|\xi|}\,|B|, \quad \text{for } |\xi| \ge M.$

It follows that, for $y \in [0, 1]$,

(12.29)   $|A_0(y, x, \xi)| \le C e^{-Ky|\xi|},$

which implies

$|y^k A_0(y, x, \xi)| \le C_k \langle\xi\rangle^{-k} e^{-Ky|\xi|/2}.$




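Now $A_{0j} = \partial A_0/\partial\xi_j$ satisfies

$\dfrac{\partial}{\partial y} A_{0j} = E(y, x, \xi) A_{0j} + \dfrac{\partial E}{\partial \xi_j}(y, x, \xi) A_0, \quad A_{0j}(0, x, \xi) = 0,$

so

(12.30)   $A_{0j}(y, x, \xi) = \displaystyle\int_0^y S(y, \sigma, x, \xi)\, \dfrac{\partial E}{\partial \xi_j}(\sigma, x, \xi)\, A_0(\sigma, x, \xi)\, d\sigma,$

which in concert with (12.28) and (12.29) yields

(12.31)   $\Bigl|\dfrac{\partial}{\partial\xi_j} A_0(y, x, \xi)\Bigr| \le C y\, e^{-Ky|\xi|} \le C \langle\xi\rangle^{-1} e^{-Ky|\xi|/2}.$

Inductively, one obtains estimates on $D_x^\beta D_\xi^\alpha A_0(y, x, \xi)$ leading to the $\ell = 0$ case
of (12.27), and then use of (12.26) and induction on $\ell$ give (12.27) in general.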
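
The passage from (12.29) to the bound on $y^k A_0$ rests on the elementary fact that $r^k e^{-Kr/2}$ is bounded for $r = y|\xi| \ge 0$. The following sketch checks this numerically in the scalar model case; the function names and sampling parameters are illustrative assumptions, not part of the text.

```python
import math


def weighted_decay_sup(k, kappa, samples=20000, r_max=200.0):
    """Estimate sup over r in [0, r_max] of r**k * exp(-kappa * r / 2) on a grid."""
    best = 0.0
    for i in range(samples + 1):
        r = r_max * i / samples
        best = max(best, r**k * math.exp(-kappa * r / 2))
    return best


def closed_form_sup(k, kappa):
    """Exact supremum over r >= 0, attained at r = 2k/kappa."""
    return 1.0 if k == 0 else (2 * k / (kappa * math.e)) ** k


if __name__ == "__main__":
    for k in range(5):
        print(k, weighted_decay_sup(k, 1.0), closed_form_sup(k, 1.0))
```

With $C_k$ equal to this supremum, $y^k e^{-Ky|\xi|} = |\xi|^{-k}\,(y|\xi|)^k e^{-Ky|\xi|/2}\, e^{-Ky|\xi|/2} \le C_k |\xi|^{-k} e^{-Ky|\xi|/2}$, which is the scalar version of the estimate used in the proof.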


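For $j \ge 1$, we define $A_j(y, x, \xi)$ inductively by

(12.32)   $\dfrac{\partial A_j}{\partial y} = E(y, x, \xi) A_j(y, x, \xi) + R_j(y, x, \xi), \quad A_j(0, x, \xi) = 0,$

where

(12.33)   $R_j(y, x, \xi) = \displaystyle\sum_{\ell < j,\ \ell + |\alpha| = j} \dfrac{1}{\alpha!}\, E^{(\alpha)}(y, x, \xi)\, A_{\ell(\alpha)}(y, x, \xi).$

Then, if, as above, $S(y, \sigma, x, \xi)$ is the solution operator to the equation $\partial B/\partial y =
E(y, x, \xi)B$, we have

(12.34)   $A_j(y, x, \xi) = \displaystyle\int_0^y S(y, \sigma, x, \xi)\, R_j(\sigma, x, \xi)\, d\sigma.$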


The arguments used to prove Lemma 12.2 also establish the following result. 


Lemma 12.3. For $y \in [0, 1]$, $k, \ell = 0, 1, 2, \ldots$, $j \ge 1$, we have

(12.35)   $y^k D_y^\ell A_j(y, x, \xi)$ bounded in $S^{-j-k+\ell}_{1,0}$.


A symbol satisfying the condition (12.35) will be said to belong to $\mathcal{P}^{-j}$. In fact,
it is convenient to use the following stronger property possessed by the symbols
$A_j(y, x, \xi)$, for $j \ge 0$. Given the hypothesis (12.21) on $\operatorname{spec} E_1(y, x, \xi)$, let $0 <
C_1 < C_0$. Then

(12.36)   $A_j(y, x, \xi) = B_j(y, x, \xi)\, e^{-C_1 y \langle\xi\rangle}, \quad \text{with } B_j(y, x, \xi) \in \mathcal{P}^{-j}.$




We will say $A_j(y, x, \xi) \in \mathcal{P}_e^{-j}$ if this holds or, more generally, if it holds modulo
a smooth family of symbols $S(y) \in S^{-\infty}$, $y \in [0, 1]$. The associated families of
operators will be denoted $OP\mathcal{P}^{-j}$ and $OP\mathcal{P}_e^{-j}$, respectively.

Operators formed from such symbols have the following mapping property, 
recapturing the Sobolev space regularity established for solutions to regular ellip- 
tic boundary problems in Chap. 5. 


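Proposition 12.4. If $A = A(y, x, D_x)$ has symbol

$A(y, x, \xi) = B(y, x, \xi)\, e^{-C_1 y\langle\xi\rangle}, \quad B(y, x, \xi) \in \mathcal{P}^{-j},$

then, for $s \ge -j - 1/2$,

(12.37)   $A : H^s(\partial M) \longrightarrow H^{s+j+1/2}(I \times \partial M).$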


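Proof. First consider the case $s = -1/2$, $j = 0$. As $B(y, x, D_x)$ is bounded in
$\mathcal{L}(L^2(\partial M))$ for $y \in [0, 1]$, we have, for $f \in H^{-1/2}(\partial M)$,

$\displaystyle\int_0^1 \|A(y)f\|^2_{L^2(\partial M)}\, dy \le C \int_0^1 \|e^{-C_1 y\Lambda} f\|^2_{L^2(\partial M)}\, dy = C_2\|\Lambda^{-1/2} f\|^2_{L^2(\partial M)} - C_2\|e^{-C_1\Lambda}\Lambda^{-1/2} f\|^2_{L^2(\partial M)},$

with $C_2 = C/(2C_1)$, since

$\dfrac{d}{dy}\,\|e^{-C_1 y\Lambda}\Lambda^{-1/2} f\|^2_{L^2(\partial M)} = -2C_1\,\|e^{-C_1 y\Lambda} f\|^2_{L^2(\partial M)}.$

This proves (12.37) in this case. The extension to $s = k - 1/2$ ($k = 1, 2, \ldots$),
$j = 0$ is straightforward, and then the result for general $s \ge -1/2$, $j = 0$ follows
by interpolation. The case of general $j$ is reduced to that of $j = 0$ by forming
$A(y, x, \xi)\langle\xi\rangle^{j}$. One can take any $j \in \mathbb{R}$.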


Having constructed operators with symbols in $\mathcal{P}_e^0$ as parametrices of (12.20),
we now complete the construction of parametrices for the system (12.3), (12.8),
when the regularity condition (12.10) holds. Using a partition of unity, write $h$ as
a sum $\sum h_j$, each term of which has wave front set in a conic set $\Gamma_j$ on which the
decoupling procedure (12.12) can be implemented. We drop the subscript $j$ and
just call the term $h$. Then, we construct a parametrix for $w = (I + V)U_0 v$, so that
$w$ solves (12.19), with $w(0) = (f, 0)^t$. Set $U = (I + V)U_0$, and let $U^{-1}$ denote
a parametrix of $U$. The solution $w(y)$ takes the form $w(y) = (w_1(y), 0)$, with

(12.38)   $w_1(y) = A_1(y, x, D_x)f, \quad A_1(y, x, \xi) \in \mathcal{P}_e^0,$


using the construction (12.22)-(12.34). Note that $v(0) = U^{-1}(f, 0)^t = U^{-1}J_1 f$,
where here and below we set $J_1 f = (f, 0)^t$. Then

(12.39)   $Bv(0) = BU^{-1}J_1 f,$




so the boundary condition (12.8) is achieved (mod $C^\infty$) provided $f$ satisfies (mod
$C^\infty$)

(12.40)   $BU^{-1}J_1 f = h.$

The regularity condition (12.10) is precisely the condition that $BU^{-1}J_1$ is an
elliptic $\lambda \times \lambda$ system, in $OPS^0(\partial M)$. Letting $Q \in OPS^0(\partial M)$ be a parametrix,
we obtain

(12.41)   $v(y) = U(y)^{-1} J_1 A_1(y) Q h = A^\#(y)h.$


Recall that $Q \in OPS^0(\partial M)$, $U(y)^{-1}$ is a smooth family of operators in
$OPS^0(\partial M)$, and $A_1(y) \in OP\mathcal{P}_e^0$. We can then say the following about the
composition $A^\#(y) = A^\#(y, x, D_x)$.


Lemma 12.5. Given $P_j(y)$, smooth families in $OPS^{m_j}(\partial M)$, and $A(y) \in
OP\mathcal{P}_e^\mu$, we have

(12.42)   $P_1(y)A(y)P_2(y) = B(y) \in OP\mathcal{P}_e^{\mu + m_1 + m_2}.$

The proof is a straightforward application of the results on products from §3.
Consequently, we have a solution mod $C^\infty$ to (12.3), (12.8), constructed in the
form $v(y) = A^\#(y)h$, with $A^\#(y) \in OP\mathcal{P}_e^0$. Finally, returning to the boundary
problem for $P$, we have:


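Theorem 12.6. If $(P, B_j, 1 \le j \le \ell)$ is a regular elliptic boundary problem, then
a parametrix (i.e., a solution mod $C^\infty$) for

(12.43)   $Pu = 0 \text{ on } M, \quad B_j u = g_j \text{ on } \partial M$

is constructed in the form

(12.44)   $u = \displaystyle\sum_{j=1}^{\ell} Q_j g_j,$

where $Q_j g_j$ is $C^\infty$ on the interior of $M$, and, on a collar neighborhood $\mathcal{C} =
[0, 1] \times \partial M$,

(12.45)   $Q_j g_j = Q_j(y) g_j, \quad Q_j(y) \in OP\mathcal{P}_e^{-m_j}.$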


Recall that $m_j$ is the order of $B_j$. Here, the meaning of solution mod $C^\infty$ to
(12.43) is that if $u^\#$ is given by (12.44), then

(12.46)   $Pu^\# \in C^\infty(M), \quad B_j u^\# - g_j \in C^\infty(\partial M).$




Of course, the regularity results of Chap. 5 imply that if $u$ is a genuine solution to
(12.43), then $u - u^\# \in C^\infty(M)$.
The following is an easy route to localizing boundary regularity results. 


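Proposition 12.7. Take $A(y, x, \xi) \in \mathcal{P}^{-j}$. Let $\varphi, \psi \in C^\infty(\partial M)$, and assume
their supports are disjoint. Then

(12.47)   $f \in \mathcal{D}'(\partial M) \Longrightarrow \varphi A(y, x, D)\psi f \in C^\infty([0, 1] \times \partial M).$

Proof. Symbol calculus gives

$\varphi A(y, x, D)\psi \in \mathcal{P}^{-k}, \quad \forall\, k \ge 0.$

Hence this is a smooth family of elements of $OPS^{-\infty}(\partial M)$. This readily gives
(12.47).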


Proposition 12.7 immediately gives the following. 


Corollary 12.8. In the setting of Theorem 12.6, if $\mathcal{O} \subset \partial M$ is open and $g_j \in
C^\infty(\mathcal{O})$ for each $j$, then $u \in C^\infty$ on a neighborhood in $M$ of $\mathcal{O}$.


Exercises 


1. Suppose $A(y) \in OP\mathcal{P}^m$. Show that

(12.48)   $D_y^j A(y)\big|_{y=0} = Q_j, \quad Q_j \in OPS^{m+j}_{1,0}(\partial M).$

If $A(y) \in OP\mathcal{P}_e^0$ is given by the construction (12.24)-(12.34), show that $Q_j \in
OPS^j(\partial M)$.

2. Applying the construction of this section to the Dirichlet problem for $\Delta$ on $M$, show
that the Neumann operator $\mathcal{N}$, defined by (11.32)-(11.33), satisfies

(12.49)   $\mathcal{N} \in OPS^1(\partial M),$


thus providing a proof different from that used in (11.37). 
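3. Show that $A(y, x, \xi)$ belongs to $\mathcal{P}_e^m$ if and only if, for some $\varepsilon > 0$ and all $N < \infty$,

(12.50)   $|D_y^j D_x^\beta D_\xi^\alpha A(y, x, \xi)| \le C_{\alpha\beta j}\, e^{-\varepsilon y|\xi|}\langle\xi\rangle^{m+j-|\alpha|} + C_{N\alpha\beta j}\langle\xi\rangle^{-N}.$

4. If $A(y, x, \xi) \in \mathcal{P}_e^m$, show that, for some $\kappa > 0$, you can write

(12.51)   $A(y, x, D) = e^{-\kappa y\Lambda} B(y, x, D), \quad B(y, x, \xi) \in \mathcal{P}^m, \quad y \in [0, 1],$

modulo a smooth family of smoothing operators.
5. If $u = \operatorname{PI} f$ is the solution to $\Delta u = 0$, $u|_{\partial\Omega} = f$, use Proposition 12.4 and
Theorem 12.6 to show that

(12.52)   $\operatorname{PI} : H^s(\partial\Omega) \longrightarrow H^{s+1/2}(\Omega), \quad \forall\, s \ge -\tfrac{1}{2}.$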



Compare the regularity result of Propositions 11.14—11.15 in Chap. 5. 


13. Parametrix for the heat equation 


Let $L = L(x, D)$ be a second-order, elliptic differential operator, whose principal
symbol $L_2(x, \xi)$ is a positive scalar function, though lower-order terms need
not be scalar. We want to construct an approximate solution to the initial-value
problem

(13.1)   $\dfrac{\partial u}{\partial t} = -Lu, \quad u(0) = f,$

in the form

(13.2)   $u(t, x) = \displaystyle\int a(t, x, \xi)\, e^{ix\cdot\xi}\, \hat f(\xi)\, d\xi,$

for $f$ supported in a coordinate patch. The amplitude $a(t, x, \xi)$ will have an
asymptotic expansion of the form

(13.3)   $a(t, x, \xi) \sim \displaystyle\sum_{j \ge 0} a_j(t, x, \xi),$


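and the $a_j(t, x, \xi)$ will be defined recursively, as follows. By the Leibniz formula,
write

(13.4)   $L(a\, e^{ix\cdot\xi}) = e^{ix\cdot\xi} \displaystyle\sum_{|\alpha| \le 2} \dfrac{1}{\alpha!}\, \partial_\xi^\alpha L(x, \xi)\, D_x^\alpha a(t, x, \xi) = e^{ix\cdot\xi}\Bigl[L_2(x, \xi)\, a(t, x, \xi) + \sum_{\ell=1}^{2} B_{2-\ell}(x, \xi, D_x)\, a(t, x, \xi)\Bigr],$

where $B_{2-\ell}(x, \xi, D_x)$ is a differential operator (of order $\ell$) whose coefficients are
polynomials in $\xi$, homogeneous of degree $2 - \ell$ in $\xi$.
Thus, we want the amplitude $a(t, x, \xi)$ in (13.2) to satisfy (formally)

$\dfrac{\partial a}{\partial t} \sim -L_2 a - \displaystyle\sum_{\ell=1}^{2} B_{2-\ell}(x, \xi, D_x)\, a.$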


If $a$ is taken to have the form (13.3), we obtain the following equations, called
“transport equations,” for $a_j$:

(13.5)   $\dfrac{\partial a_0}{\partial t} = -L_2(x, \xi)\, a_0(t, x, \xi)$




and, for $j \ge 1$,

(13.6)   $\dfrac{\partial a_j}{\partial t} = -L_2(x, \xi)\, a_j(t, x, \xi) + \Omega_j(t, x, \xi),$

where

(13.7)   $\Omega_j(t, x, \xi) = -\displaystyle\sum_{\ell=1}^{2} B_{2-\ell}(x, \xi, D_x)\, a_{j-\ell}(t, x, \xi).$

By convention we set $a_{-1} = 0$. So that (13.2) reduces to Fourier inversion at
$t = 0$, we set

(13.8)   $a_0(0, x, \xi) = 1, \quad a_j(0, x, \xi) = 0, \quad \text{for } j \ge 1.$

Then we have

(13.9)   $a_0(t, x, \xi) = e^{-tL_2(x,\xi)}$

and the solution to (13.6) is

(13.10)   $a_j(t, x, \xi) = \displaystyle\int_0^t e^{(s-t)L_2(x,\xi)}\, \Omega_j(s, x, \xi)\, ds.$

In view of (13.7), this defines $a_j(t, x, \xi)$ inductively in terms of $a_{j-1}(t, x, \xi)$ and
$a_{j-2}(t, x, \xi)$.
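
To see the recursion (13.5)-(13.10) at work, here is a worked example for a hypothetical constant-coefficient operator in one variable, $L = D_x^2 + c$ with $c$ a constant and $D_x = -i\,\partial_x$. This choice is an assumption made only for illustration; it is not taken from the text.

```latex
% Leibniz formula for L = D_x^2 + c (illustrative assumption):
L(a\,e^{ix\xi}) = e^{ix\xi}\bigl[\xi^2 a + 2\xi D_x a + (D_x^2 + c)a\bigr],
\qquad L_2 = \xi^2,\quad B_1 = 2\xi D_x,\quad B_0 = D_x^2 + c .

% Transport equations with a_0(0)=1, a_j(0)=0:
a_0 = e^{-t\xi^2}, \qquad
\Omega_1 = -B_1 a_0 = 0 \;\Rightarrow\; a_1 = 0, \qquad
\Omega_2 = -B_1 a_1 - B_0 a_0 = -c\,e^{-t\xi^2},

a_2(t,\xi) = \int_0^t e^{(s-t)\xi^2}\bigl(-c\,e^{-s\xi^2}\bigr)\,ds = -c\,t\,e^{-t\xi^2}.
```

In this example $a_0 + a_2 = (1 - ct)e^{-t\xi^2}$ reproduces the first two terms of the exact amplitude $e^{-t(\xi^2 + c)}$, and $a_2 = t \cdot (-c) \cdot e^{-t\xi^2}$ has the shape $t^{\mu+1} \times$ (polynomial homogeneous of degree $2 - \nu$ in $\xi$) with $\mu = 0$, $\nu = 2$.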
We now make a closer analysis of these terms. Define $A_j(t, x, \xi)$ by

(13.11)   $a_j(t, x, \xi) = A_j(t, x, \xi)\, e^{-tL_2(x,\xi)}.$

The following result is useful; it applies to $A_j$ for all $j \ge 1$.


Lemma 13.1. If $\mu = 0, 1, 2, \ldots$, $\nu \in \{1, 2\}$, then $A_{2\mu+\nu}$ can be written in the
form

(13.12)   $A_{2\mu+\nu}(t, x, \xi) = t^{\mu+1} A^\#_{2\mu+\nu}(x, \omega, \xi), \quad \text{with } \omega = t^{1/2}\xi.$

The factor $A^\#_{2\mu+\nu}(x, \omega, \xi)$ is a polynomial in both $\omega$ and $\xi$. It is homogeneous of
degree $2 - \nu$ in $\xi$ (i.e., either linear or constant). Furthermore, as a polynomial
in $\omega$, each monomial has even order; equivalently, $A^\#_{2\mu+\nu}(x, -\omega, \xi) =
A^\#_{2\mu+\nu}(x, \omega, \xi)$.


To prove the lemma, we begin by recasting (13.10). Let $\Gamma_j(t, x, \xi)$ be
defined by

(13.13)   $\Omega_j(t, x, \xi) = \Gamma_j(t, x, \xi)\, e^{-tL_2(x,\xi)}.$

Then the recursion (13.7) yields

(13.14)   $\Gamma_j\, e^{-tL_2} = -\displaystyle\sum_{\ell=1}^{2} B_{2-\ell}(x, \xi, D_x)\bigl(A_{j-\ell}\, e^{-tL_2}\bigr).$


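Applying the Leibniz formula gives

(13.15)   $\Gamma_j = -\displaystyle\sum_{\ell=1}^{2} \sum_{|\gamma| \le \ell} \Lambda_\gamma(x, \omega)\, B^{[\gamma]}_{2-\ell}(x, \xi, D_x)\, A_{j-\ell}(t, x, \xi),$

evaluated at $\omega = t^{1/2}\xi$, where

(13.16)   $e^{tL_2(x,\xi)}\, D_x^\gamma\, e^{-tL_2(x,\xi)} = \Lambda_\gamma(x, t^{1/2}\xi).$

Clearly, $\Lambda_\gamma(x, t^{1/2}\xi)$ is a polynomial in $\xi$ and also a polynomial in $t$; hence
$\Lambda_\gamma(x, \omega)$ is an even polynomial in $\omega$. Note also that the differential operator
$B^{[\gamma]}_{2-\ell}(x, \xi, D_x)$ is of order $\ell - |\gamma|$, and its coefficients are polynomials in $\xi$,
homogeneous of degree $2 - \ell$, as were those of $B_{2-\ell}(x, \xi, D_x)$. The factor $A_j$ is given
by

(13.17)   $A_j(t, x, \xi) = \displaystyle\int_0^t \Gamma_j(s, x, \xi)\, ds.$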


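The recursion (13.15)-(13.17) will provide an inductive proof of Lemma 13.1.
To carry this out, assume the lemma true for $A_j$, for all $j < 2\mu + \nu$. We then
have

(13.18)   $\Gamma_{2\mu+\nu}(t, x, \xi) = -t^{\mu+1}\displaystyle\sum_{1 \le \ell < \nu} \sum_{|\gamma| \le \ell} \Lambda_\gamma(x, \omega)\, B^{[\gamma]}_{2-\ell}(x, \xi, D_x)\, A^\#_{2\mu+\nu-\ell}(x, \omega, \xi) \;-\; t^{\mu}\sum_{\nu \le \ell \le 2} \sum_{|\gamma| \le \ell} \Lambda_\gamma(x, \omega)\, B^{[\gamma]}_{2-\ell}(x, \xi, D_x)\, A^\#_{2\mu+\nu-\ell}(x, \omega, \xi).$

The first sum is empty if $\nu = 1$. In the first sum, $A^\#_{2\mu+\nu-\ell}(x, \omega, \xi)$ is homogeneous
of degree $2 + \ell - \nu$ in $\xi$, so in the first sum

(13.19)   $t^{\mu+1}\Lambda_\gamma(x, \omega)\, B^{[\gamma]}_{2-\ell}(x, \xi, D_x)\, A^\#_{2\mu+\nu-\ell}(x, \omega, \xi) = t^{\mu+1} H^\#_{\mu\nu\ell\gamma}(x, \omega, \xi),$

where $H^\#_{\mu\nu\ell\gamma}(x, \omega, \xi)$ is a polynomial in $\xi$, homogeneous of degree $4 - \nu$, and an
even polynomial in $\omega$. We can hence write

(13.20)   $t^{\mu+1} H^\#_{\mu\nu\ell\gamma}(x, \omega, \xi) = t^{\mu} H_{\mu\nu\ell\gamma}(x, \omega, \xi),$

where $H_{\mu\nu\ell\gamma}(x, \omega, \xi)$ is a polynomial in $\xi$, homogeneous of degree $2 - \nu$, and an
even polynomial in $\omega$.

In the last sum in (13.18), $A^\#_{2\mu+\nu-\ell}$ is homogeneous in $\xi$ of degree $\ell - \nu$, so in
this sum

(13.21)   $t^{\mu}\Lambda_\gamma(x, \omega)\, B^{[\gamma]}_{2-\ell}(x, \xi, D_x)\, A^\#_{2\mu+\nu-\ell}(x, \omega, \xi) = t^{\mu} H_{\mu\nu\ell\gamma}(x, \omega, \xi),$

where, as in (13.20), $H_{\mu\nu\ell\gamma}(x, \omega, \xi)$ is a polynomial in $\xi$, homogeneous of degree
$2 - \nu$, and an even polynomial in $\omega$. Thus

(13.22)   $\Gamma_{2\mu+\nu}(t, x, \xi) = -t^{\mu}\displaystyle\sum_{\ell,\gamma} H_{\mu\nu\ell\gamma}(x, \omega, \xi) = t^{\mu} K_{\mu\nu}(x, \omega, \xi),$

where $K_{\mu\nu}$ is a polynomial in $\xi$, homogeneous of degree $2 - \nu$, and an even
polynomial in $\omega$. It follows that

(13.23)   $A_{2\mu+\nu}(t, x, \xi) = \displaystyle\int_0^t s^{\mu} K_{\mu\nu}(x, s^{1/2}\xi, \xi)\, ds$

has the properties stated in Lemma 13.1, whose proof is complete.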
The analysis of (13.12) yields estimates on $a_j(t, x, \xi)$, easily obtained by writing
(for $j = 2\mu + \nu$, $\nu = 1$ or $2$)

(13.24)   $a_j(t, x, \xi) = t^{\mu+1} A^\#_j(x, \omega, \xi)\, e^{-L_2(x,\omega)/2}\, e^{-tL_2(x,\xi)/2},$

and using the simple estimates

(13.25)   $|\omega|^k e^{-L_2(x,\omega)/2} \le C_k, \quad (t|\xi|^2)^\ell e^{-tL_2(x,\xi)/2} \le C_\ell.$

Note that $t^{\mu+1} = t^{j/2}$ if $j$ is even; if $j$ is odd, then $t^{\mu+1} = t^{j/2} \cdot t^{1/2}$, and the
factor $t^{1/2}$ can be paired with the linear factor of $\xi$ in $A^\#_j$. Thus we have estimates

(13.26)   $|a_j(t, x, \xi)| \le C_j t^{j/2}$

and

(13.27)   $|a_j(t, x, \xi)| \le C_j \langle\xi\rangle^{-j}.$

Derivatives are readily estimated by the same method, and we obtain:


Lemma 13.2. For $0 \le t \le T$, $k \ge -j$, we have

(13.28)   $t^{k/2} D_t^\ell a_j(t, x, \xi)$ bounded in $S^{2\ell-k-j}_{1,0}$.




We can construct a function $a(t, x, \xi)$ such that each difference $a(t, x, \xi) -
\sum_{\ell < j} a_\ell(t, x, \xi)$ has the properties (13.28), and then, for $u(t, x)$ given by (13.2),
we have $u(0, x) = f(x)$ and

(13.29)   $\Bigl(\dfrac{\partial}{\partial t} + L\Bigr) u(t, x) = r(t, x),$

where $r(t, x)$ is smooth for $t \ge 0$ and rapidly decreasing as $t \searrow 0$. If the construction
is made on a compact manifold $M$, energy estimates imply that the difference
between $u(t, x)$ and $v(t, x) = e^{-tL} f(x)$ is smooth and rapidly decreasing as
$t \searrow 0$, for all $f \in \mathcal{D}'(M)$. Consequently the “heat kernel” $H(t, x, y)$, given by

(13.30)   $e^{-tL} f(x) = \displaystyle\int_M H(t, x, y)\, f(y)\, dV(y),$

and the integral kernel $Q(t, x, y)$ of the operator constructed in the form (13.2)
differ by a function $R(t, x, y)$, which is smooth on $[0, \infty) \times M \times M$ and rapidly
decreasing as $t \searrow 0$.

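Look at the integral kernel of the operator

(13.31)   $Q_j(t, x, D)f = \displaystyle\int a_j(t, x, \xi)\, e^{ix\cdot\xi}\, \hat f(\xi)\, d\xi,$

which is

(13.32)   $Q_j(t, x, y) = (2\pi)^{-n} \displaystyle\int_{\mathbb{R}^n} a_j(t, x, \xi)\, e^{i(x-y)\cdot\xi}\, d\xi.$

For $a_j(t, x, \xi)$ in the form (13.11)-(13.12), we obtain

(13.33)   $Q_j(t, x, y) = t^{(j-n)/2}\, q_j\bigl(x, t^{-1/2}(x - y)\bigr),$

where

(13.34)   $q_0(x, z) = (2\pi)^{-n} \displaystyle\int_{\mathbb{R}^n} e^{-L_2(x,\xi)}\, e^{iz\cdot\xi}\, d\xi$

and, for $j \ge 1$,

(13.35)   $q_j(x, z) = (2\pi)^{-n} \displaystyle\int_{\mathbb{R}^n} A^\#_j(x, \xi, \xi)\, e^{-L_2(x,\xi)}\, e^{iz\cdot\xi}\, d\xi.$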
R” 


We can evaluate the Gaussian integral (13.34) via the method developed in
Chap. 3. If, in the local coordinate system used in (13.2), $L_2(x, \xi) = \mathcal{L}(x)\xi \cdot \xi$,
for a positive-definite matrix $\mathcal{L}(x)$, then

(13.36)   $q_0(x, z) = \bigl[\det\bigl(4\pi\mathcal{L}(x)\bigr)\bigr]^{-1/2}\, e^{-G(x)z\cdot z/4},$

where $G(x) = \mathcal{L}(x)^{-1}$. Consequently,

(13.37)   $Q_0(t, x, y) = (4\pi t)^{-n/2}\bigl[\det \mathcal{L}(x)\bigr]^{-1/2}\, e^{-G(x)(x-y)\cdot(x-y)/4t}.$
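
As a sanity check on (13.36), the sketch below compares a trapezoid-rule approximation of the Gaussian integral (13.34) with the closed form, in the one-dimensional case where $\mathcal{L}(x)$ is a positive number `ell`. The function names, the truncation `xi_max`, and the step count are illustrative assumptions.

```python
import math


def q0_numeric(z, ell, xi_max=12.0, steps=24000):
    """Trapezoid approximation of (2*pi)^(-1) * integral exp(-ell*xi^2 + i*z*xi) d xi."""
    h = 2 * xi_max / steps
    total = 0.0
    for i in range(steps + 1):
        xi = -xi_max + i * h
        weight = 0.5 if i in (0, steps) else 1.0
        # The imaginary part integrates to zero by symmetry, so only cos is kept.
        total += weight * math.exp(-ell * xi * xi) * math.cos(z * xi)
    return total * h / (2 * math.pi)


def q0_closed(z, ell):
    """Right side of (13.36) for n = 1: (4*pi*ell)^(-1/2) * exp(-z^2 / (4*ell))."""
    return (4 * math.pi * ell) ** -0.5 * math.exp(-z * z / (4 * ell))


if __name__ == "__main__":
    for z in (0.0, 0.7, 2.0):
        print(z, q0_numeric(z, 1.0), q0_closed(z, 1.0))
```

The two values should agree closely for moderate `z` and `ell`, since the integrand is smooth and decays rapidly.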


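The integrals (13.35) can be computed in terms of

(13.38)   $(2\pi)^{-n} \displaystyle\int_{\mathbb{R}^n} \xi^\beta e^{-L_2(x,\xi)}\, e^{iz\cdot\xi}\, d\xi = \bigl[\det\bigl(4\pi\mathcal{L}(x)\bigr)\bigr]^{-1/2} D_z^\beta e^{-G(x)z\cdot z/4} = p_\beta(x, z)\, e^{-G(x)z\cdot z/4},$

where $p_\beta(x, z)$ is a polynomial of degree $|\beta|$ in $z$. Clearly, $p_\beta(x, z)$ is even or odd
in $z$ according to the parity of $|\beta|$. Note also that, in (13.35), $A^\#_j(x, \xi, \xi)$ is even
or odd in $\xi$ according to the parity of $j$. We hence obtain the following result.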


Proposition 13.3. If $L$ is a second-order, elliptic differential operator with positive
scalar principal symbol, then the integral kernel $H(t, x, y)$ of the operator
$e^{-tL}$ has the form

(13.39)   $H(t, x, y) \sim \displaystyle\sum_{j \ge 0} t^{(j-n)/2}\, p_j\bigl(x, t^{-1/2}(x - y)\bigr)\, e^{-G(x)(x-y)\cdot(x-y)/4t},$

where $p_j(x, z)$ is a polynomial in $z$, which is even or odd in $z$ according to the
parity of $j$.


To be precise about the strong sense in which (13.39) holds, we note that, for
any $\nu < \infty$, there is an $N < \infty$ such that the difference $R_N(t, x, y)$ between
the left side of (13.39) and the sum over $j \le N$ of the right side belongs to
$C^\nu([0, \infty) \times M \times M)$ and vanishes to order $\nu$ as $t \searrow 0$.

In particular, we have

(13.40)   $H(t, x, x) \sim \displaystyle\sum_{j \ge 0} t^{-n/2+j}\, p_{2j}(x, 0),$

since $p_j(x, 0) = 0$ for $j$ odd. Consequently, the trace of the operator $e^{-tL}$ has the
asymptotic expansion

(13.41)   $\operatorname{Tr} e^{-tL} \sim t^{-n/2}\bigl(a_0 + a_1 t + a_2 t^2 + \cdots\bigr),$

with

(13.42)   $a_j = \displaystyle\int_M p_{2j}(x, 0)\, dV(x).$
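
A quick numerical illustration of (13.41), under the assumption (made here only as an example) that $M$ is the circle of length $2\pi$ and $L = -d^2/d\theta^2$. Then the eigenvalues are $k^2$, $k \in \mathbb{Z}$, and, taking $p_0(x, 0) = (4\pi)^{-1/2}$ from (13.36) with $\mathcal{L} = 1$, one gets $a_0 = 2\pi (4\pi)^{-1/2} = \sqrt{\pi}$. The function names and the cutoff `k_max` are hypothetical.

```python
import math


def heat_trace_circle(t, k_max=2000):
    """Tr exp(-tL) for L = -d^2/dtheta^2 on the circle: sum over integers k of exp(-t k^2)."""
    return 1.0 + 2.0 * sum(math.exp(-t * k * k) for k in range(1, k_max + 1))


def leading_term(t):
    """Leading term t^(-1/2) * a_0 of (13.41) with n = 1 and a_0 = sqrt(pi)."""
    return math.sqrt(math.pi / t)


if __name__ == "__main__":
    for t in (0.5, 0.05, 0.01):
        print(t, heat_trace_circle(t), leading_term(t))
```

In this flat example the higher coefficients vanish, so the ratio of the two quantities tends to 1 very quickly as $t$ decreases.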


Further use will be made of this in Chaps. 8 and 10.
Note that the exponent in (13.39) agrees with $r(x, y)^2/4t$, up to $O(r^3/t)$, for
$x$ close to $y$, where $r(x, y)$ is the geodesic distance from $x$ to $y$. In fact, when
$L = -\Delta$, the integral operator with kernel

(13.43)   $H_0(t, x, y) = (4\pi t)^{-n/2}\, e^{-r(x,y)^2/4t}, \quad t > 0,$

is in some ways a better first approximation to $e^{-tL}$ than is (13.2) with $a(t, x, \xi)$
replaced by $a_0(t, x, \xi) = e^{-tL_2(x,\xi)}$. (See Exercise 3 below.) It can be shown that


(13.44)   $\Bigl(\dfrac{\partial}{\partial t} + L_x\Bigr) H_0(t, x, y) = Q(t, x, y), \quad t > 0,$

is the integral kernel of an operator that is regularizing, and if one defines

(13.45)   $H_0 \# Q(t, x, y) = \displaystyle\int_0^t \int_M H_0(t - s, x, z)\, Q(s, z, y)\, dV(z)\, ds,$

then a parametrix that is as good as (13.39) can be obtained in the form

(13.46)   $H \sim H_0 - H_0 \# Q + H_0 \# Q \# Q - \cdots.$


This approach, one of several alternatives to that used above, is taken in [MS].
One can also look at (13.43)-(13.46) from a pseudodifferential operator perspective,
as done in [Gr]. The symbol of $\partial/\partial t + L$ is $i\tau + L(x, \xi)$, and

(13.47)   $H_0(x, \tau, \xi) = \bigl(i\tau + L_2(x, \xi)\bigr)^{-1} \in S^{-1}_{1/2,0}(\mathbb{R} \times M).$

The operator with integral kernel $H_0(t - s, x, y)$ given by (13.43) belongs to
$OPS^{-1}_{1/2,0}(\mathbb{R} \times M)$ and has (13.47) as its principal symbol. This operator has two
additional properties; it is causal, that is, if $v$ vanishes for $t < T$, so does $H_0 v$,
for any $T$, and it commutes with translations. Denote by $\mathcal{C}^m$ the class of operators
in $OPS^m_{1/2,0}(\mathbb{R} \times M)$ with these two properties. One easily has $P_j \in \mathcal{C}^{m_j} \Rightarrow
P_1 P_2 \in \mathcal{C}^{m_1+m_2}$. The symbol computation gives

(13.48)   $\Bigl(\dfrac{\partial}{\partial t} + L\Bigr) H_0 = I + Q, \quad Q \in \mathcal{C}^{-1/2},$

and from here one obtains a parametrix

(13.49)   $H \in \mathcal{C}^{-1}, \quad H \sim H_0 - H_0 Q + H_0 Q^2 - \cdots.$

The formulas (13.46) and (13.49) agree, via the correspondence of operators and
their integral kernels.




One can proceed to construct a parametrix for the heat equation on a manifold
with boundary. We sketch an approach, using a variant of the double-layer-potential
method described for elliptic boundary problems in §11. Let $\Omega$ be an
open domain, with smooth boundary, in $M$, a compact Riemannian manifold
without boundary. We construct an approximate solution to

(13.50)   $\dfrac{\partial u}{\partial t} = -Lu,$

for $(t, x) \in \mathbb{R}^+ \times \Omega$, satisfying

(13.51)   $u(0, x) = 0, \quad u(t, x) = h(t, x), \quad \text{for } x \in \partial\Omega,$

in the form

(13.52)   $u = \mathcal{D}\ell\, g(t, x) = \displaystyle\int_0^t \int_{\partial\Omega} g(s, y)\, \dfrac{\partial H}{\partial \nu_y}(t - s, x, y)\, dS(y)\, ds,$

where $H(t, x, y)$ is the heat kernel on $\mathbb{R}^+ \times M$ studied above. For $x \in \partial\Omega$,
denote by $\mathcal{D}\ell\, g_+(t, x)$ the limit of $\mathcal{D}\ell\, g$ from within $\mathbb{R}^+ \times \Omega$. As in (11.7), one
can establish the identity

(13.53)   $\mathcal{D}\ell\, g_+ = \dfrac{1}{2}(I + N)g,$

where $(1/2)Ng$ is given by the double integral on the right side of (13.52), with
$y$ and $x$ both in $\partial\Omega$. In analogy with (11.23), we have

$N \in OPS^{-1/2}_{1/2,0}(\mathbb{R}^+ \times \partial\Omega).$

For $u$ to solve (13.50)-(13.51), we need

(13.54)   $h = \dfrac{1}{2}(I + N)g.$

Thus we have a parametrix for (13.50)-(13.51) in the form (13.52) with

(13.55)   $g \sim 2(I - N + N^2 - \cdots)h.$
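
The algebra behind (13.54)-(13.55) is that of a Neumann series. As a purely finite-dimensional analogue (an assumption made for illustration; in the text $N$ is a pseudodifferential operator and the series is asymptotic), the sketch below replaces $N$ by a small matrix of norm less than 1 and checks that the partial sums of $2(I - N + N^2 - \cdots)h$ solve $\tfrac12(I + N)g = h$. All names are hypothetical.

```python
def mat_vec(N, v):
    """Multiply a square matrix (list of rows) by a vector."""
    return [sum(N[i][j] * v[j] for j in range(len(v))) for i in range(len(N))]


def neumann_solve(N, h, terms=60):
    """Approximate g = 2 (I - N + N^2 - ...) h using the first `terms` terms."""
    g = [0.0] * len(h)
    term = list(h)  # current term (-N)^k h, starting with k = 0
    for _ in range(terms):
        g = [gi + ti for gi, ti in zip(g, term)]
        term = [-x for x in mat_vec(N, term)]
    return [2.0 * gi for gi in g]


if __name__ == "__main__":
    N = [[0.2, 0.1], [0.0, 0.3]]
    h = [1.0, -2.0]
    g = neumann_solve(N, h)
    Ng = mat_vec(N, g)
    print([0.5 * (gi + ni) for gi, ni in zip(g, Ng)])  # should be close to h
```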


We can use the analysis of (13.50)-(13.55) to construct a parametrix for the
solution operator to

(13.56)   $\dfrac{\partial u}{\partial t} = \Delta u, \ \text{for } x \in \Omega, \quad u(0, x) = f(x), \quad u(t, x) = 0, \ \text{for } x \in \partial\Omega.$




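To begin, let $v$ solve

(13.57)   $\dfrac{\partial v}{\partial t} = \Delta v \ \text{on } \mathbb{R}^+ \times M, \quad v(0) = \tilde f,$

where

(13.58)   $\tilde f(x) = f(x), \ \text{for } x \in \Omega; \qquad \tilde f(x) = 0, \ \text{for } x \in M \setminus \Omega.$

One way to obtain $u$ would be to subtract a solution to (13.50)-(13.51), with
$-L = \Delta$, $h = v|_{\mathbb{R}^+ \times \partial\Omega}$. This leads to a parametrix for the solution operator for
(13.56) of the form

(13.59)   $p(t, x, y) = H(t, x, y) - \displaystyle\int_0^t \int_{\partial\Omega} h(s, z, y)\, \dfrac{\partial H}{\partial \nu_z}(t - s, x, z)\, dS(z)\, ds, \qquad h(s, z, y) \sim 2H(s, z, y) + \cdots,$

where, as above, $H(t, x, y)$ is the heat kernel on $\mathbb{R}^+ \times M$.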

We mention an alternative treatment of (13.56) that has some advantages. We
will apply a reflection to $v$. To do this, assume that $\Omega$ is contained in a compact
Riemannian manifold $M$, diffeomorphic to the double of $\Omega$, and let $R : M \to M$
be a smooth involution of $M$, fixing $\partial\Omega$, which near $\partial\Omega$ is a reflection of each
geodesic normal to $\partial\Omega$, about the point where the geodesic intersects $\partial\Omega$. Pulling
back the metric tensor on $M$ by $R$ yields a metric tensor that agrees with the
original on $\partial\Omega$. Now set

(13.60)   $u_1(t, x) = v(t, x) - v(t, R(x)), \quad x \in \Omega.$


We see that $u_1$ satisfies

(13.61)   $\dfrac{\partial u_1}{\partial t} = \Delta u_1 + g, \quad u_1(0, x) = f, \quad u_1(t, x) = 0, \ \text{for } x \in \partial\Omega,$
where 
(13.62) g= 20) 3) 0E@) SUE ae), 


and where $L^b$ is a second-order differential operator, with smooth coefficients,
whose principal symbol vanishes on $\partial\Omega$. Thus the difference $u - u_1 = w$ solves

(13.63)   $\dfrac{\partial w}{\partial t} = \Delta w - g, \quad w(0) = 0, \quad w(t, x) = 0, \ \text{for } x \in \partial\Omega.$




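Next let $v_2$ solve

(13.64)   $\dfrac{\partial v_2}{\partial t} = \Delta v_2 - \tilde g \ \text{on } \mathbb{R}^+ \times M, \quad v_2(0) = 0,$

where

(13.65)   $\tilde g(t, x) = g(t, x), \ \text{for } x \in \Omega; \qquad \tilde g(t, x) = 0, \ \text{for } x \in M \setminus \Omega,$

and set

(13.66)   $u_2 = v_2\big|_{\mathbb{R}^+ \times \Omega}.$

It follows that $w_2 = u - (u_1 + u_2)$ satisfies

(13.67)   $\dfrac{\partial w_2}{\partial t} = \Delta w_2 \ \text{on } \mathbb{R}^+ \times \Omega, \quad w_2(0) = 0, \quad w_2\big|_{\mathbb{R}^+ \times \partial\Omega} = -v_2\big|_{\mathbb{R}^+ \times \partial\Omega}.$

Now we can obtain $w_2$ by the construction (13.52)-(13.55), with

$h = -v_2\big|_{\mathbb{R}^+ \times \partial\Omega}.$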


To illustrate the effect of this construction using reflection, suppose that, in
(13.56),

(13.68)   $f \in H^1_0(\Omega).$

Then, in (13.57)-(13.58), $\tilde f \in H^1(M)$, so $v \in C(\mathbb{R}^+, H^1(M))$, and hence

(13.69)   $u_1 \in C(\mathbb{R}^+, H^1_0(\Omega)).$

Furthermore, given the nature of $L^b$ and that of the heat kernel on $\mathbb{R}^+ \times M \times M$,
one can show that, in (13.62),

(13.70)   $g \in C(\mathbb{R}^+, L^2(\Omega)),$

that is, $L^b$ effectively acts like a first-order operator on $v$, when one restricts to
$\Omega$. It follows that $\tilde g \in C(\mathbb{R}^+, L^2(M))$ and hence, via Duhamel’s formula for the
solution to (13.64), that $v_2 \in C(\mathbb{R}^+, H^{2-\varepsilon}(M))$, $\forall\, \varepsilon > 0$. Therefore,

(13.71)   $u_2 \in C(\mathbb{R}^+, H^{2-\varepsilon}(\Omega)),$

and, in (13.67), we have a PDE of the form (13.50)-(13.51), with $h \in
C(\mathbb{R}^+, H^{3/2-\varepsilon}(\partial\Omega))$, for all $\varepsilon > 0$. One can deduce from (13.52)-(13.55)
that $w_2$ has as much regularity as that given for $u_2$ in (13.71).

It also follows directly from Duhamel’s principle, applied to (13.63), that

(13.72)   $w \in C(\mathbb{R}^+, H^{2-\varepsilon}(\Omega)),$

so we can see without analyzing (13.52)-(13.55) that $w_2$ has as much regularity as
mentioned above. Either way, we see that when $f$ satisfies (13.68), the principal
singularities of the solution $u$ to (13.56) are captured by $u_1$, defined by (13.60).
Constructions of $u_2$ and, via (13.52)-(13.55), of $w_2$ yield smoother corrections,
at least when smoothness is measured in the spaces used above.

The construction (13.56)-(13.67) can be compared with constructions in §7 of
Chap. 13.


Exercises 


1. Let $L$ be a positive, self-adjoint, elliptic differential operator of order $2k > 0$ on a
compact manifold $M$, with scalar principal symbol $L_{2k}(x, \xi)$. Show that a parametrix
for $\partial u/\partial t = -Lu$ can be constructed in the form (13.2)-(13.3), with $a_j(t, x, \xi)$ of the
following form, generalizing (13.11)-(13.12):

$a_j(t, x, \xi) = A_j(t, x, \xi)\, e^{-tL_{2k}(x,\xi)},$

where $A_0(t, x, \xi) = 1$ and if $\mu = 0, 1, 2, \ldots$ and $\nu \in \{1, \ldots, 2k\}$, then

$A_{2k\mu+\nu}(t, x, \xi) = t^{\mu+1} A^\#_{2k\mu+\nu}(x, \omega, \xi), \quad \omega = t^{1/2k}\xi,$

where $A^\#_{2k\mu+\nu}(x, \omega, \xi)$ is a polynomial in $\xi$, homogeneous of degree $2k - \nu$, whose
coefficients are polynomials in $\omega$, each monomial of which has degree (in $\omega$) that is an
integral multiple of $2k$, so $A^\#_{2k\mu+\nu}(x, e^{\pi i/k}\omega, \xi) = A^\#_{2k\mu+\nu}(x, \omega, \xi)$.

2. In the setting of Exercise 1, show that

$\operatorname{Tr} e^{-tL} \sim t^{-n/2k}\bigl(a_0 + a_1 t^{1/k} + a_2 t^{2/k} + \cdots\bigr),$

generalizing (13.41).

3. Let $g_{jk}(y, x)$ denote the components of the metric tensor at $x$ in a normal coordinate
system centered at $y$. Suppose $-Lu(x) = \Delta u(x) = g^{jk}(y, x)\,\partial_j\partial_k u(x) +
b^j(y, x)\,\partial_j u(x)$ in this coordinate system. With $H_0(t, x, y)$ given by (13.43), show
that

$\Bigl(\dfrac{\partial}{\partial t} + L_x\Bigr) H_0(t, x, y) = H_0(t, x, y)\Bigl\{(2t)^{-2}\bigl[g^{jk}(y, y) - g^{jk}(y, x)\bigr](x_j - y_j)(x_k - y_k) - (2t)^{-1}\bigl[g^j{}_j(y, y) - g^j{}_j(y, x) - b^j(y, x)(x_j - y_j)\bigr]\Bigr\}$

$= H_0(t, x, y)\Bigl\{O\Bigl(\dfrac{|x - y|^4}{t^2}\Bigr) + O\Bigl(\dfrac{|x - y|^2}{t}\Bigr)\Bigr\}.$

Compare formula (2.10) in Chap. 5. Note that $g_{jk}(y, y) = \delta_{jk}$, $\partial_\ell g_{jk}(y, y) = 0$, and
$b^j(y, y) = 0$. Relate this calculation to the discussion involving (13.43)-(13.49).

4. Using the parametrix, especially (13.39), show that if $M$ is a smooth, compact Riemannian
manifold, without boundary, then

$e^{t\Delta} : C^k(M) \longrightarrow C^k(M)$

is a strongly continuous semigroup, for each $k \in \mathbb{Z}^+$.


14. The Weyl calculus 


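To define the Weyl calculus, we begin with a modification of the formula (1.10)
for $a(x, D)$. Namely, we replace $e^{iq\cdot X} e^{ip\cdot D}$ by $e^{i(q\cdot X + p\cdot D)}$, and set

(14.1)   $a(X, D)u = \displaystyle\int \hat a(q, p)\, e^{i(q\cdot X + p\cdot D)} u\, dq\, dp,$

initially for $a(x, \xi) \in \mathcal{S}(\mathbb{R}^{2n})$. Note that $v(t, x) = e^{it(q\cdot X + p\cdot D)}u(x)$ solves the
PDE

(14.2)   $\dfrac{\partial v}{\partial t} = \displaystyle\sum_j p_j \dfrac{\partial v}{\partial x_j} + i(q \cdot x)v, \quad v(0, x) = u(x),$

and the solution is readily obtained by integrating along the integral curves of
$\partial/\partial t - \sum p_j\, \partial/\partial x_j$, which are straight lines. We get

(14.3)   $e^{i(q\cdot X + p\cdot D)}u(x) = e^{iq\cdot x + iq\cdot p/2}\, u(x + p).$

Note that this is equivalent to the identity

(14.4)   $e^{i(q\cdot X + p\cdot D)} = e^{iq\cdot p/2}\, e^{iq\cdot X} e^{ip\cdot D}.$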
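
The solution formula behind (14.3) can be checked numerically. The sketch below, in one space dimension, takes the candidate $v(t, x) = e^{itqx + it^2 qp/2}\, u(x + tp)$ (that is, (14.3) with $(q, p)$ replaced by $(tq, tp)$) and evaluates a central-difference residual of the PDE (14.2). The Gaussian test function `u` and all names are assumptions made for illustration.

```python
import cmath
import math


def u(x):
    """Hypothetical smooth test function."""
    return math.exp(-x * x)


def v(t, x, q, p):
    """Candidate solution: (14.3) with (q, p) scaled by t, in one dimension."""
    return cmath.exp(1j * t * q * x + 0.5j * t * t * q * p) * u(x + t * p)


def pde_residual(t, x, q, p, h=1e-5):
    """Central-difference residual of dv/dt - p dv/dx - i q x v."""
    dv_dt = (v(t + h, x, q, p) - v(t - h, x, q, p)) / (2 * h)
    dv_dx = (v(t, x + h, q, p) - v(t, x - h, q, p)) / (2 * h)
    return dv_dt - p * dv_dx - 1j * q * x * v(t, x, q, p)


if __name__ == "__main__":
    print(abs(pde_residual(0.3, 0.2, 1.5, -0.7)))  # should be near zero
```

Setting $t = 1$ in the candidate recovers (14.3); the residual is only zero up to finite-difference error.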


If we plug (14.3) into (14.1), a few manipulations using the Fourier inversion
formula yield

(14.5)   $a(X, D)u(x) = (2\pi)^{-n} \displaystyle\iint a\Bigl(\dfrac{x + y}{2}, \xi\Bigr) e^{i(x-y)\cdot\xi}\, u(y)\, dy\, d\xi,$

which can be compared with the formula (1.3) for $a(x, D)$. Note that $a(X, D)$
is of the form (3.2) with $a(x, y, \xi) = a((x + y)/2, \xi)$, while $a(x, D)$ is of the
form (3.2) with $a(x, y, \xi) = a(x, \xi)$. In particular, Proposition 3.1 is applicable;
we have

(14.6)   $a(X, D) = b(x, D),$

where

(14.7)   $b(x, \xi) = e^{iD_\xi\cdot D_y}\, a\Bigl(\dfrac{x + y}{2}, \xi\Bigr)\Big|_{y=x} = e^{(i/2)D_\xi\cdot D_x}\, a(x, \xi).$


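If $a(x, \xi) \in S^m_{\rho,\delta}$, with $0 \le \delta < \rho \le 1$, then $b(x, \xi)$ also belongs to $S^m_{\rho,\delta}$ and,
by (3.6),

(14.8)   $b(x, \xi) \sim \displaystyle\sum_{\alpha \ge 0} \dfrac{i^{|\alpha|}}{\alpha!}\, 2^{-|\alpha|}\, D_\xi^\alpha D_x^\alpha a(x, \xi).$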


Of course this relation is invertible; we have $a(x, \xi) = e^{-(i/2)D_\xi\cdot D_x}\, b(x, \xi)$ and a
corresponding asymptotic expansion. Thus, at least on a basic level, the two methods
of assigning an operator, either $a(x, D)$ or $a(X, D)$, to a symbol $a(x, \xi)$ lead
to equivalent operator calculi. However, they are not identical, and the differences
sometimes lead to subtle advantages for the Weyl calculus.
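
A one-variable worked example may make the passage between the two calculi concrete. It assumes the convention $D = -i\,\partial$ and the symbol $a(x, \xi) = x\xi$, chosen here only for illustration.

```latex
% From (14.7), with a(x,\xi) = x\xi and D = -i\partial:
b(x,\xi) = e^{(i/2) D_\xi D_x}(x\xi)
         = x\xi + \tfrac{i}{2}\, D_\xi D_x (x\xi)
         = x\xi + \tfrac{i}{2}(-i)^2
         = x\xi - \tfrac{i}{2}.

% Operator check, using [D, X] = -i:
a(X,D) = \tfrac12 (XD + DX) = XD + \tfrac12 [D, X] = XD - \tfrac{i}{2} = b(x,D).
```

Only the first correction term in (14.8) survives here, because $x\xi$ is bilinear; the symmetric ordering $\tfrac12(XD + DX)$ is the standard Weyl quantization of $x\xi$.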

One difference is that since the adjoint of $e^{i(q\cdot X + p\cdot D)}$ is $e^{-i(q\cdot X + p\cdot D)}$, we have
the formula

(14.9)   $a(X, D)^* = b(X, D), \quad b(x, \xi) = a(x, \xi)^*,$

which is somewhat simpler than the formula (3.13)-(3.14) for $a(x, D)^*$.

Other differences can be traced to the fact that the Weyl calculus exhibits certain
symmetries rather clearly. To explain this, we recall, from the exercises after
§1, that the set of operators

(14.10)   $e^{it} e^{iq\cdot X} e^{ip\cdot D} = \tilde\pi(t, q, p)$

form a unitary group of operators on $L^2(\mathbb{R}^n)$, a representation of the group $H^n$,
with group law

(14.11)   $(t, q, p) \circ (t', q', p') = (t + t' + p \cdot q',\ q + q',\ p + p').$

Now, using (14.4), one easily computes that

(14.12)   $e^{i(t + q\cdot X + p\cdot D)}\, e^{i(t' + q'\cdot X + p'\cdot D)} = e^{i(s + u\cdot X + v\cdot D)}$

with $u = q + q'$, $v = p + p'$, and

(14.13)   $s = t + t' + \dfrac{1}{2}(p \cdot q' - q \cdot p') = t + t' + \dfrac{1}{2}\sigma\bigl((p, q), (p', q')\bigr),$

where $\sigma$ is the natural symplectic form on $\mathbb{R}^n \times \mathbb{R}^n$. Thus

(14.14)   $\pi(t, q, p) = e^{i(t + q\cdot X + p\cdot D)}$

defines a unitary representation of a group we’ll denote $\mathbf{H}^n$, which is $\mathbb{R} \times \mathbb{R}^{2n}$
with group law

(14.15)   $(t, w) \cdot (t', w') = \Bigl(t + t' + \dfrac{1}{2}\sigma(w, w'),\ w + w'\Bigr),$



where we have set $w = (q, p)$. Of course, the groups $H^n$ and $\mathbf{H}^n$ are isomorphic;
both are called the Heisenberg group. The advantage of using the group
law (14.15) rather than (14.11) is that it makes transparent the existence of the
action of the group $Sp(n, \mathbb{R})$ of linear symplectic maps on $\mathbb{R}^{2n}$, as a group of
automorphisms of $\mathbf{H}^n$. Namely, if $g : \mathbb{R}^{2n} \to \mathbb{R}^{2n}$ is a linear map preserving the
symplectic form, so $\sigma(gw, gv) = \sigma(w, v)$ for $v, w \in \mathbb{R}^{2n}$, then

(14.16)   $\alpha(g) : \mathbf{H}^n \to \mathbf{H}^n, \quad \alpha(g)(t, w) = (t, gw)$

defines an automorphism of $\mathbf{H}^n$, so

(14.17)   $(t, w) \cdot (t', w') = (s, v) \Longrightarrow (t, gw) \cdot (t', gw') = (s, gv)$

and $\alpha(gg') = \alpha(g)\alpha(g')$. The associated action of $Sp(n, \mathbb{R})$ on $H^n$ has a formula
that is less clean.
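
The two group laws and the symplectic action can be checked with exact rational arithmetic. The sketch below works with $n = 1$; the map `to_symmetric`, $(t, q, p) \mapsto (t - qp/2, (q, p))$, is read off from (14.4) and is offered as one explicit isomorphism, and the $2 \times 2$ matrix of determinant 1 is an arbitrary example. All function names are hypothetical.

```python
from fractions import Fraction as F


def sigma(w, w2):
    """Symplectic form as in (14.13) for n = 1, with w = (q, p): p*q' - q*p'."""
    (q, p), (q2, p2) = w, w2
    return p * q2 - q * p2


def mul_polarized(a, b):
    """Group law (14.11) on triples (t, q, p)."""
    (t, q, p), (t2, q2, p2) = a, b
    return (t + t2 + p * q2, q + q2, p + p2)


def mul_symmetric(a, b):
    """Group law (14.15) on pairs (t, w), w = (q, p)."""
    (t, w), (t2, w2) = a, b
    return (t + t2 + F(1, 2) * sigma(w, w2), (w[0] + w2[0], w[1] + w2[1]))


def to_symmetric(a):
    """Map (t, q, p) -> (t - q*p/2, (q, p)), suggested by (14.4)."""
    t, q, p = a
    return (t - F(1, 2) * q * p, (q, p))


def act(g, a):
    """alpha(g) from (14.16): apply the 2x2 matrix g to w and leave t alone."""
    t, (q, p) = a
    return (t, (g[0][0] * q + g[0][1] * p, g[1][0] * q + g[1][1] * p))


if __name__ == "__main__":
    a, b = (F(1), F(2), F(3)), (F(-1, 2), F(5), F(7))
    print(to_symmetric(mul_polarized(a, b)))
    print(mul_symmetric(to_symmetric(a), to_symmetric(b)))
```

The two printed values coincide, which is the homomorphism property of `to_symmetric` on this pair of elements; `act` can be tested the same way against `mul_symmetric`.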
This leads to an action of $Sp(n, \mathbb{R})$ on operators in the Weyl calculus. Setting

(14.18)   $a_g(x, \xi) = a\bigl(g^{-1}(x, \xi)\bigr),$

we have

(14.19)   $a(X, D)\, b(X, D) = c(X, D) \Longrightarrow a_g(X, D)\, b_g(X, D) = c_g(X, D),$

for $g \in Sp(n, \mathbb{R})$.
In fact, let us rewrite (14.1) as

$a(X, D) = \displaystyle\int \hat a(w)\, \pi(0, w)\, dw.$

Then

(14.20)   $a(X, D)\, b(X, D) = \displaystyle\iint \hat a(w)\, \hat b(w')\, \pi(0, w)\, \pi(0, w')\, dw\, dw' = \iint \hat a(w)\, \hat b(w')\, e^{i\sigma(w,w')/2}\, \pi(0, w + w')\, dw\, dw',$

so $c(X, D)$ in (14.19) has symbol satisfying

(14.21)   $\hat c(w) = (2\pi)^{-n} \displaystyle\int \hat a(w - w')\, \hat b(w')\, e^{i\sigma(w,w')/2}\, dw'.$

The implication in (14.19) follows immediately from this formula. Let us write
$c(x, \xi) = (a \circ b)(x, \xi)$ when this relation holds.
From (14.21), one easily obtains the product formula


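(14.22)   $(a \circ b)(x, \xi) = e^{(i/2)(D_y\cdot D_\xi - D_x\cdot D_\eta)}\, a(x, \xi)\, b(y, \eta)\Big|_{y=x,\ \eta=\xi}.$

If $a \in S^m_{\rho,\delta}$, $b \in S^\mu_{\rho,\delta}$, $0 \le \delta < \rho \le 1$, we have the following asymptotic
expansion:

(14.23)   $(a \circ b)(x, \xi) \sim ab + \displaystyle\sum_{j \ge 1} \dfrac{1}{j!}\, \{a, b\}_j(x, \xi),$

where

(14.24)   $\{a, b\}_j(x, \xi) = \Bigl(-\dfrac{i}{2}\Bigr)^j \bigl(\partial_y \cdot \partial_\xi - \partial_x \cdot \partial_\eta\bigr)^j a(x, \xi)\, b(y, \eta)\Big|_{y=x,\ \eta=\xi}.$

For comparison, recall the formula for

(14.25)   $a(x, D)\, b(x, D) = (a \# b)(x, D)$

given by (3.16)-(3.20):

(14.26)   $(a \# b)(x, \xi) = e^{iD_\eta\cdot D_y}\, a(x, \eta)\, b(y, \xi)\Big|_{y=x,\ \eta=\xi} \sim \displaystyle\sum_{\alpha \ge 0} \dfrac{i^{|\alpha|}}{\alpha!}\, D_\xi^\alpha a(x, \xi)\, D_x^\alpha b(x, \xi).$

In the respective cases, $(a \circ b)(x, \xi)$ differs from the sum over $j < N$ by an
element of $S^{m+\mu-N(\rho-\delta)}_{\rho,\delta}$, and $(a \# b)(x, \xi)$ differs from the sum over $|\alpha| < N$ by
an element of the same symbol class.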
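In particular, for $\rho = 1$, $\delta = 0$, we have

(14.27)   $(a \circ b)(x, \xi) = a(x, \xi)\, b(x, \xi) + \dfrac{i}{2}\{a, b\}(x, \xi) \mod S^{m+\mu-2}_{1,0},$

where $\{a, b\}$ is the Poisson bracket, while

(14.28)   $(a \# b)(x, \xi) = a(x, \xi)\, b(x, \xi) - i \displaystyle\sum_j \dfrac{\partial a}{\partial \xi_j} \dfrac{\partial b}{\partial x_j} \mod S^{m+\mu-2}_{1,0}.$

Consequently, in the scalar case,

(14.29)   $[a(X, D), b(X, D)] = [a(x, D), b(x, D)] = e(x, D) = e(X, D) \mod OPS^{m+\mu-2}_{1,0},$

with

(14.30)   $e(x, \xi) = i\{a, b\}(x, \xi).$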




Now we point out one of the most useful aspects of the difference between
(14.27) and (14.28). Namely, one starts with an operator $A = a(X, D) =
a_1(x, D)$, maybe a differential operator, and perhaps one wants to construct
a parametrix for $A$, or perhaps a “heat semigroup” $e^{-tA}$, under appropriate
hypotheses. In such a case, the leading term in the symbol of the operator
$b(X, D) = b_1(x, D)$ used in (14.20) or (14.25) is a function of $a(x, \xi)$, for example,
$a(x, \xi)^{-1}$, or $e^{-ta(x,\xi)}$. But then, at least when $a(x, \xi)$ is scalar, the last term
in (14.27) vanishes! On the other hand, the last term in (14.28) generally does
not vanish. From this it follows that, with a given amount of work, one can often
construct a more accurate approximation to a parametrix using the Weyl calculus,
instead of using the constructions of the previous sections.

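In the remainder of this section, we illustrate this point by reconsidering the
parametrix construction for the heat equation, made in §13. Thus, we look again
at

(14.31)   $\dfrac{\partial u}{\partial t} = -Lu, \quad u(0) = f.$

This time, set

(14.32)   $Lu = a(X, D)u + b(x)u,$

where

(14.33)   $a(x, \xi) = \displaystyle\sum g^{jk}(x)\,\xi_j\xi_k + \sum \ell_j(x)\,\xi_j = g(x, \xi) + \ell(x, \xi).$

We assume $g(x, \xi)$ is scalar, while $\ell(x, \xi)$ and $b(x)$ can be $K \times K$ matrix-valued.
As the notation indicates, we assume $(g^{jk})$ is positive-definite, defining an inner
product on cotangent vectors, corresponding to a Riemannian metric $(g_{jk})$. We
note that a symbol that is a polynomial in $\xi$ also defines a differential operator in
the Weyl calculus. For example,

(14.34)   $\ell(X, D)u = \dfrac{1}{2}\displaystyle\sum_j \bigl[\ell_j(x)\, D_j u + D_j(\ell_j u)\bigr] = \dfrac{1}{i}\sum_j \Bigl[\ell_j(x)\,\partial_j u + \dfrac{1}{2}(\partial_j \ell_j)u\Bigr]$

and

(14.35)   $a(X, D)u = -\displaystyle\sum_{j,k} \Bigl[a_{jk}(x)\,\partial_j\partial_k u + (\partial_j a_{jk})\,\partial_k u + \dfrac{1}{4}(\partial_j\partial_k a_{jk})u\Bigr] = -\sum_{j,k} \Bigl[\partial_j(a_{jk}\,\partial_k u) + \dfrac{1}{4}(\partial_j\partial_k a_{jk})u\Bigr].$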

We use the Weyl calculus to construct a parametrix for (14.31). We will begin by
treating the case when all the terms in (14.33) are scalar, and then we will discuss
the case when only $g(x, \xi)$ is assumed to be scalar.

We want to write an approximate solution to (14.31) as

(14.36)   $u = E(t, X, D)f.$

We write

(14.37)   $E(t, x, \xi) \sim E_0(t, x, \xi) + E_1(t, x, \xi) + \cdots$

and obtain the various terms recursively. The PDE (14.31) requires

(14.38)   $\dfrac{\partial}{\partial t} E(t, X, D) = -LE(t, X, D) = -(L \circ E)(t, X, D),$

where, by the Weyl calculus,

(14.39)   $(L \circ E)(t, x, \xi) \sim L(x, \xi)\, E(t, x, \xi) + \displaystyle\sum_{j \ge 1} \dfrac{1}{j!}\, \{L, E\}_j(t, x, \xi).$

It is natural to set

(14.40)   $E_0(t, x, \xi) = e^{-ta(x,\xi)},$


as in (13.9). Note that the Weyl calculus applied to this term provides a better
approximation than the previous calculus, because

(14.41)   $\{a, e^{-ta}\}_1 = 0.$

If we plug (14.37) into (14.39) and collect the highest order nonvanishing terms,
we are led to define $E_1(t, x, \xi)$ as the solution to the “transport equation”

(14.42)   $\dfrac{\partial E_1}{\partial t} = -aE_1 - \dfrac{1}{2}\{a, E_0\}_2 - b(x)E_0, \quad E_1(0, x, \xi) = 0.$

Let us set

(14.43)   $\Omega_1(t, x, \xi) = -\dfrac{1}{2}\{a, e^{-ta}\}_2 - b(x)\, e^{-ta(x,\xi)}.$

Then the solution to (14.42) is

(14.44)   $E_1(t, x, \xi) = \displaystyle\int_0^t e^{(s-t)a(x,\xi)}\, \Omega_1(s, x, \xi)\, ds.$




Higher terms $E_j(t, x, \xi)$ are then obtained in a straightforward fashion. This
construction is similar to (13.6)-(13.10), but there is the following important difference.
Once you have $E_0(t, x, \xi)$ and $E_1(t, x, \xi)$ here, you have the first two
terms in the expansion of the integral kernel of $e^{-tL}$ on the diagonal:

(14.45)   $K(t, x, x) \sim c_0(x)\, t^{-n/2} + c_1(x)\, t^{-n/2+1} + \cdots.$

To get so far using the method of §13, it is necessary to go further and compute
the solution $a_2(t, x, \xi)$ to the next transport equation. Since the formulas become
rapidly more complicated, the advantage is with the method of this section. We
proceed with an explicit determination of the first two terms in (14.45).

Thus we now evaluate the integral in (14.44). Clearly, 


t 
(14.46) 4 els—Halz8) b(7)e— SU) ds = th(x)e— ta), 
0 


Now, a straightforward calculation yields

(14.47)  $\{a, e^{-sa}\}_2 = \frac{s}{2} Q(\nabla^2 a) e^{-sa} - \frac{s^2}{4} T(\nabla a, \nabla^2 a) e^{-sa},$


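where

(14.48)  $Q(\nabla^2 a) = \sum_{k,\ell} \bigl\{ (\partial_{\xi_k} \partial_{\xi_\ell} a)(\partial_{x_k} \partial_{x_\ell} a) - (\partial_{\xi_k} \partial_{x_\ell} a)(\partial_{x_k} \partial_{\xi_\ell} a) \bigr\}$

and

(14.49)  $T(\nabla a, \nabla^2 a) = \sum_{k,\ell} \bigl\{ (\partial_{\xi_k} \partial_{\xi_\ell} a)(\partial_{x_k} a)(\partial_{x_\ell} a) + (\partial_{x_k} \partial_{x_\ell} a)(\partial_{\xi_k} a)(\partial_{\xi_\ell} a) - 2 (\partial_{\xi_k} \partial_{x_\ell} a)(\partial_{x_k} a)(\partial_{\xi_\ell} a) \bigr\}.$
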
Therefore,

(14.50)  $\int_0^t e^{(s-t)a} \{a, e^{-sa}\}_2\, ds = \frac{t^2}{4} Q(\nabla^2 a) e^{-ta} - \frac{t^3}{12} T(\nabla a, \nabla^2 a) e^{-ta}.$

We get $E_1(t, x, \xi)$ in (14.44) from (14.46) and (14.50).
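
Spelled out, combining the two integrals with the weights in (14.43) gives a closed form for $E_1$. This is a sketch in the scalar case; the signs follow the definition of $\Omega_1$:

```latex
\begin{align*}
E_1(t,x,\xi)
  &= -\tfrac12\int_0^t e^{(s-t)a}\{a,e^{-sa}\}_2\,ds
     - \int_0^t e^{(s-t)a}\,b(x)\,e^{-sa}\,ds \\
  &= \Bigl(-t\,b(x) - \tfrac{t^2}{8}\,Q(\nabla^2 a)
     + \tfrac{t^3}{24}\,T(\nabla a,\nabla^2 a)\Bigr)\,e^{-ta(x,\xi)} .
\end{align*}
```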
Suppose for the moment that $\ell(x, \xi) = 0$ in (14.33), that is, $a(X, D) =
g(X, D)$. Suppose also that, for some point $x_0$,

(14.51)  $\nabla_x g^{jk}(x_0) = 0, \quad g^{jk}(x_0) = \delta_{jk}.$


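Then, at $x_0$,

(14.52)  $Q(\nabla^2 a) = \sum_{k,\ell} (\partial_{\xi_k} \partial_{\xi_\ell} a)(\partial_{x_k} \partial_{x_\ell} a) = 2 \sum_{j,k,\ell} \frac{\partial^2 g^{jk}}{\partial x_\ell^2}(x_0)\, \xi_j \xi_k$

and

(14.53)  $T(\nabla a, \nabla^2 a) = \sum_{k,\ell} (\partial_{x_k} \partial_{x_\ell} a)(\partial_{\xi_k} a)(\partial_{\xi_\ell} a) = 4 \sum_{j,k,\ell,m} \frac{\partial^2 g^{jk}}{\partial x_\ell \partial x_m}(x_0)\, \xi_j \xi_k \xi_\ell \xi_m.$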


Such a situation as (14.51) arises if $g^{jk}(x)$ comes from a metric tensor $g_{jk}(x)$,
and one uses geodesic normal coordinates centered at $x_0$. Now the Laplace-Beltrami
operator is given by

(14.54)  $\Delta u = g^{-1/2} \sum_{j,k} \partial_j g^{jk} g^{1/2} \partial_k u,$

where $g = \det(g_{jk})$. This is symmetric when one uses the Riemannian volume
element $dV = \sqrt{g}\, dx_1 \cdots dx_n$. To use the Weyl calculus, we want an operator
that is symmetric with respect to the Euclidean volume element $dx_1 \cdots dx_n$, so
we conjugate $\Delta$ by multiplication by $g^{1/4}$:

(14.55)  $-Lu = g^{1/4} \Delta(g^{-1/4} u) = g^{-1/4} \sum_{j,k} \partial_j g^{jk} g^{1/2} \partial_k (g^{-1/4} u).$

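Note that the integral kernel $k_t^L(x, y)$ of $e^{-tL}$ is $g^{1/4}(x)\, k_t^\Delta(x, y)\, g^{-1/4}(y)$, where $k_t^\Delta$ is the kernel of $e^{t\Delta}$; in particular, of course, the two kernels coincide on the diagonal $x = y$. To compare $L$
with $g(X, D)$, note that

(14.56)  $-Lu = \sum_{j,k} \partial_j g^{jk} \partial_k u + \Phi(x) u,$

where

(14.57)  $\Phi(x) = \sum_{j,k} \partial_j \bigl( g^{jk} g^{1/4} (\partial_k g^{-1/4}) \bigr) - \sum_{j,k} g^{jk} g^{1/2} (\partial_j g^{-1/4})(\partial_k g^{-1/4}).$

If $g^{jk}(x)$ satisfies (14.51), we see that

(14.58)  $\Phi(x_0) = \sum_\ell \partial_\ell^2 g^{-1/4}(x_0) = -\frac{1}{4} \sum_\ell \partial_\ell^2 g(x_0).$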


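Since $g(x_0 + h e_\ell) = \det\bigl( \delta_{jk} + \frac{1}{2} h^2 \partial_\ell^2 g_{jk} \bigr) + O(h^3)$, we have

(14.59)  $\Phi(x_0) = -\frac{1}{4} \sum_{j,\ell} \partial_\ell^2 g_{jj}(x_0).$

By comparison, note that, by (14.35),

(14.60)  $g(X, D)u = -\sum_{j,k} \partial_j g^{jk} \partial_k u + \Psi(x) u, \quad \Psi(x) = -\frac{1}{4} \sum_{j,k} \partial_j \partial_k g^{jk}(x).$

If $x_0$ is the center of a normal coordinate system, we can express these results
in terms of curvature, using

(14.61)  $\partial_\ell \partial_m g^{jk}(x_0) = \frac{1}{3} R_{j\ell km}(x_0) + \frac{1}{3} R_{jmk\ell}(x_0),$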


in terms of the components of the Riemann curvature tensor, which follows from 
formula (3.51) of Appendix C. Thus we get 


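(14.62)  $\Phi(x_0) = \frac{1}{4} \cdot \frac{2}{3} \sum_{j,\ell} R_{j\ell j\ell}(x_0) = \frac{1}{6} S(x_0),$
         $\Psi(x_0) = -\frac{1}{4} \cdot \frac{1}{3} \sum_{j,k} \bigl[ R_{jjkk}(x_0) + R_{jkkj}(x_0) \bigr] = \frac{1}{12} S(x_0).$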


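Here $S$ is the scalar curvature of the metric $g_{jk}$.
When $a(X, D) = g(X, D)$, we can express the quantities (14.52) and (14.53)
in terms of curvature:

(14.63)  $Q(\nabla^2 g) = \frac{4}{3} \sum_{j,k,\ell} R_{j\ell k\ell}(x_0)\, \xi_j \xi_k = \frac{4}{3} \sum_{j,k} \mathrm{Ric}_{jk}(x_0)\, \xi_j \xi_k,$

where $\mathrm{Ric}_{jk}$ denotes the components of the Ricci tensor, and

(14.64)  $T(\nabla g, \nabla^2 g) = \frac{8}{3} \sum_{j,k,\ell,m} R_{j\ell km}(x_0)\, \xi_j \xi_k \xi_\ell \xi_m = 0,$

the cancelation here resulting from the antisymmetry of $R_{j\ell km}$ in $(j, \ell)$ and in
$(k, m)$.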
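
The intermediate steps behind (14.63) and (14.64) can be sketched as follows. The sketch substitutes (14.61) into (14.52) and (14.53), and assumes the index convention $\mathrm{Ric}_{jk} = \sum_\ell R_{j\ell k\ell}$:

```latex
\begin{align*}
Q(\nabla^2 g)
  &= 2\sum_{j,k,\ell}\partial_\ell^2 g^{jk}(x_0)\,\xi_j\xi_k
   = 2\sum_{j,k,\ell}\tfrac23\,R_{j\ell k\ell}(x_0)\,\xi_j\xi_k
   = \tfrac43\,\mathrm{Ric}(\xi,\xi), \\
T(\nabla g,\nabla^2 g)
  &= 4\sum_{j,k,\ell,m}\tfrac13\bigl[R_{j\ell km}+R_{jmk\ell}\bigr](x_0)\,
     \xi_j\xi_k\xi_\ell\xi_m
   = \tfrac83\sum_{j,k,\ell,m}R_{j\ell km}(x_0)\,\xi_j\xi_k\xi_\ell\xi_m, \\
\sum_{j,\ell}R_{j\ell km}\,\xi_j\xi_\ell
  &= -\sum_{j,\ell}R_{\ell jkm}\,\xi_j\xi_\ell
   = -\sum_{j,\ell}R_{j\ell km}\,\xi_j\xi_\ell = 0 .
\end{align*}
```

In the second line the two curvature terms coincide after relabeling $\ell \leftrightarrow m$; in the third, antisymmetry in the first index pair is combined with the symmetry of $\xi_j \xi_\ell$.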

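Thus the heat kernel for (14.31) with

(14.65)  $Lu = g(X, D)u + b(x)u$

is of the form (14.36)-(14.37), with $E_0 = e^{-tg(x,\xi)}$ and

(14.66)  $E_1(t, x, \xi) = \Bigl( -t\, b(x) - \Bigl[ \frac{t^2}{8} Q(\nabla^2 g) - \frac{t^3}{24} T(\nabla g, \nabla^2 g) \Bigr] \Bigr) e^{-tg(x,\xi)} = -\Bigl( t\, b(x) + \frac{t^2}{6} \mathrm{Ric}(\xi, \xi) \Bigr) e^{-t|\xi|^2},$

at $x = x_0$. Note that $g(x_0, \xi) = |\xi|^2$.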
Now the integral kernel of $E_j(t, X, D)$ is

(14.67)  $K_j(t, x, y) = (2\pi)^{-n} \int E_j\Bigl( t, \frac{x+y}{2}, \xi \Bigr) e^{i(x-y)\cdot\xi}\, d\xi.$

In particular, on the diagonal we have

(14.68)  $K_j(t, x, x) = (2\pi)^{-n} \int E_j(t, x, \xi)\, d\xi.$

We want to compute these quantities, for $j = 0, 1$, and at $x = x_0$. First,

(14.69)  $K_0(t, x_0, x_0) = (2\pi)^{-n} \int e^{-t|\xi|^2}\, d\xi = (4\pi t)^{-n/2},$


since, as we know, the Gaussian integral in (14.69) is equal to $(\pi/t)^{n/2}$. Next,

(14.70)  $(2\pi)^n K_1(t, x_0, x_0) = -t\, b(x_0) \int e^{-t|\xi|^2}\, d\xi - \frac{t^2}{6} \sum_{j,k} \mathrm{Ric}_{jk}(x_0) \int \xi_j \xi_k e^{-t|\xi|^2}\, d\xi.$


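We need to compute more Gaussian integrals. If $j \ne k$, the integrand is an odd
function of $\xi_j$, so the integral vanishes. On the other hand,

(14.71)  $\int \xi_j^2 e^{-t|\xi|^2}\, d\xi = \frac{1}{n} \int |\xi|^2 e^{-t|\xi|^2}\, d\xi = -\frac{1}{n} \frac{d}{dt} \int e^{-t|\xi|^2}\, d\xi = \frac{1}{2} \pi^{n/2} t^{-n/2-1}.$

Thus

(14.72)  $K_1(t, x_0, x_0) = -(4\pi t)^{-n/2} \Bigl( t\, b(x_0) + \frac{t}{12} S(x_0) \Bigr),$

since $\sum_j \mathrm{Ric}_{jj}(x) = S(x)$.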

As noted above, the Laplace operator $\Delta$ on scalar functions, when conjugated
by $g^{1/4}$, has the form (14.65), with $b(x_0) = -\Phi(x_0) - \Psi(x_0) = -S(x_0)/4$. Thus,
for the heat kernel $e^{t\Delta}$ on scalars, we have

(14.73)  $K_1(t, x_0, x_0) = (4\pi t)^{-n/2}\, \frac{t}{6} S(x_0).$
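
The arithmetic behind (14.73) is short enough to record. It assumes the values $\Phi(x_0) = S(x_0)/6$ and $\Psi(x_0) = S(x_0)/12$ obtained in (14.62):

```latex
\begin{align*}
b(x_0) &= -\Phi(x_0) - \Psi(x_0)
        = -\tfrac16 S(x_0) - \tfrac1{12} S(x_0) = -\tfrac14 S(x_0), \\
K_1(t,x_0,x_0) &= -(4\pi t)^{-n/2}\, t\,\Bigl(-\tfrac14 + \tfrac1{12}\Bigr) S(x_0)
               = (4\pi t)^{-n/2}\,\tfrac{t}{6}\, S(x_0) .
\end{align*}
```

Together with (14.69), this gives the familiar leading behavior $(4\pi t)^{-n/2} \bigl( 1 + \frac{t}{6} S(x_0) + \cdots \bigr)$ of the scalar heat kernel on the diagonal.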
We now generalize this, setting

(14.74)  $a(x, \xi) = g(x, \xi) + \ell(x, \xi), \quad \ell(x, \xi) = \sum_j \ell_j(x) \xi_j.$

Continue to assume that $a(x, \xi)$ is scalar and consider $L = a(X, D) + b(x)$. We
have

(14.75)  $E_0(t, x, \xi) = e^{-ta(x,\xi)} = e^{-t\ell(x,\xi)} e^{-tg(x,\xi)},$

and $E_1(t, x, \xi)$ is still given by (14.42)-(14.50). A point to keep in mind is that we
can drop $\ell(x, \xi)$ from the computation involving $\{a, e^{-ta}\}_2$, altering $K_1(t, x, x)$
only by $o(t^{-n/2+1})$ as $t \searrow 0$. Thus, mod $o(t^{-n/2+1})$, $K_1(t, x_0, x_0)$ is still given
by (14.72). To get $K_0(t, x_0, x_0)$, expand $e^{-t\ell(x,\xi)}$ in (14.75) in powers of $t$:


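(14.76)  $E_0(t, x, \xi) \sim \Bigl[ 1 - t \ell(x, \xi) + \frac{t^2}{2} \ell(x, \xi)^2 + \cdots \Bigr] e^{-tg(x,\xi)}.$

When doing the $\xi$-integral, the term $t \ell(x, \xi)$ is obliterated, of course, while, by
(14.71),

(14.77)  $\frac{t^2}{2} \int \ell(x_0, \xi)^2 e^{-t|\xi|^2}\, d\xi = \frac{1}{4} \pi^{n/2} t^{-n/2+1} \sum_j \ell_j(x_0)^2.$

Hence, in this situation,

(14.78)  $K_0(t, x_0, x_0) + K_1(t, x_0, x_0) = (4\pi t)^{-n/2} \Bigl[ 1 + t \Bigl( \frac{1}{4} \sum_j \ell_j(x_0)^2 - b(x_0) - \frac{1}{12} S(x_0) \Bigr) + o(t) \Bigr].$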
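
A simple sanity check on the coefficient $\frac{1}{4}$ of $\sum_j \ell_j(x_0)^2$ is available in a special case. Take the flat metric $g^{jk} = \delta_{jk}$ on $\mathbb{R}^n$ and real constants $\ell_j$ and $b$ (an illustrative example, not part of the argument above), with $D_j = -i \partial_j$. Then $L$ is conjugate to the free Laplacian up to a constant:

```latex
\begin{align*}
L &= -\Delta + \sum_j \ell_j D_j + b
   = \sum_j \bigl(D_j + \tfrac12 \ell_j\bigr)^2 - \tfrac14 |\ell|^2 + b
   = e^{-i\ell\cdot x/2}\,(-\Delta)\,e^{i\ell\cdot x/2} - \tfrac14 |\ell|^2 + b, \\
K(t,x,x) &= (4\pi t)^{-n/2}\, e^{t(|\ell|^2/4 - b)}
          = (4\pi t)^{-n/2}\Bigl[1 + t\bigl(\tfrac14 |\ell|^2 - b\bigr) + O(t^2)\Bigr] .
\end{align*}
```

This agrees with (14.78) when $S = 0$.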


Finally, we drop the assumption that $\ell(x, \xi)$ in (14.74) be scalar. We still
assume that $g(x, \xi)$ defines the metric tensor. There are several changes whose
effects on (14.78) need to be investigated. In the first place, (14.41) is no longer
quite true. We have

(14.79)  $\{a, e^{-ta}\}_1 = \frac{i}{2} \sum_j \Bigl[ \frac{\partial a}{\partial x_j} \frac{\partial}{\partial \xi_j} e^{-ta} - \frac{\partial a}{\partial \xi_j} \frac{\partial}{\partial x_j} e^{-ta} \Bigr].$

In this case, with $a(x, \xi)$ matrix-valued, we have
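
(14.80)  $\frac{\partial}{\partial x_j} e^{-ta} = -t e^{-ta}\, \Xi(\mathrm{ad}(-ta)) \Bigl( \frac{\partial a}{\partial x_j} \Bigr) = -t e^{-ta}\, \Xi(\mathrm{ad}(-t\ell)) \Bigl( \frac{\partial a}{\partial x_j} \Bigr),$

where $\Xi(z) = (1 - e^{-z})/z$, so

(14.81)  $\frac{\partial}{\partial x_j} e^{-ta} = -t e^{-ta} \Bigl( \frac{\partial a}{\partial x_j} + \frac{t}{2} \Bigl[ \ell, \frac{\partial a}{\partial x_j} \Bigr] + \cdots \Bigr) = -t \frac{\partial a}{\partial x_j} e^{-ta} + O(t^2 |\xi|^2) e^{-ta} + \cdots,$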

and so forth. Hence 


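(14.82)  $\{a, e^{-ta}\}_1 = -\frac{it}{2} \sum_j \Bigl[ \frac{\partial \ell}{\partial x_j}, \frac{\partial \ell}{\partial \xi_j} \Bigr] e^{-ta} + \cdots.$

This is smaller than any of the terms in the transport equation (14.42) for $E_1$, so
it could be put in a higher transport equation. It does not affect (14.78).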
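Another change comes from the following modification of (14.46):

(14.83)  $\int_0^t e^{(s-t)a(x,\xi)}\, b(x)\, e^{-sa(x,\xi)}\, ds = \Bigl[ \int_0^t e^{(s-t)\ell(x,\xi)}\, b(x)\, e^{-(s-t)\ell(x,\xi)}\, ds \Bigr] \cdot e^{-ta(x,\xi)}.$

This time, $b(x)$ and $\ell(x, \xi)$ may not commute. We can write the right side as

(14.84)  $\int_0^t e^{(s-t)\, \mathrm{ad}\, \ell(x,\xi)} [b(x)]\, ds\; e^{-t\ell(x,\xi)} e^{-tg(x,\xi)} = t \Bigl\{ b(x) - \frac{t}{2} \bigl( \ell(x, \xi) b(x) + b(x) \ell(x, \xi) \bigr) + \cdots \Bigr\} e^{-tg(x,\xi)}.$

Due to the extra power of $t$ with the anticommutator, this does not lead to a change
in (14.78).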

The other change in letting $\ell(x, \xi)$ be nonscalar is that the quantity $\ell(x, \xi)^2 =
\sum_{j,k} \ell_j(x) \ell_k(x) \xi_j \xi_k$ generally has noncommuting factors, but this also does not
affect (14.78). Consequently, allowing $\ell(x, \xi)$ to be nonscalar does not change
(14.78). We state our conclusion:


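Theorem 14.1. If $Lu = a(X, D)u + b(x)u$, with

(14.85)  $a(x, \xi) = \sum_{j,k} g^{jk}(x) \xi_j \xi_k + \sum_j \ell_j(x) \xi_j,$

where $(g^{jk})$ is the inverse of a metric tensor $(g_{jk})$, and $\ell_j(x)$ and $b(x)$ are matrix-valued,
and if $g_{jk}(x_0) = \delta_{jk}$, $\nabla g_{jk}(x_0) = 0$, then the integral kernel $K(t, x, y)$
of $e^{-tL}$ has the property

(14.86)  $K(t, x_0, x_0) = (4\pi t)^{-n/2} \Bigl[ 1 + t \Bigl( \frac{1}{4} \sum_j \ell_j(x_0)^2 - b(x_0) - \frac{1}{12} S(x_0) \Bigr) + O(t^2) \Bigr].$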


Exercises

1. If $a(x, \xi) = \sum_\alpha a_\alpha(x) \xi^\alpha$ is a polynomial in $\xi$, so that $a(x, D)$ is a differential operator,
   show that $a(X, D)$ is also a differential operator, given by

   $a(X, D)u(x) = \sum_\alpha D_y^\alpha \Bigl[ a_\alpha\Bigl( \frac{x+y}{2} \Bigr) u(y) \Bigr] \Big|_{y=x} = \sum_\alpha \sum_{\beta+\gamma=\alpha} \binom{\alpha}{\beta} 2^{-|\beta|} D^\beta a_\alpha(x)\, D^\gamma u(x).$

   Verify the formulas (14.34) and (14.35) as special cases.
2. If $p \in S^m_{1,0}$ and $q \in S^\mu_{1,0}$ are scalar symbols and $p \circ q$ is defined so that the product
   $p(X, D) q(X, D) = (p \circ q)(X, D)$, as in (14.22)-(14.23), show that

   $q \circ p \circ q = q^2 p \mod S^{m+2\mu-2}_{1,0}.$

   More generally, if $p_{jk} \in S^m_{1,0}$, $p_{jk} = p_{kj}$, and $q_j \in S^\mu_{1,0}$, show that

   $\sum_{j,k} q_j \circ p_{jk} \circ q_k = \sum_{j,k} q_j p_{jk} q_k \mod S^{m+2\mu-2}_{1,0}.$

   Relate this to the last identity in (14.35), comparing a second-order differential operator
   in the Weyl calculus and in divergence form.


15. Operators of harmonic oscillator type 


In this section we study operators with symbols in $S_1^m(\mathbb{R}^n)$, defined to consist of
functions $p(x, \xi)$, smooth on $\mathbb{R}^{2n}$ and satisfying

(15.1)  $|D_x^\beta D_\xi^\alpha p(x, \xi)| \le C_{\alpha\beta} (1 + |x| + |\xi|)^{m-|\alpha|-|\beta|}.$

This class has the property of treating $x$ and $\xi$ on the same footing. We define
$OPS_1^m(\mathbb{R}^n)$ to consist of operators $p(X, D)$ with $p(x, \xi) \in S_1^m(\mathbb{R}^n)$. Here we
use the Weyl calculus, (14.5). In this setting, the $Sp(n, \mathbb{R})$ action (14.18)-(14.19)
can be well exploited. This action does not preserve $S^m_{1,0}(\mathbb{R}^n)$, but it does preserve
$S_1^m(\mathbb{R}^n)$. The class $OPS_1^m(\mathbb{R}^n)$ has been studied in [GLS, Ho4], and [V], and
played a role in microlocal analysis on the Heisenberg group in [T2].

Note that 


(15.2)  $S_1^0(\mathbb{R}^n) \subset S^0_{0,0}(\mathbb{R}^n),$

so it follows from Theorem 6.3, plus (14.6)–(14.8), that

(15.3)  $P \in OPS_1^0(\mathbb{R}^n) \Longrightarrow P : L^2(\mathbb{R}^n) \to L^2(\mathbb{R}^n).$


If $a \in S_1^m$ and $b \in S_1^\mu$, variants of methods of §3 and (14.22)-(14.24) give

(15.4)  $a(X, D) b(X, D) = (a \circ b)(X, D) \in OPS_1^{m+\mu}(\mathbb{R}^n),$

with

(15.5)  $(a \circ b)(x, \xi) \sim ab + \sum_{j \ge 1} \frac{1}{j!} \{a, b\}_j(x, \xi),$

where $\{a, b\}_j$ is given by (14.24). Note that

(15.6)  $\{a, b\}_j \in S_1^{m+\mu-2j}(\mathbb{R}^n).$

We mention that if either $a(x, \xi)$ or $b(x, \xi)$ is a polynomial in $(x, \xi)$, then the sum
in (15.5) is finite and provides an exact formula for $(a \circ b)(x, \xi)$.

The set of “classical” symbols in $S_1^m(\mathbb{R}^n)$, denoted $S^m(\mathbb{R}^n)$, is defined to
consist of all $p(x, \xi) \in S_1^m(\mathbb{R}^n)$ such that

(15.7)  $p(x, \xi) \sim \sum_{j \ge 0} p_j(x, \xi),$

with $p_j(x, \xi)$ smooth and, for $|x|^2 + |\xi|^2 \ge 1$, homogeneous of degree $m - 2j$ in
$(x, \xi)$. The meaning of (15.7) is that for each $N$,

(15.8)  $p(x, \xi) - \sum_{j=0}^{N-1} p_j(x, \xi) \in S_1^{m-2N}(\mathbb{R}^n).$

It follows from (15.4)-(15.6) that

(15.9)  $a \in S^m(\mathbb{R}^n),\ b \in S^\mu(\mathbb{R}^n) \Longrightarrow a(X, D) b(X, D) = (a \circ b)(X, D),\ a \circ b \in S^{m+\mu}(\mathbb{R}^n).$


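Sobolev spaces tailored to these operator classes are defined as follows, for
$k \in \mathbb{Z}^+$:

(15.10)  $\mathcal{H}^k(\mathbb{R}^n) = \{ u \in L^2(\mathbb{R}^n) : Pu \in L^2(\mathbb{R}^n),\ \forall P \in \mathcal{D}^k(\mathbb{R}^n) \},$
         $\mathcal{D}^k(\mathbb{R}^n) = \text{span of } x^\beta D^\alpha,\ |\alpha| + |\beta| \le k.$

Note that $\mathcal{D}^k(\mathbb{R}^n) \subset OPS^k(\mathbb{R}^n)$. The following Rellich type theorem is straightforward:

(15.11)  The natural inclusion $\mathcal{H}^k(\mathbb{R}^n) \hookrightarrow L^2(\mathbb{R}^n)$ is compact, $\forall k \ge 1$.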


The results (15.4) and (15.3) yield, for $k \in \mathbb{Z}^+$,

(15.12)  $A \in OPS_1^{-k}(\mathbb{R}^n) \Longrightarrow A : L^2(\mathbb{R}^n) \to \mathcal{H}^k(\mathbb{R}^n).$


We will obtain other Sobolev mapping properties below. These spaces will be seen 
to be natural settings for elliptic regularity results. 
An operator $P = p(X, D) \in OPS_1^m(\mathbb{R}^n)$ is said to be elliptic provided

(15.13)  $|p(x, \xi)^{-1}| \le C (1 + |x| + |\xi|)^{-m},$

for $|x|^2 + |\xi|^2$ sufficiently large. With the results (15.4)-(15.6) in hand, natural
variants of the parametrix construction of §4 yield for such elliptic $P$,

(15.14)  $Q \in OPS_1^{-m}(\mathbb{R}^n), \quad PQ = I + R_1, \quad QP = I + R_2, \quad R_j \in OPS^{-\infty}(\mathbb{R}^n) = \bigcap_{k \ge 1} OPS_1^{-k}(\mathbb{R}^n).$

Clearly, for each $m \in \mathbb{R}$,

(15.15)  $\Lambda_m(x, \xi) = (1 + |x|^2 + |\xi|^2)^{m/2}$

is the symbol of an elliptic operator in $OPS_1^m(\mathbb{R}^n)$. We have

(15.16)  $\Lambda_m(X, D) \Lambda_{-m}(X, D) = I + R_m, \quad R_m \in OPS_1^{-4}(\mathbb{R}^n).$

In this situation, (15.5) applies, and $\{\Lambda_m, \Lambda_{-m}\}_1 = 0$.
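
The vanishing of the first bracket is a general feature of symbols that depend on $(x, \xi)$ only through $Q = |x|^2 + |\xi|^2$. A short computation shows this, assuming the convention that $\{a, b\}_1$ is a constant multiple of the Poisson bracket:

```latex
\begin{align*}
\sum_k \Bigl(\frac{\partial F(Q)}{\partial \xi_k}\frac{\partial G(Q)}{\partial x_k}
  - \frac{\partial F(Q)}{\partial x_k}\frac{\partial G(Q)}{\partial \xi_k}\Bigr)
  &= F'(Q)\,G'(Q)\sum_k \bigl(2\xi_k \cdot 2x_k - 2x_k \cdot 2\xi_k\bigr) = 0 .
\end{align*}
```

With $F(Q) = (1 + Q)^{m/2}$ and $G(Q) = (1 + Q)^{-m/2}$ this gives $\{\Lambda_m, \Lambda_{-m}\}_1 = 0$, so the first correction to $\Lambda_m \Lambda_{-m} = 1$ comes from $\{\cdot, \cdot\}_2$, which by (15.6) has order $-4$.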
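We now introduce the central operator in this class, the harmonic oscillator,

(15.17)  $H = -\Delta + |x|^2 = \sum_{j=1}^n \Bigl( -\frac{\partial^2}{\partial x_j^2} + x_j^2 \Bigr).$

This is an elliptic element of $OPS^2(\mathbb{R}^n)$, with symbol $|x|^2 + |\xi|^2$. It defines a
positive, self-adjoint operator on $L^2(\mathbb{R}^n)$. Note that

(15.18)  $L_j = \partial_j + x_j \Longrightarrow L_j^* = -\partial_j + x_j \Longrightarrow L_j^* L_j = -\partial_j^2 + x_j^2 - 1 \Longrightarrow H = \sum_j L_j^* L_j + n,$

so $H$ is positive definite, with $H^{-1}$ bounded on $L^2(\mathbb{R}^n)$. The following result will
be very useful.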


Theorem 15.1. For all $s \in (0, \infty)$, $H^{-s} \in OPS^{-2s}(\mathbb{R}^n)$. With $\Lambda_m(x, \xi)$ as in
(15.15),

(15.19)  $H^{-s} - \Lambda_{-2s}(X, D) \in OPS^{-2s-2}(\mathbb{R}^n).$

We postpone the proof of Theorem 15.1 and observe some of its consequences.

Proposition 15.2. For $k \in \mathbb{Z}^+$,

(15.20)  $H^{-k/2} : L^2(\mathbb{R}^n) \to \mathcal{H}^k(\mathbb{R}^n)$

is an isomorphism.

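Proof. The mapping property (15.20) follows from Theorem 15.1 and (15.12). If
$k = 2\ell$ is even, the two-sided inverse to (15.20) is

(15.21)  $H^\ell : \mathcal{H}^{2\ell}(\mathbb{R}^n) \to L^2(\mathbb{R}^n).$

We need to show that if $k = 2\ell - 1$ is odd,

(15.22)  $H^{k/2} = H^{\ell - 1/2} : \mathcal{H}^{2\ell-1}(\mathbb{R}^n) \to L^2(\mathbb{R}^n).$

Indeed, take $u \in \mathcal{H}^{2\ell-1}(\mathbb{R}^n)$. Then

(15.23)  $H^\ell u = \sum_j X_j u_j, \quad u_j \in L^2(\mathbb{R}^n), \quad X_j \in \mathcal{D}^1(\mathbb{R}^n),$

and hence

(15.24)  $H^{\ell - 1/2} u = \sum_j H^{-1/2} X_j u_j,$

which belongs to $L^2(\mathbb{R}^n)$ since $H^{-1/2} X_j \in OPS^0(\mathbb{R}^n)$.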
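Given Proposition 15.2, it is natural to set

(15.25)  $\mathcal{H}^s(\mathbb{R}^n) = H^{-s/2} L^2(\mathbb{R}^n),$

for $s \in \mathbb{R}$, and we have that this space agrees with (15.10) for $s = k \in \mathbb{Z}^+$. For
$s > 0$, this says

(15.26)  $\mathcal{H}^s(\mathbb{R}^n) = \mathcal{D}(H^{s/2}).$

Thus, by Proposition 2.2 of Chap. 4, we can identify $\mathcal{H}^s(\mathbb{R}^n)$ with the complex
interpolation space:

(15.27)  $\mathcal{H}^s(\mathbb{R}^n) = [L^2(\mathbb{R}^n), \mathcal{H}^k(\mathbb{R}^n)]_\theta, \quad s = k\theta, \quad \theta \in (0, 1).$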


Also note that 


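(15.28)  $\bigcap_{s < \infty} \mathcal{H}^s(\mathbb{R}^n) = \mathcal{S}(\mathbb{R}^n), \quad \bigcup_{s > -\infty} \mathcal{H}^s(\mathbb{R}^n) = \mathcal{S}'(\mathbb{R}^n).$

In fact, (15.10) gives $\bigcap_{k \in \mathbb{Z}^+} \mathcal{H}^k(\mathbb{R}^n) = \mathcal{S}(\mathbb{R}^n)$, and (15.25) gives $\mathcal{H}^s(\mathbb{R}^n)' =
\mathcal{H}^{-s}(\mathbb{R}^n)$.
Given Theorem 15.1, it easily follows that

(15.29)  $H^s \in OPS^{2s}(\mathbb{R}^n), \quad \forall s \in \mathbb{R}.$

In fact, given $s > 0$, take an integer $k > s$ and write $H^s = H^k H^{s-k}$. Also, given
(15.25), we have, for all $m, s \in \mathbb{R}$,

(15.30)  $P \in OPS_1^m(\mathbb{R}^n) \Longrightarrow P : \mathcal{H}^s(\mathbb{R}^n) \to \mathcal{H}^{s-m}(\mathbb{R}^n).$

Indeed, $P = H^{-(s-m)/2} (H^{(s-m)/2} P H^{-s/2}) H^{s/2}$, and $H^{(s-m)/2} P H^{-s/2} \in
OPS_1^0(\mathbb{R}^n)$ is bounded on $L^2(\mathbb{R}^n)$.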


We will approach the proof of Theorem 15.1 via the identity

(15.31)  $H^{-s} = \frac{1}{\Gamma(s)} \int_0^\infty t^{s-1} e^{-tH}\, dt, \quad s > 0.$

Thus we have the task of writing

(15.32)  $e^{-tH} = h_t(X, D)$

and computing $h_t(x, \xi)$. We need to solve

(15.33)  $\frac{\partial}{\partial t} h_t(X, D) = -H h_t(X, D), \quad h_0(x, \xi) = 1.$

Taking

(15.34)  $b_t(X, D) = H h_t(X, D), \quad H = Q(X, D), \quad Q(x, \xi) = |x|^2 + |\xi|^2,$

since $Q(x, \xi)$ is a polynomial, the formula (15.5) for composition is a finite sum,
and it is exact:

(15.35)  $b_t(x, \xi) = Q(x, \xi) h_t(x, \xi) + \sum_{j=1}^{2} \frac{1}{j!} \{Q, h_t\}_j(x, \xi).$


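Now we make the “guess” that for each $t > 0$, $h_t(x, \xi)$ is a function of $|x|^2 +
|\xi|^2 = Q$,

(15.36)  $h_t(x, \xi) = g(t, Q).$

In that case, $\{Q, h_t\}_1 = 0$, and (15.33)-(15.35) lead to the equation

(15.37)  $\frac{\partial h_t}{\partial t}(x, \xi) = -(|x|^2 + |\xi|^2) h_t(x, \xi) + \frac{1}{4} \sum_k \Bigl( \frac{\partial^2}{\partial x_k^2} + \frac{\partial^2}{\partial \xi_k^2} \Bigr) h_t(x, \xi),$

with initial condition $h_0(x, \xi) = 1$, or equivalently to solve

(15.38)  $\frac{\partial g}{\partial t} = -Qg + Q \frac{\partial^2 g}{\partial Q^2} + n \frac{\partial g}{\partial Q}, \quad g(0, Q) = 1.$
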
We now guess that (15.38) has a solution of the form

  $g(t, Q) = a(t) e^{b(t) Q}.$

Then the left side of (15.38) is $(a'/a + b'Q)g$ and the right side is $(-Q + Qb^2 +
nb)g$, so (15.38) is equivalent to

(15.39)  $\frac{a'(t)}{a(t)} = n b(t), \quad b'(t) = b(t)^2 - 1.$

We can solve the second equation for $b(t)$ by separation of variables. Since
$g(0, Q) = 1$, we need $b(0) = 0$, and the unique solution is

(15.40)  $b(t) = -\tanh t.$

Then the equation $a'/a = -n \tanh t$ with $a(0) = 1$ gives

(15.41)  $a(t) = (\cosh t)^{-n}.$

We have our desired formula

(15.42)  $h_t(x, \xi) = (\cosh t)^{-n} e^{-(\tanh t)(|x|^2 + |\xi|^2)}.$
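
It is easy to confirm (15.42) by substituting back into (15.38). Writing $g(t, Q) = (\cosh t)^{-n} e^{-Q \tanh t}$, a direct differentiation (a routine check, included here for convenience) gives:

```latex
\begin{align*}
\frac{\partial g}{\partial t}
  &= \bigl(-n\tanh t - Q\,\mathrm{sech}^2 t\bigr)\,g, \\
-Qg + Q\,\frac{\partial^2 g}{\partial Q^2} + n\,\frac{\partial g}{\partial Q}
  &= \bigl(-Q + Q\tanh^2 t - n\tanh t\bigr)\,g
   = \bigl(-n\tanh t - Q\,\mathrm{sech}^2 t\bigr)\,g .
\end{align*}
```

The last equality uses $1 - \tanh^2 t = \mathrm{sech}^2 t$, and $g(0, Q) = 1$ is immediate.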


We discuss briefly why the “guess” that $h_t(x, \xi)$ is a function of $|x|^2 + |\xi|^2$
was bound to succeed. It is related to the identity (14.19) for the composition of
operators transformed by $a_g(x, \xi) = a(g^{-1}(x, \xi))$, $g \in Sp(n, \mathbb{R})$. If we identify
$\mathbb{R}^{2n}$ with $\mathbb{C}^n$ and $(x, \xi)$ with $x + i\xi$, then the unitary group $U(n)$ acts on $\mathbb{C}^n =
\mathbb{R}^{2n}$, as a subgroup of $Sp(n, \mathbb{R})$, preserving $|x|^2 + |\xi|^2 = |x + i\xi|^2$. It follows
from (14.9) that the set of operators whose symbols are invariant under this $U(n)$
action forms an algebra. From there, it is a short step to guess that $e^{-tH}$ belongs
to this algebra. For more details, see Chap. 1, §7 of [T3].

We return to the identity (15.31), which implies

(15.43)  $H^{-s} = Q_{-s}(X, D)$

with

(15.44)  $Q_{-s}(x, \xi) = \frac{1}{\Gamma(s)} \int_0^\infty t^{s-1} (\cosh t)^{-n} e^{-(\tanh t)(|x|^2 + |\xi|^2)}\, dt.$

To complete the proof of Theorem 15.1, it remains to show that, whenever $s > 0$,

(15.45)  $Q_{-s}(x, \xi) \in S^{-2s}(\mathbb{R}^n)$, and $Q_{-s}(x, \xi) - \Lambda_{-2s}(x, \xi) \in S^{-2s-2}(\mathbb{R}^n)$.


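To begin, it is clear by inspection that $Q_{-s} \in C^\infty(\mathbb{R}^{2n})$ whenever $s > 0$. Also, if
we set

(15.46)  $Q^b_{-s}(x, \xi) = \frac{1}{\Gamma(s)} \int_0^1 t^{s-1} (\cosh t)^{-n} e^{-(\tanh t)(|x|^2 + |\xi|^2)}\, dt,$

we easily get

(15.47)  $Q_{-s}(x, \xi) - Q^b_{-s}(x, \xi) \in S^{-\infty}(\mathbb{R}^n).$

We can set $\tau = \tanh t$ and write

(15.48)  $Q^b_{-s}(x, \xi) = \frac{1}{\Gamma(s)} \int_0^a \tau^{s-1} \varphi(\tau) e^{-\tau(|x|^2 + |\xi|^2)}\, d\tau,$

with $a = \tanh 1$ and $\varphi \in C^\infty([0, a])$, with power series

(15.49)  $\varphi(\tau) \sim 1 + b_1 \tau^2 + b_2 \tau^4 + \cdots.$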


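Thus, as $|x|^2 + |\xi|^2 \to \infty$, we have

(15.50)  $Q_{-s}(x, \xi) \sim \sum_{j \ge 0} q_{-s,j}(x, \xi),$

with

(15.51)  $q_{-s,j}(x, \xi) = \frac{b_j}{\Gamma(s)} \int_0^\infty e^{-\tau(|x|^2 + |\xi|^2)} \tau^{s+2j-1}\, d\tau = b_j \frac{\Gamma(s + 2j)}{\Gamma(s)} (|x|^2 + |\xi|^2)^{-s-2j},$

and $b_0 = 1$. This proves Theorem 15.1.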
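
For the record, the integral in (15.51) is the standard Gamma-function integral, and the function $\varphi$ in (15.48) can be written down explicitly. The following is a sketch based on the substitution $t = \tanh^{-1} \tau$; the closed form for $\varphi$ and the value of $b_1$ are computed here and are not stated in the text:

```latex
\begin{align*}
\int_0^\infty e^{-\tau Q}\,\tau^{s+2j-1}\,d\tau
  &= \Gamma(s+2j)\,Q^{-s-2j}, \qquad Q = |x|^2+|\xi|^2 > 0, \\
(\cosh t)^{-n} = (1-\tau^2)^{n/2}, \quad dt = \frac{d\tau}{1-\tau^2}
  \;&\Longrightarrow\;
  \varphi(\tau) = \Bigl(\frac{\tanh^{-1}\tau}{\tau}\Bigr)^{s-1}(1-\tau^2)^{n/2-1}, \\
\varphi(\tau) &= 1 + \Bigl(\frac{s-1}{3} - \frac{n}{2} + 1\Bigr)\tau^2 + O(\tau^4) .
\end{align*}
```

In particular $\varphi$ is even in $\tau$, consistent with (15.49), and $q_{-s,0}(x, \xi) = (|x|^2 + |\xi|^2)^{-s}$.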


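Remark: We can sharpen (15.45) as follows. Replace $\Lambda_{-2s}(x, \xi)$ by $\tilde{\Lambda}_{-2s}(x, \xi)$,
smooth on $\mathbb{R}^{2n}$ and equal to $(|x|^2 + |\xi|^2)^{-s}$ for $|x|^2 + |\xi|^2 \ge 1$. Then

(15.52)  $Q_{-s}(x, \xi) - \tilde{\Lambda}_{-2s}(x, \xi) \in S^{-2s-4}(\mathbb{R}^n).$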


We make a further specific study of the harmonic oscillator $H$ in §6 of Chap. 8,
including results on the eigenvalues and eigenfunctions of $H$, and an alternative
approach to the analysis of the semigroup $e^{-tH}$.

We can extend the Rellich type result (15.11) as follows. By (15.11) and
(15.20), we have $H^{-1}$ compact on $L^2(\mathbb{R}^n)$, so $H^{-1}$ has a discrete set of eigenvalues,
tending to 0, and hence so does $H^{-\sigma}$ for all $\sigma > 0$. Thus $H^{-\sigma}$ is compact
on $L^2(\mathbb{R}^n)$, and, by (15.26), also compact on $\mathcal{H}^s(\mathbb{R}^n)$, for all $s \in \mathbb{R}$. This gives
the following.


Proposition 15.3. Given $r < s \in \mathbb{R}$, the natural inclusion

(15.53)  $\mathcal{H}^s(\mathbb{R}^n) \hookrightarrow \mathcal{H}^r(\mathbb{R}^n)$

is compact.

If $P \in OPS_1^m(\mathbb{R}^n)$ is elliptic (say a $k \times k$ system), and $Q \in OPS_1^{-m}(\mathbb{R}^n)$ is
a parametrix, as in (15.14), we see that the operators $R_j$ are compact on $\mathcal{H}^s(\mathbb{R}^n)$
for all $s$, so we have the following.

Proposition 15.4. If $P \in OPS_1^m(\mathbb{R}^n)$ is elliptic, then, for all $s \in \mathbb{R}$,

(15.54)  $P : \mathcal{H}^s(\mathbb{R}^n) \to \mathcal{H}^{s-m}(\mathbb{R}^n)$ is Fredholm.

Also

(15.55)  $\mathrm{Ker}\, P,\ \mathrm{Ker}\, P^* \subset \mathcal{S}(\mathbb{R}^n),$

and the index of $P$ is independent of $s$.

Material on the index of elliptic operators in $OPS^m(\mathbb{R}^n)$ will be covered in
§11 of Chap. 10. See the exercises below for some preliminary results.


Exercises 


1. In case $n = 1$, consider

   $D_1 = \partial_1 + x_1.$

   Show that $D_1 \in OPS^1(\mathbb{R})$ is elliptic, and that

   $\mathrm{Index}\, D_1 = 1.$

2. In case $n = 2$, consider

   $D_2 = \begin{pmatrix} \partial_1 + x_1 & \partial_2 - x_2 \\ \partial_2 + x_2 & -\partial_1 + x_1 \end{pmatrix}.$

   Show that $D_2 \in OPS^1(\mathbb{R}^2)$ is elliptic and that

   $\mathrm{Index}\, D_2 = 1.$

3. In the setting of Exercises 1-2, compute $D_j^* D_j$ and $D_j D_j^*$, and compare with $H$. This
   should help to compute the kernels of $D_j$ and $D_j^*$.


16. Positive quantization of $C^\infty(S^*M)$


Let $M$ be a compact Riemannian manifold. A quantization of $C^\infty(S^*M)$ is a
continuous linear map

(16.1)  $op : C^\infty(S^*M) \to \mathcal{L}(L^2(M))$

with the property that for each $a \in C^\infty(S^*M)$, $op(a)$ is a pseudodifferential
operator of order 0, whose principal symbol is $a$. One such quantization, called
the Kohn-Nirenberg quantization,

(16.2)  $op_{KN} : C^\infty(S^*M) \to OPS^0_{1,0}(M),$

is obtained via a partition of unity and local coordinate charts, specializing (10.3).
Here we desire to modify this to produce a positive quantization, i.e., one satisfying
$op_F(a)^* = op_F(a)$ for real valued $a$, and

(16.3)  $a \ge 0$ on $S^*M \Longrightarrow (op_F(a)u, u) \ge 0, \quad \forall u \in L^2(M).$

We use a symmetrization procedure, introduced by K. Friedrichs in his approach
to a result known as the sharp Gårding inequality.

We begin the construction in the Euclidean space setting, where $op_{KN}(a)$ can
be written

(16.4)  $b(x, D)u(x) = (2\pi)^{-n/2} \int b(x, \xi) e^{ix\cdot\xi} \hat{u}(\xi)\, d\xi,$

with

(16.5)  $b(x, \xi) = (1 - \chi(\xi))\, a(x, \xi/|\xi|),$

where $\chi \in C_0^\infty(\mathbb{R}^n)$ is radial, $0 \le \chi \le 1$, $\chi(\xi) = 1$ for $|\xi| \le 1$, $0$ for $|\xi| \ge 2$.
We look at symmetrizations of the form

(16.6)  $op_F(a) = \int_{S^{n-1}} q_\omega(D)^* M_{a(\cdot,\omega)}\, q_\omega(D)\, dS(\omega),$

where

(16.7)  $M_{a(\cdot,\omega)} u(x) = a(x, \omega) u(x),$


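and

(16.8)  $q_\omega(D)u(x) = (2\pi)^{-n/2} \int q_\omega(\xi) \hat{u}(\xi) e^{ix\cdot\xi}\, d\xi.$

Assume $a \in C_0^\infty(\mathbb{R}^n \times S^{n-1})$. We set

(16.9)  $b_\omega(x, D) = q_\omega(D)^* M_{a(\cdot,\omega)}\, q_\omega(D),$

and note that, if $q_\omega(\xi)$ satisfies mild bounds,

(16.10)  $(b_\omega(x, D)u, u) = \int a(x, \omega) |q_\omega(D)u(x)|^2\, dx, \quad \forall u \in \mathcal{S}(\mathbb{R}^n).$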


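The functions $q_\omega(\xi)$ we use arise as follows. Pick $\rho \in (0, 1)$, take

(16.11)  $\psi \in C_0^\infty(\mathbb{R}^n)$, radial, real valued,

and set

(16.12)  $\varphi_\omega(\xi) = (1 - \chi(\xi))\, \psi(|\xi|^{-\rho} P_\omega^\perp \xi),$

where $P_\omega$ is the orthogonal projection of $\mathbb{R}^n$ onto the span of $\omega$, and $P_\omega^\perp = I - P_\omega$,
so

(16.13)  $P_\omega^\perp \xi = \xi - (\omega \cdot \xi)\omega.$

We take $\chi$ as in (16.5). Then

(16.14)  $\varphi_\omega \in S^0_{\rho,0}(\mathbb{R}^n).$

Finally, we set

(16.15)  $q_\omega(\xi) = |\xi|^\gamma \varphi_\omega(\xi),$

with $\gamma > 0$ to be selected shortly.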
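This construction yields

(16.16)  $q_\omega \in S^\gamma_{\rho,0}(\mathbb{R}^n), \quad b_\omega(x, D) \in OPS^{2\gamma}_{\rho,0}(\mathbb{R}^n),$

with

(16.17)  $b_\omega(x, \xi) \sim \sum_{\alpha \ge 0} \frac{i^{|\alpha|}}{\alpha!} D_\xi^\alpha q_\omega(\xi)\, D_x^\alpha a(x, \omega)\, q_\omega(\xi),$

hence

(16.18)  $\int_{S^{n-1}} b_\omega(x, \xi)\, dS(\omega) \sim \int_{S^{n-1}} q_\omega(\xi)^2 a(x, \omega)\, dS(\omega) + \sum_{|\alpha| \ge 1} \frac{i^{|\alpha|}}{\alpha!} \int_{S^{n-1}} D_\xi^\alpha q_\omega(\xi)\, D_x^\alpha a(x, \omega)\, q_\omega(\xi)\, dS(\omega).$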


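There is a unique value of $\gamma$ such that

(16.19)  $\int_{S^{n-1}} q_\omega(\xi)^2 a(x, \omega)\, dS(\omega) \to C a(x, \zeta)$, as $|\xi| \to \infty$, $\zeta = \xi/|\xi|$.

To obtain $\gamma$, note that

(16.20)  $\int_{S^{n-1}} \varphi_\omega(r\zeta)^2\, dS(\omega), \quad \zeta \in S^{n-1},$

is independent of $\zeta \in S^{n-1}$, and is equal to

(16.21)  $\int_{S^{n-1}} \varphi_\omega(r\zeta)^2\, dS(\zeta), \quad \forall \omega \in S^{n-1},$
         $= \int_{S^{n-1}} \psi(r^{1-\rho} P_\omega^\perp \zeta)^2\, dS(\zeta)$  (for $r \ge 2$)
         $\sim C r^{-(1-\rho)(n-1)}$ as $r \to \infty$.

Thus we take

(16.22)  $\gamma = \frac{1}{2} (1 - \rho)(n - 1),$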

and have that, for $\xi = r\zeta$, $r = |\xi| \ge 1$,

(16.23)  $\int_{S^{n-1}} q_\omega(\xi)^2 a(x, \omega)\, dS(\omega) = r^{(1-\rho)(n-1)} \int_{S^{n-1}} \psi(r^{1-\rho} P_\omega^\perp \zeta)^2 a(x, \omega)\, dS(\omega) \sim a(x, \zeta) + \sum_{k \ge 1} r^{-k(1-\rho)} a_k(x, \zeta),$

as $r \to \infty$, provided $\psi$ is multiplied by a constant so that $C$ in (16.21) is 1. (In
fact, one has $a_k = 0$ for $k$ odd.)


Rather than applying a similar argument to the other terms on the right side of
(16.18), we will simplify our task by requiring that $\rho$ is sufficiently close to 1 that

(16.24)  $2\gamma < \rho$, equivalently, $\rho > 1 - \frac{1}{n}$.
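
The equivalence in (16.24) is a one-line computation from (16.22):

```latex
\begin{align*}
2\gamma < \rho
  \iff (1-\rho)(n-1) < \rho
  \iff n - 1 < n\rho
  \iff \rho > 1 - \frac{1}{n} .
\end{align*}
```

For instance, with $n = 3$ and $\rho = 3/4$ one gets $\gamma = 1/4$, so $2\gamma = 1/2 < \rho$, and the exponent $\delta = \min(\rho - 2\gamma, 1 - \rho)$ of (16.28) equals $1/4$.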


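Thus, in (16.17),

(16.25)  $b_\omega(x, \xi) = q_\omega(\xi)^2 a(x, \omega) \mod S^{2\gamma-\rho}_{\rho,0}(\mathbb{R}^n),$

hence

(16.26)  $\int_{S^{n-1}} b_\omega(x, \xi)\, dS(\omega) = a(x, \zeta), \mod S^{2\gamma-\rho}_{\rho,0}(\mathbb{R}^n) + S^{-(1-\rho)}_{\rho,0}(\mathbb{R}^n).$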

We conclude that, with $op_F(a)$ defined by (16.6),

(16.27)  $op_F(a) = op_{KN}(a) \mod OPS^{-\delta}_{\rho,0},$

with

(16.28)  $\delta = \min(\rho - 2\gamma, 1 - \rho),$

provided (16.24) holds, and

(16.29)  $a \ge 0 \Longrightarrow (op_F(a)u, u) \ge 0, \quad \forall u \in L^2(\mathbb{R}^n).$

Also $a$ real valued $\Rightarrow op_F(a)^* = op_F(a)$.
From here, we can use a partition of unity and local coordinate charts to obtain
the following.


Proposition 16.1. Let $M$ be a compact, $n$-dimensional Riemannian manifold.
Pick $\rho \in (1 - 1/n, 1)$, and set $\gamma = (1 - \rho)(n - 1)/2$. Take $\delta$ as in (16.28).
Then there is a quantization

(16.30)  $op_F : C^\infty(S^*M) \to \mathcal{L}(L^2(M)),$

satisfying

(16.31)  $op_F(a) = op_{KN}(a) \mod OPS^{-\delta}_{\rho,1-\rho}(M), \quad \forall a \in C^\infty(S^*M),$

and

(16.32)  $a \ge 0 \Longrightarrow op_F(a) \ge 0.$

The following result is a useful complement to Proposition 16.1.

Proposition 16.2. In the setting of Proposition 16.1, one can also arrange that

(16.33)  $op_F(1) = I.$


Proof. Suppose we have $op_F(1) = J$. Then $J = J^* \ge 0$ and $J - I$ is compact,
so $\{1\}$ is the only accumulation point of the spectrum of $J$. We first alter $op_F$
slightly to guarantee that $J = op_F(1)$ is injective. To do this, replace $op_F(a)$ by
$op_F(a) + \bar{a} e^{\Delta}$, where $\bar{a}$ is the mean value of $a$.

We know that $J$ is Fredholm of index 0, so having $J$ injective, we now have $J$
invertible. Furthermore, $J$, $J^{1/2}$, $J^{-1/2}$ are all $= I$ mod $OPS^{-\delta}_{\rho,1-\rho}(M)$. (See Exercise
1 below.) Thus, if we make the adjustment

(16.34)  $op_F(a) \mapsto J^{-1/2}\, op_F(a)\, J^{-1/2},$

the modified quantization $op_F$ satisfies all the desired criteria.
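
To see why (16.34) does the job, one can check the two key properties directly. In the sketch below, $\widetilde{op}_F$ is a hypothetical name for the adjusted quantization $J^{-1/2}\, op_F(\cdot)\, J^{-1/2}$, introduced here only to keep it apart from the original $op_F$:

```latex
\begin{align*}
\widetilde{op}_F(1) &= J^{-1/2}\, J\, J^{-1/2} = I, \\
\bigl(\widetilde{op}_F(a)u, u\bigr)
  &= \bigl(op_F(a)\,J^{-1/2}u,\; J^{-1/2}u\bigr) \ge 0
  \quad \text{for } a \ge 0,\ u \in L^2(M) .
\end{align*}
```

The second line uses that $J^{-1/2}$ is self-adjoint and bounded on $L^2(M)$.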
Note that (16.32)–(16.33) imply

(16.35)  $\|op_F(a)\|_{\mathcal{L}(L^2)} \le \sup |a|, \quad \forall a \in C^\infty(S^*M).$
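
For real valued $a$ the bound (16.35) follows in a few lines. This is a sketch of that case only; for complex valued $a$ one can appeal to the standard fact that a positive, unit-preserving linear map on a commutative C*-algebra is a contraction. Set $A = \sup |a|$:

```latex
\begin{align*}
A \pm a \ge 0 \ \text{on } S^*M
  &\;\Longrightarrow\; op_F(A \pm a) = A\,I \pm op_F(a) \ge 0 \\
  &\;\Longrightarrow\; -A\,\|u\|^2 \le (op_F(a)u, u) \le A\,\|u\|^2 \\
  &\;\Longrightarrow\; \|op_F(a)\|_{\mathcal{L}(L^2)} \le A .
\end{align*}
```

The last step holds because $op_F(a)$ is self-adjoint when $a$ is real valued.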


Hence $op_F$ has a unique extension to $C(S^*M)$:

Proposition 16.3. Let $M$ be a compact, $n$-dimensional Riemannian manifold.
Then the quantization $op_F$ in (16.30)-(16.33) has a unique extension to

(16.36)  $op_F : C(S^*M) \to \mathcal{L}(L^2(M)),$

satisfying (16.32), (16.33), and (16.35).

REMARK. As indicated above, a more careful treatment of the remainders in
(16.23) and (16.18) leads to a sharper result on the remainder in (16.31), known
as the sharp Gårding inequality. For more on this, we refer back to the end of §6
of this chapter. Propositions 16.1–16.3 suffice for use in Chapter 8 of this book.


Exercises

1. Take $a < 0 < b$, and assume $f \in C^\infty((a, b))$. Assume

   $T \in OPS^{-\delta}_{\rho,1-\rho}(M), \quad T = T^*, \quad \mathrm{Spec}\, T \subset (a, b),$

   with $\delta > 0$, $\rho \in (1/2, 1]$, and define $f(T)$ by the spectral theorem. Note that, for each
   $N \in \mathbb{N}$, we can write

   $f(\lambda) = \sum_{k=0}^{N-1} \frac{f^{(k)}(0)}{k!} \lambda^k + \lambda^N g_N(\lambda), \quad g_N \in C^\infty((a, b)).$

   Hence

   $f(T) = \sum_{k=0}^{N-1} \frac{f^{(k)}(0)}{k!} T^k + T^N g_N(T),$

   and $g_N(T)$ is bounded on $L^2(M)$ and commutes with $T$. Deduce that

   $T^N g_N(T) : H^s(M) \to H^{s+N\delta}(M)$, for $-N\delta \le s \le 0$.

   Conclude that

   $f(T) - f(0)I \in OPS^{-\delta}_{\rho,1-\rho}(M).$

2. In the setting of Proposition 16.3, show that

   $a, b \in C(S^*M) \Longrightarrow op_F(ab) - op_F(a)\, op_F(b) \in \mathcal{K}(L^2(M)),$

   where $\mathcal{K}(L^2(M))$ denotes the space of compact operators on $L^2(M)$. Deduce that

   $a, a^{-1} \in C(S^*M) \Longrightarrow op_F(a)$ is Fredholm on $L^2(M)$.


References 


[ADN] S. Agmon, A. Douglis, and L. Nirenberg, Estimates near the boundary for solutions of elliptic differential equations satisfying general boundary conditions, CPAM 12(1959), 623–727; II, CPAM 17(1964), 35–92
[Be] R. Beals, A general calculus of pseudo-differential operators, Duke Math. J. 42(1975), 1–42
[BF] R. Beals and C. Fefferman, Spatially inhomogeneous pseudodifferential operators, Comm. Pure Appl. Math. 27(1974), 1–24
[BGM] M. Berger, P. Gauduchon, and E. Mazet, Le Spectre d’une Variété Riemannienne, LNM #194, Springer, New York, 1971
[BJS] L. Bers, F. John, and M. Schechter, Partial Differential Equations, Wiley, New York, 1964
[Ca1] A. P. Calderon, Uniqueness in the Cauchy problem of partial differential equations, Am. J. Math. 80(1958), 16–36
[Ca2] A. P. Calderon, Singular integrals, Bull. AMS 72(1966), 427–465
[Ca3] A. P. Calderon, Cauchy integrals on Lipschitz curves and related operators, Proc. NAS, USA 74(1977), 1324–1327
[CV] A. P. Calderon and R. Vaillancourt, A class of bounded pseudodifferential operators, Proc. NAS, USA 69(1972), 1185–1187
[CZ] A. P. Calderon and A. Zygmund, Singular integral operators and differential equations, Am. J. Math. 79(1957), 901–921
[CMM] R. Coifman, A. McIntosh, and Y. Meyer, L’intégrale de Cauchy définit un opérateur borné sur $L^2$ pour les courbes Lipschitziennes, Ann. Math. 116(1982), 361–388
[Cor] H. O. Cordes, On compactness of commutators of multiplications and convolutions, and boundedness of pseudo-differential operators, J. Funct. Anal. 18(1975), 115–131
[Cor2] H. O. Cordes, Elliptic Pseudodifferential Operators–An Abstract Theory, LNM #756, Springer, New York, 1979
[Cor3] H. O. Cordes, Spectral Theory of Linear Differential Operators and Comparison Algebras, London Math. Soc. Lecture Notes #70, Cambridge University Press, London, 1987
[CH] H. O. Cordes and E. Herman, Gelfand theory of pseudo-differential operators, Am. J. Math. 90(1968), 681–717
[DK] B. Dahlberg and C. Kenig, Hardy spaces and the Neumann problem in $L^p$ for Laplace’s equation in Lipschitz domains, Ann. Math. 125(1987), 437–465
[D] G. David, Opérateurs d’intégrale singulières sur les surfaces régulières, Ann. Scient. Ecole Norm. Sup. 21(1988), 225–258
[DS] G. David and S. Semmes, Analysis of and on Uniformly Rectifiable Sets, American Math. Soc., Providence, RI, 1993
[Dui] J. J. Duistermaat, Fourier Integral Operators, Courant Institute Lecture Notes, New York, 1974
[Eg] Y. Egorov, On canonical transformations of pseudo-differential operators, Uspehi Mat. Nauk. 24(1969), 235–236
[FJR] E. Fabes, M. Jodeit, and N. Riviere, Potential techniques for boundary problems in $C^1$ domains, Acta Math. 141(1978), 165–186
[F] C. Fefferman, The Uncertainty Principle, Bull. AMS 9(1983), 129–266
[FP] C. Fefferman and D. Phong, On positivity of pseudo-differential operators, Proc. NAS, USA 75(1978), 4673–4674
[Fo2] G. Folland, Harmonic Analysis on Phase Space, Princeton University Press, Princeton, NJ, 1989
[Ga] L. Gårding, Dirichlet’s problem for linear elliptic partial differential equations, Math. Scand. 1(1953), 55–72
[GT] D. Gilbarg and N. Trudinger, Elliptic Partial Differential Equations of Second Order, 2nd ed., Springer, New York, 1983
[Gr] P. Greiner, An asymptotic expansion for the heat equation, Arch. Rat. Mech. Anal. 41(1971), 163–218
[GLS] A. Grossman, G. Loupias, and E. Stein, An algebra of pseudodifferential operators and quantum mechanics in phase space, Ann. Inst. Fourier 18(1969), 343–368
[Hof] S. Hofmann, On singular integral operators of Calderón’s type in $\mathbb{R}^n$, and BMO, Revista Math. Iberoam. 10(1994), 467–505
[HMT] S. Hofmann, M. Mitrea, and M. Taylor, Singular integrals and elliptic boundary problems on regular Semmes-Kenig-Toro domains, International Math. Research Notices 2010(2010), 2567–2865
[HMT2] S. Hofmann, M. Mitrea, and M. Taylor, Symbol calculus for operators of layer potential type on Lipschitz surfaces with vmo normals, and related pseudodifferential operator calculus, Anal. and PDE 8(2015), 115–181
[Ho1] L. Hörmander, Pseudo-differential operators, Comm. Pure Appl. Math. 18(1965), 501–517
[Ho2] L. Hörmander, Pseudodifferential operators and hypoelliptic equations, Proc. Symp. Pure Math. 10(1967), 138–183
[Ho3] L. Hörmander, Fourier integral operators I, Acta Math. 127(1971), 79–183
[Ho4] L. Hörmander, The Weyl calculus of pseudodifferential operators, Comm. Pure Appl. Math. 32(1979), 355–443
[Ho5] L. Hörmander, The Analysis of Linear Partial Differential Operators, Vols. 3 and 4, Springer, New York, 1985
[How] R. Howe, Quantum mechanics and partial differential equations, J. Funct. Anal. 38(1980), 188–254
[JK] D. Jerison and C. Kenig, The inhomogeneous Dirichlet problem in Lipschitz domains, J. Funct. Anal. 130(1995), 161–219
[K] T. Kato, Boundedness of some pseudo-differential operators, Osaka J. Math. 13(1976), 1–9
[Keg] O. Kellogg, Foundations of Potential Theory, Dover, New York, 1954
[KT] C. Kenig and T. Toro, Harmonic measure on locally flat domains, Duke Math. J. 87(1997), 509–551
[KN] J. J. Kohn and L. Nirenberg, An algebra of pseudo-differential operators, Comm. Pure Appl. Math. 18(1965), 269–305
[Kg] H. Kumano-go, Pseudodifferential Operators, MIT, Cambridge, MA, 1981
[LM] J. Lions and E. Magenes, Non-homogeneous Boundary Problems and Applications I, II, Springer, New York, 1972
[MS] H. McKean and I. Singer, Curvature and the eigenvalues of the Laplacian, J. Diff. Geom. 1(1967), 43–69
[Mik] S. Mikhlin, Multidimensional Singular Integral Equations, Pergamon, New York, 1965
[MMM] D. Mitrea, I. Mitrea, and M. Mitrea, Geometric Harmonic Analysis, Springer, New York, 2023
[M3T] D. Mitrea, I. Mitrea, M. Mitrea, and M. Taylor, The Hodge Laplacian: Boundary Value Problems on Riemannian Manifolds, W. de Gruyter, New York, 2016
[MMT] I. Mitrea, M. Mitrea, and M. Taylor, Cauchy integrals, Calderón projectors, and Toeplitz operators on uniformly rectifiable domains, Adv. in Math. 268(2015), 666–757
[MMT2] I. Mitrea, M. Mitrea, and M. Taylor, Multidimensional Riemann-Hilbert problems on domains with uniformly rectifiable interfaces, Preprint
[MT1] M. Mitrea and M. Taylor, Boundary layer methods for Lipschitz domains in Riemannian manifolds, J. Funct. Anal. 163(1999), 181–251
[MT2] M. Mitrea and M. Taylor, Potential theory on Lipschitz domains in Riemannian manifolds: Sobolev-Besov space results and the Poisson problem, J. Funct. Anal. 176(2000), 1–79
[Miz] S. Mizohata, The Theory of Partial Differential Equations, Cambridge University Press, Cambridge, 1973
[Mus] N. Muskhelishvili, Singular Integral Equations, P. Noordhoff, Groningen, 1953
[Ni] L. Nirenberg, Lectures on Linear Partial Differential Equations, Reg. Conf. Ser. in Math., no. 17, AMS, Providence, RI, 1972
[Pal] R. Palais, Seminar on the Atiyah-Singer Index Theorem, Princeton University Press, Princeton, NJ, 1965
[Po] J. Polking, Boundary value problems for parabolic systems of partial differential equations, Proc. Symp. Pure Math. 10(1967), 243–274
[RS] M. Reed and B. Simon, Methods of Mathematical Physics, Academic, New York, Vols. 1,2, 1975; Vols. 3,4, 1978
[Se1] R. Seeley, Refinement of the functional calculus of Calderon and Zygmund, Proc. Acad. Wet. Ned. Ser. A 68(1965), 521–531
[Se2] R. Seeley, Singular integrals and boundary value problems, Am. J. Math. 88(1966), 781–809
[Se3] R. Seeley, Complex powers of an elliptic operator, Proc. Symp. Pure Math. 10(1967), 288–307
[Sem] S. Semmes, Chord arc surfaces with small constant, Adv. Math. 85(1991), 198–223
[So] S. Sobolev, Partial Differential Equations of Mathematical Physics, Dover, New York, 1964
[St] E. Stein, Singular Integrals and the Differentiability of Functions, Princeton University Press, Princeton, NJ, 1972
[St2] E. Stein, Harmonic Analysis, Princeton University Press, Princeton, NJ, 1993
[SW] E. Stein and G. Weiss, Introduction to Fourier Analysis on Euclidean Space, Princeton University Press, Princeton, NJ, 1971
[T1] M. Taylor, Pseudodifferential Operators, Princeton University Press, Princeton, NJ, 1981
[T2] M. Taylor, Noncommutative Microlocal Analysis, Memoirs AMS, no. 313, Providence, RI, 1984
[T3] M. Taylor, Noncommutative Harmonic Analysis, AMS, Providence, RI, 1986
[T4] M. Taylor, Pseudodifferential Operators and Nonlinear PDE, Birkhäuser, Boston, 1991
[T5] M. Taylor, Tools for PDE: Pseudodifferential Operators, Paradifferential Operators, and Layer Potentials, AMS, Providence, RI, 2000
[T6] M. Taylor, Fefferman-Phong inequalities in diffraction theory, Proc. Symp. Pure Math. 43(1985), 261–300
[Tre] F. Treves, Introduction to Pseudodifferential Operators and Fourier Integral Operators, Plenum, New York, 1980
[Ver] G. Verchota, Layer potentials and regularity for the Dirichlet problem for Laplace’s equation in Lipschitz domains, J. Funct. Anal. 39(1984), 572–611
[V] A. Voros, An algebra of pseudodifferential operators and asymptotics of quantum mechanics, J. Funct. Anal. 29(1978), 104–132
[Wey] H. Weyl, The Theory of Groups and Quantum Mechanics, Dover, New York, 1931
[Zw] M. Zworski, Semiclassical Analysis, GSM #138, AMS, Providence, RI, 2012




Spectral Theory 


Introduction 


This chapter is devoted to the spectral theory of self-adjoint, differential opera- 
tors. We cover a number of different topics, beginning in §1 with a proof of the 
spectral theorem. After some consideration, we decided to put that material here, 
rather than in Appendix A, Outline of Functional Analysis. The main motivation 
for putting it here is to begin a line of reasoning that will be continued in sub- 
sequent sections, using the great power of studying unitary groups as a tool in 
spectral theory. After we show how easily this study leads to a proof of the spec- 
tral theorem in §1, in later sections we use it in various ways: as a tool to establish 
self-adjointness, as a tool for obtaining specific formulas, including basic identi- 
ties among special functions, and in other capacities. 

Sections 2 and 3 deal with some general questions in spectral theory, such as 
when does a differential operator define a self-adjoint operator, when does it have 
a compact resolvent, and what asymptotic properties does its spectrum have? We
tackle the latter question, for the Laplace operator $\Delta$, by examining the asymptotic
behavior of the trace of the solution operator $e^{t\Delta}$ for the heat equation, showing
that

(0.1) $\operatorname{Tr} e^{t\Delta} = (4\pi t)^{-n/2} \operatorname{vol} \Omega + o(t^{-n/2}), \quad t \searrow 0,$

when $\Omega$ is either a compact Riemannian manifold or a bounded domain in $\mathbb{R}^n$
(and has the Dirichlet boundary condition). We bring in a Tauberian theorem to
show that (0.1) leads to

(0.2) $N(\lambda) = \dfrac{(4\pi)^{-n/2} \operatorname{vol} \Omega}{\Gamma(\frac{n}{2} + 1)}\, \lambda^{n/2} + o(\lambda^{n/2}), \quad \lambda \to \infty,$


where $N(\lambda)$ is the spectral counting function, $N(\lambda) = \#\{j : \lambda_j \le \lambda\}$. Using
techniques developed in §13 of Chap. 7, we could extend (0.1) to general compact
Riemannian manifolds with smooth boundary and to other boundary conditions,
such as the Neumann boundary condition. We use instead a different method here
in §3, one that works without any regularity hypotheses on $\partial\Omega$. In such generality,
(0.1) does not necessarily hold for the Neumann boundary problem.

© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
M. E. Taylor, Partial Differential Equations II, Applied Mathematical Sciences 116,
https://doi.org/10.1007/978-3-031-33700-0_8
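As a rough numerical illustration of (0.2), one can count Dirichlet eigenvalues on a domain where they are known in closed form. The sketch below assumes the square $\Omega = (0, \pi)^2 \subset \mathbb{R}^2$, whose Dirichlet eigenvalues of $-\Delta$ are $j^2 + k^2$ with $j, k \ge 1$; the choice of domain and the function names are illustrative assumptions, not taken from the text.

```python
import math

def counting_function(lam):
    """N(lam) = #{(j, k) : j, k >= 1 and j^2 + k^2 <= lam} for the square (0, pi)^2."""
    count = 0
    j = 1
    while j * j + 1 <= lam:
        # Number of k >= 1 with k^2 <= lam - j^2.
        count += math.isqrt(int(lam - j * j))
        j += 1
    return count

def weyl_leading_term(lam, n=2, volume=math.pi ** 2):
    """Leading term of (0.2): (4 pi)^(-n/2) vol / Gamma(n/2 + 1) * lam^(n/2)."""
    return (4 * math.pi) ** (-n / 2) * volume / math.gamma(n / 2 + 1) * lam ** (n / 2)

for lam in (100, 1000, 10000):
    exact = counting_function(lam)
    print(lam, exact, exact / weyl_leading_term(lam))
```

The printed ratio approaches 1 from below as $\lambda$ grows; the deficit comes from a boundary correction of lower order in $\lambda$, which the $o(\lambda^{n/2})$ remainder in (0.2) absorbs.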

The study of (0.1) and refinements got a big push from [Kac]. As pursued in 
[MS], it led to developments that we will discuss in Chap. 10. The problem of to 
what extent a Riemannian manifold is determined by the spectrum of its Laplace 
operator has led to much work, which we do not include here. Some is discussed 
in [Ber, Br, BGM, Cha], and [NRR]. We mention particularly some distinct regions
in $\mathbb{R}^2$ whose Laplace operators have the same spectra, given in [GWW].

We have not included general results on the spectral behavior of $\Delta$ obtained
via geometrical optics, its refinement, the theory of Fourier integral operators,
and other microlocal techniques. Results of this nature can be found in Volume
3 of [Ho], and in [Shu], [T1], [Iv], and [Zw]. One direction is to improve the
remainder estimate in (0.2) from $o(\lambda^{n/2})$ to $O(\lambda^{(n-1)/2})$. Refinements, initiated
in [DG], lead to a sharper remainder estimate, $o(\lambda^{(n-1)/2})$, under additional
constraints on the dynamics of the geodesic flow, and under further constraints, to
$O(\lambda^{(n-1)/2} / \log \lambda)$, as in [Be], and (in much greater generality) in [CaG]. See
these works for details.

Sections 4-9 are devoted to specific examples. In §4 we study the Laplace
operator on the unit spheres $S^n$. We specify precisely the spectrum of $\Delta$ and
discuss explicit formulas for certain functions of $\Delta$, particularly

(0.3) $A^{-1} \sin tA, \quad A = \bigl(-\Delta + \tfrac{K}{4}(n-1)^2\bigr)^{1/2},$

with $K = 1$, the sectional curvature of $S^n$. In §5 we obtain an explicit formula
for (0.3), with $K = -1$, on hyperbolic space. In §6 we study the spectral theory
of the harmonic oscillator

(0.4) $H = -\Delta + |x|^2.$

We obtain an explicit formula for $e^{-tH}$, an analogue of which will be useful in
Chap. 10. In §7 we study the operator

(0.5) $H = -\Delta - K|x|^{-1}$

on $\mathbb{R}^3$, obtaining in particular all the eigenvalues of this operator. This operator
arises in the simplest quantum mechanical model of the hydrogen atom. In §8 we
look at a quantum mechanical model for a deuteron given by

(0.6) $H = -\Delta - V_0 \chi_a,$

where $V_0$ is a positive constant and $\chi_a(x) = 1$ for $|x| \le a$, 0 otherwise. We focus
on values of $V_0$ and $a$ experimentally seen to be relevant for the deuteron. In §9
we study the Laplace operator on a cone. Studies done in these sections bring in a
number of special functions, including Legendre functions, Bessel functions, and
hypergeometric functions. We have included two auxiliary problem sets, one on
confluent hypergeometric functions and one on hypergeometric functions.

There follow sections on two general phenomena related to quantum mechanics.
Section 10 considers an adiabatic limit of slowly varying self-adjoint operators
with discrete spectra. Section 11 presents a quantum ergodic theorem.

This chapter ends with two appendices. Appendix A discusses the classical
ergodic theorem behind the quantum result of §11. Appendix B derives formulas
relating the solution operators to wave equations and shifted wave equations, of
relevance to the results of §§4-5.


1. The spectral theorem 


Appendix A, Outline of Functional Analysis, contains a proof of the spectral
theorem for a compact, self-adjoint operator $A$ on a Hilbert space $H$. In that case,
$H$ has an orthonormal basis $\{u_j\}$ such that $Au_j = \lambda_j u_j$, $\lambda_j$ being real numbers
having only 0 as an accumulation point. The vectors $u_j$ are eigenvectors.

A general bounded, self-adjoint operator A may not have any eigenvectors, and 
the statement of the spectral theorem is somewhat more subtle. The following is 
a useful version. 


Theorem 1.1. If $A$ is a bounded, self-adjoint operator on a separable Hilbert
space $H$, then there is a $\sigma$-compact space $\Omega$, a Borel measure $\mu$, a unitary map

(1.1) $W : L^2(\Omega, d\mu) \longrightarrow H,$

and a bounded, Borel function $a : \Omega \to \mathbb{R}$ such that

(1.2) $W^{-1} A W f(x) = a(x) f(x), \quad \forall\, f \in L^2(\Omega, d\mu).$

The map (1.2) is called a spectral representation of $A$. Note that when $A$ is
compact, the eigenvector decomposition described above yields (1.1) and (1.2) with
$(\Omega, \mu)$ a purely atomic measure space. Later in this section we will extend
Theorem 1.1 to the case of unbounded, self-adjoint operators.
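To see concretely how a bounded, self-adjoint operator can fail to have eigenvectors, the following short worked example may help. It is only an illustration; the choice of $H = L^2([0,1], dx)$ and of the multiplication operator is an assumption made here, not an example taken from the text.

```latex
% Illustrative assumption: H = L^2([0,1], dx) and (Af)(x) = x f(x).
\begin{align*}
&(Af, g) = \int_0^1 x\, f(x)\, \overline{g(x)}\, dx = (f, Ag), \qquad \|A\| = 1, \\
&Af = \lambda f \ \Longrightarrow\ (x - \lambda) f(x) = 0 \ \text{a.e.}
   \ \Longrightarrow\ f = 0 \ \text{in } L^2([0,1]), \\
&\text{so (1.1)-(1.2) hold with } \Omega = [0,1],\ d\mu = dx,\ W = I,\ a(x) = x .
\end{align*}
```

Here the measure has no atoms, in contrast with the purely atomic measure space that arises for compact $A$.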


In order to prove Theorem 1.1, we will work with the operators

(1.3) $U(t) = e^{itA},$

defined by the power-series expansion

(1.4) $e^{itA} = \sum_{n=0}^{\infty} \frac{(it)^n}{n!} A^n.$

This is a special case of a construction made in §4 of Chap. 1. $U(t)$ is uniquely
characterized as the solution to the differential equation

(1.5) $\frac{d}{dt} U(t) = iAU(t), \quad U(0) = I.$

We have the group property

(1.6) $U(s + t) = U(s)U(t),$

which follows since both sides satisfy the ODE $(d/ds)Z(s) = iAZ(s)$, $Z(0) = U(t)$.
If $A = A^*$, then applying the adjoint to (1.4) gives

(1.7) $U(t)^* = U(-t),$

which is the inverse of $U(t)$ in view of (1.6). Thus $\{U(t) : t \in \mathbb{R}\}$ is a group of
unitary operators.
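The properties (1.6) and (1.7) can be checked numerically in finite dimensions by summing the power series (1.4) directly. The following is a minimal sketch, assuming a particular real symmetric $2 \times 2$ matrix $A$ and a fixed truncation of the series; the matrix, the truncation length, and the helper names are illustrative choices.

```python
def mat_mul(X, Y):
    """Product of two square matrices given as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def exp_itA(A, t, terms=60):
    """Truncated power series (1.4): sum over m < terms of (it)^m A^m / m!."""
    n = len(A)
    identity = [[complex(i == j) for j in range(n)] for i in range(n)]
    total = [row[:] for row in identity]
    term = [row[:] for row in identity]
    for m in range(1, terms):
        # term_m = term_{m-1} * A * (i t / m)
        term = [[(1j * t / m) * x for x in row] for row in mat_mul(term, A)]
        total = [[total[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return total

def adjoint(X):
    """Conjugate transpose."""
    n = len(X)
    return [[X[j][i].conjugate() for j in range(n)] for i in range(n)]

def max_diff(X, Y):
    """Largest entrywise difference between two square matrices."""
    n = len(X)
    return max(abs(X[i][j] - Y[i][j]) for i in range(n) for j in range(n))

A = [[2.0, 1.0], [1.0, -1.0]]  # real symmetric, hence self-adjoint
s, t = 0.7, -1.2

group_error = max_diff(exp_itA(A, s + t), mat_mul(exp_itA(A, s), exp_itA(A, t)))
adjoint_error = max_diff(adjoint(exp_itA(A, t)), exp_itA(A, -t))
unitarity_error = max_diff(mat_mul(adjoint(exp_itA(A, t)), exp_itA(A, t)),
                           [[1, 0], [0, 1]])
print(group_error, adjoint_error, unitarity_error)
```

All three printed errors should be at the level of floating-point rounding, reflecting (1.6), (1.7), and the resulting unitarity of $U(t)$.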

For a given $v \in H$, let $H_v$ be the closed linear span of $\{U(t)v : t \in \mathbb{R}\}$; we
say $H_v$ is the cyclic space generated by $v$. We say $v$ is a cyclic vector for $H$ if
$H = H_v$. If $H_v$ is not all of $H$, note that $H_v^\perp$ is invariant under $U(t)$, that is,
$U(t)H_v^\perp \subset H_v^\perp$ for all $t$, since for a linear subspace $V$ of $H$, generally

(1.8) $U(t)V \subset V \Longrightarrow U(t)^* V^\perp \subset V^\perp.$


Using this observation, we can prove the next result. 


Lemma 1.2. If $U(t)$ is a unitary group on a separable Hilbert space $H$, then $H$
is an orthogonal direct sum of cyclic subspaces.

Proof. Let $\{w_j\}$ be a countable, dense subset of $H$. Take $v_1 = w_1$ and $H_1 = H_{v_1}$.
If $H_1 \ne H$, let $v_2$ be the first nonzero element $P_1 w_j$, $j \ge 2$, where $P_1$ is
the orthogonal projection of $H$ onto $H_1^\perp$, and let $H_2 = H_{v_2}$. Continue.


In view of this, Theorem 1.1 is a consequence of the following: 


Proposition 1.3. If $A = A^* \in \mathcal{L}(H)$, $U(t) = e^{itA}$, and $H$ has a cyclic vector $v$,
then we can take $\Omega = \mathbb{R}$, and there exists a positive Borel measure $\mu$ on $\mathbb{R}$ and a
unitary map $W : L^2(\mathbb{R}, d\mu) \to H$ such that

(1.9) $W^{-1} U(t) W f(x) = e^{itx} f(x), \quad \forall\, f \in L^2(\mathbb{R}, d\mu).$


One ingredient in the proof of Proposition 1.3 will be the following functional
calculus. Given $f \in \mathcal{S}(\mathbb{R})$, the Schwartz space, we define

(1.10) $f(A) = (2\pi)^{-1/2} \int_{-\infty}^{\infty} \hat{f}(t)\, e^{itA}\, dt.$

Here $\hat{f}$ is the Fourier transform of $f$, and (1.10) formally reflects the Fourier
inversion formula. Note that $\|f(A)\| \le (2\pi)^{-1/2} \|\hat{f}\|_{L^1}$. We record the following
basic results.


Lemma 1.4. Given $f, g \in \mathcal{S}(\mathbb{R})$,

(1.11) $f_s(A) = e^{isA} f(A)$, with $f_s(x) = e^{isx} f(x)$,

(1.12) $g(A) f(A) = (gf)(A),$

and

(1.13) $\overline{f}(A) = f(A)^*.$


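Proof. Since $\hat{f}_s(t) = \hat{f}(t - s)$, we have

(1.14) $f_s(A) = (2\pi)^{-1/2} \int \hat{f}(t - s)\, e^{itA}\, dt = (2\pi)^{-1/2} \int \hat{f}(t)\, e^{i(t+s)A}\, dt,$

which gives (1.11). From here,

(1.15) $(gf)(A) = (2\pi)^{-1/2} \int \hat{g}(s)\, (e^{isx} f)(A)\, ds = (2\pi)^{-1/2} \int \hat{g}(s)\, e^{isA} f(A)\, ds$

by (1.11), and this gives (1.12). The proof of (1.13) is an exercise.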
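The definition (1.10) and the identity (1.11) can be sanity-checked in the simplest possible setting, where $H = \mathbb{C}$ and $A$ is multiplication by a real number $a$. The sketch below assumes the Gaussian $f(x) = e^{-x^2/2}$, for which $\hat{f}(t) = e^{-t^2/2}$ under the normalization $\hat{f}(t) = (2\pi)^{-1/2} \int f(x) e^{-itx}\, dx$; the scalar setting, the test function, and the quadrature parameters are all illustrative assumptions.

```python
import cmath
import math

def f_hat(t):
    # Fourier transform of f(x) = exp(-x^2 / 2) with the (2 pi)^(-1/2) normalization.
    return math.exp(-0.5 * t * t)

def functional_calculus(fhat, a, lo=-14.0, hi=14.0, steps=8000):
    """Trapezoid-rule approximation of (1.10) when A is the scalar a."""
    h = (hi - lo) / steps
    total = 0j
    for m in range(steps + 1):
        t = lo + m * h
        weight = 0.5 if m in (0, steps) else 1.0
        total += weight * fhat(t) * cmath.exp(1j * t * a)
    return total * h / math.sqrt(2 * math.pi)

a, s = 0.7, 1.3
f_of_a = functional_calculus(f_hat, a)                      # should recover f(a)
fs_of_a = functional_calculus(lambda t: f_hat(t - s), a)    # uses f_s-hat(t) = f-hat(t - s)

print(abs(f_of_a - math.exp(-0.5 * a * a)))
print(abs(fs_of_a - cmath.exp(1j * s * a) * f_of_a))
```

Both printed differences should be negligible: the first confirms that (1.10) reproduces $f(a)$ by Fourier inversion, and the second is the scalar case of (1.11).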


We now tackle the proof of Proposition 1.3. To define the map $W$, we first
define

(1.16) $W : \mathcal{S}(\mathbb{R}) \longrightarrow H,$

by

(1.17) $W(f) = f(A)v,$

with $f(A)$ defined by (1.10). We set

(1.18) $(f(A)v, v) = \langle f, \mu \rangle,$

defining the tempered distribution $\mu \in \mathcal{S}'(\mathbb{R})$. We will show below that $\mu$ is
actually a positive measure.

Making use of (1.12)-(1.13), we have the following key identity:

(1.19) $(f(A)v, g(A)v) = (g(A)^* f(A)v, v) = ((f\overline{g})(A)v, v) = \langle f\overline{g}, \mu \rangle.$

Now, if $g = f$, the left side of (1.19) is $\|f(A)v\|^2$, which is $\ge 0$. Hence

(1.20) $\langle |f|^2, \mu \rangle \ge 0$, for all $f \in \mathcal{S}(\mathbb{R})$.

With this, we can establish:
With this, we can establish: 


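Lemma 1.5. The tempered distribution $\mu$, defined by (1.18), is a positive measure
on $\mathbb{R}$.

Proof. It suffices to show that $\langle g, \mu \rangle \ge 0$ when $g \in C_0^\infty(\mathbb{R})$ and $g \ge 0$. So take
such $g$, and then take $\psi \in C_0^\infty(\mathbb{R})$ such that $\psi = 1$ on supp $g$. We have

$\langle g, \mu \rangle = \langle g\psi^2, \mu \rangle = \lim_{\varepsilon \searrow 0} \langle (g + \varepsilon)\psi^2, \mu \rangle = \lim_{\varepsilon \searrow 0} \langle |(g + \varepsilon)^{1/2} \psi|^2, \mu \rangle \ge 0,$

as desired.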


Now we can finish the proof of Proposition 1.3. Since $\langle f\overline{g}, \mu \rangle = \int f\overline{g}\, d\mu$, we
see from (1.19) that $W$ has a unique continuous extension

(1.21) $W : L^2(\mathbb{R}, d\mu) \longrightarrow H,$

and $W$ is an isometry. Since $v$ is assumed to be cyclic, the range of $W$ must be
dense in $H$, so $W$ must be unitary. We have from Lemma 1.4 that if $f \in \mathcal{S}(\mathbb{R})$,
then

(1.22) $e^{isA} f(A) = f_s(A)$, with $f_s(\tau) = e^{is\tau} f(\tau)$.

Hence, for $f \in \mathcal{S}(\mathbb{R})$,

(1.23) $W^{-1} e^{isA} W f = W^{-1} f_s(A) v = e^{is\tau} f(\tau).$

Since $\mathcal{S}(\mathbb{R})$ is dense in $L^2(\mathbb{R}, d\mu)$, this gives (1.9). Thus the spectral theorem for
bounded, self-adjoint operators is proved.
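A finite-dimensional instance may help fix ideas about the objects $\mu$ and $W$ built in this proof. The following is only an illustrative sketch, assuming $H = \mathbb{C}^n$ and a diagonal matrix $A$ (choices made here, not taken from the text).

```latex
% Illustrative assumption: H = \mathbb{C}^n, A = \operatorname{diag}(\lambda_1, \dots, \lambda_n)
% with real \lambda_j, and v = (v_1, \dots, v_n).
\begin{align*}
f(A) &= \operatorname{diag}\bigl(f(\lambda_1), \dots, f(\lambda_n)\bigr), \\
\langle f, \mu \rangle &= (f(A)v, v) = \sum_{j=1}^n f(\lambda_j)\, |v_j|^2
   \quad\Longrightarrow\quad \mu = \sum_{j=1}^n |v_j|^2\, \delta_{\lambda_j}, \\
W f &= f(A)v = \bigl(f(\lambda_1) v_1, \dots, f(\lambda_n) v_n\bigr), \qquad
\|Wf\|^2 = \sum_{j=1}^n |f(\lambda_j)|^2\, |v_j|^2 = \int |f|^2\, d\mu .
\end{align*}
% v is cyclic exactly when the \lambda_j are distinct and every v_j \neq 0;
% in that case L^2(\mathbb{R}, d\mu) has dimension n and W is unitary.
```

In this setting the positivity of $\mu$ asserted in Lemma 1.5 is visible directly, since the weights $|v_j|^2$ are nonnegative.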
Given (1.9), we have from (1.10) that

(1.24) $W^{-1} f(A) W g(x) = f(x) g(x), \quad f \in \mathcal{S}(\mathbb{R}),\ g \in L^2(\mathbb{R}, d\mu),$

which justifies the notation $f(A)$ in (1.10).
Note that (1.9) implies

(1.25) $W^{-1} A W f(x) = x f(x), \quad f \in L^2(\mathbb{R}, d\mu),$

since $(d/dt)U(t) = iAU(t)$. The essential supremum of $|x|$ on $(\mathbb{R}, \mu)$ is equal to
$\|A\|$. Thus $\mu$ has compact support in $\mathbb{R}$ if $A$ is bounded. If a self-adjoint operator
$A$ has the representation (1.25), one says $A$ has simple spectrum. It follows from
Proposition 1.3 that $A$ has simple spectrum if and only if it has a cyclic vector.

One can generalize the results above to a $k$-tuple of commuting, bounded,
self-adjoint operators $A = (A_1, \dots, A_k)$. In that case, for $t = (t_1, \dots, t_k) \in \mathbb{R}^k$,
set

(1.26) $U(t) = e^{it \cdot A}, \quad t \cdot A = t_1 A_1 + \cdots + t_k A_k.$

The hypothesis that the $A_j$ all commute implies $U(t) = U_1(t_1) \cdots U_k(t_k)$, where
$U_j(s) = e^{isA_j}$. $U(t)$ in (1.26) continues to satisfy the properties (1.6) and (1.7);
we have a $k$-parameter unitary group. As above, for $v \in H$, we set $H_v$ equal to the
closed linear span of $\{U(t)v : t \in \mathbb{R}^k\}$, and we say $v$ is a cyclic vector provided
$H_v = H$. Lemma 1.2 goes through in this case. Furthermore, for $f \in \mathcal{S}(\mathbb{R}^k)$, we
can define

(1.27) $f(A) = (2\pi)^{-k/2} \int \hat{f}(t)\, e^{it \cdot A}\, dt,$

and if $H$ has a cyclic vector $v$, the proof of Proposition 1.3 generalizes, giving a
unitary map $W : L^2(\mathbb{R}^k, d\mu) \to H$ such that

(1.28) $W^{-1} U(t) W f(x) = e^{it \cdot x} f(x), \quad f \in L^2(\mathbb{R}^k, d\mu),\ t \in \mathbb{R}^k.$


Therefore, Theorem 1.1 has the following extension.

Proposition 1.6. If $A = (A_1, \dots, A_k)$ is a $k$-tuple of commuting, bounded,
self-adjoint operators on $H$, there is a $\sigma$-compact space $\Omega$, a Borel measure $\mu$, a
unitary map $W : L^2(\Omega, d\mu) \to H$, and bounded Borel functions $a_j : \Omega \to \mathbb{R}$ such
that

(1.29) $W^{-1} A_j W f(x) = a_j(x) f(x), \quad f \in L^2(\Omega, d\mu),\ 1 \le j \le k.$


A bounded operator $B \in \mathcal{L}(H)$ is said to be normal provided $B$ and $B^*$
commute. Equivalently, if we set

(1.30) $A_1 = \frac{1}{2}(B + B^*), \quad A_2 = \frac{1}{2i}(B - B^*),$

then $B = A_1 + iA_2$, and $(A_1, A_2)$ is a 2-tuple of commuting, self-adjoint
operators. Applying Proposition 1.6 and setting $b(x) = a_1(x) + i a_2(x)$, we have:

Corollary 1.7. If $B \in \mathcal{L}(H)$ is a normal operator, there is a unitary map
$W : L^2(\Omega, d\mu) \to H$ and a (complex-valued) $b \in L^\infty(\Omega, d\mu)$ such that

(1.31) $W^{-1} B W f(x) = b(x) f(x), \quad f \in L^2(\Omega, d\mu).$


In particular, Corollary 1.7 holds when $B = U$ is unitary. We next extend
the spectral theorem to an unbounded, self-adjoint operator $A$ on a Hilbert space
$H$, whose domain $D(A)$ is a dense linear subspace of $H$. This extension, due to
von Neumann, uses von Neumann's unitary trick, described in (8.18)-(8.19) of
Appendix A. We recall that, for such $A$, the following three properties hold:

(1.32)
$A \pm i : D(A) \longrightarrow H$ bijectively,
$U = (A - i)(A + i)^{-1}$ is unitary on $H$,
$A = i(I + U)(I - U)^{-1},$

where the range of $I - U = 2i(A + i)^{-1}$ is $D(A)$. Applying Corollary 1.7 to
$B = U$, we have the following theorem:


Theorem 1.8. If $A$ is an unbounded, self-adjoint operator on a separable Hilbert
space $H$, there is a $\sigma$-compact space $\Omega$, a Borel measure $\mu$, a unitary map
$W : L^2(\Omega, d\mu) \to H$, and a real-valued Borel function $a$ on $\Omega$ such that

(1.33) $W^{-1} A W f(x) = a(x) f(x), \quad Wf \in D(A).$

In this situation, given $f \in L^2(\Omega, d\mu)$, $Wf$ belongs to $D(A)$ if and only if the
right side of (1.33) belongs to $L^2(\Omega, d\mu)$.
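In a spectral representation the operators in (1.32) act by multiplication, so the unitary trick reduces to a statement about real numbers $a$: the value $u = (a - i)/(a + i)$ lies on the unit circle, and $a$ is recovered as $i(1 + u)/(1 - u)$. A minimal numerical sketch of this scalar version follows; the function names and sample values are illustrative assumptions.

```python
def cayley(a):
    """Scalar version of U = (A - i)(A + i)^{-1}."""
    return (a - 1j) / (a + 1j)

def inverse_cayley(u):
    """Scalar version of A = i(I + U)(I - U)^{-1}; requires u != 1."""
    return 1j * (1 + u) / (1 - u)

samples = [-1e3, -2.5, 0.0, 0.3, 7.0, 1e3]
for a in samples:
    u = cayley(a)
    # |u| should equal 1, and the inverse transform should return a.
    print(a, abs(u), inverse_cayley(u).real)
```

Note that $u \to 1$ as $|a| \to \infty$, which matches the fact that $I - U$ is injective with range $D(A)$ but need not have a bounded inverse when $A$ is unbounded.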


The formula (1.33) is called the “spectral representation” of a self-adjoint
operator $A$. Using it, we can extend the functional calculus defined by (1.10) as
follows. For a Borel function $f : \mathbb{R} \to \mathbb{C}$, define $f(A)$ by

(1.34) $W^{-1} f(A) W g(x) = f(a(x)) g(x).$

If $f$ is a bounded Borel function, this is defined for all $g \in L^2(\Omega, d\mu)$ and provides
a bounded operator $f(A)$ on $H$. More generally,

(1.35) $D(f(A)) = \{Wg \in H : g \in L^2(\Omega, d\mu) \text{ and } f(a(x))g \in L^2(\Omega, d\mu)\}.$

In particular, we can define $e^{itA}$ for unbounded, self-adjoint $A$, by

$W^{-1} e^{itA} W g = e^{ita(x)} g(x).$

Then $e^{itA}$ is a strongly continuous unitary group, and we have the following
result, known as Stone's theorem (stated as Proposition 9.5 in Appendix A):

Proposition 1.9. If $A$ is self-adjoint, then $iA$ generates a strongly continuous,
unitary group, $U(t) = e^{itA}$.


Note that Lemma 1.2 and Proposition 1.3 are proved for a strongly continuous,
unitary group $U(t) = e^{itA}$, without the hypothesis that $A$ be bounded. This yields
the following analogue of (1.2):

(1.36) $W^{-1} U(t) W f(x) = e^{ita(x)} f(x), \quad f \in L^2(\Omega, d\mu),$

for this more general class of unitary groups. Sometimes a direct construction,
such as by PDE methods, of $U(t)$ is fairly easy. In such a case, the use of $U(t)$
can be a more convenient tool than the unitary trick involving (1.32).

We say a self-adjoint operator $A$ is positive, $A \ge 0$, provided $(Au, u) \ge 0$,
for all $u \in D(A)$. In terms of the spectral representation, this says we have (1.33)
with $a(x) \ge 0$ on $\Omega$. In such a case, $e^{-tA}$ is bounded for $t \ge 0$, even for complex
$t$ with $\operatorname{Re} t \ge 0$, and also defines a strongly continuous semigroup. This proves
Proposition 9.4 of Appendix A.

Given a self-adjoint operator $A$ and a Borel set $S \subset \mathbb{R}$, define $P(S) = \chi_S(A)$,
that is, using (1.33),

(1.37) $W^{-1} P(S) W g = \chi_S(a(x)) g(x), \quad g \in L^2(\Omega, d\mu),$

where $\chi_S$ is the characteristic function of $S$. Then each $P(S)$ is an orthogonal
projection. Also, if $S = \bigcup_{j \ge 1} S_j$ is a countable union of disjoint Borel sets $S_j$,
then, for each $u \in H$,

(1.38) $\lim_{n \to \infty} \sum_{j=1}^{n} P(S_j) u = P(S) u,$

with convergence in the $H$-norm. This is equivalent to the statement that

$\sum_{j=1}^{n} \chi_{S_j}(a(x)) g \to \chi_S(a(x)) g$ in $L^2$-norm, for each $g \in L^2(\Omega, d\mu)$,

which in turn follows from Lebesgue's dominated convergence theorem. By
(1.38), $P(\cdot)$ is a strongly countably additive, projection-valued measure. Then
(1.34) yields

(1.39) $f(A) = \int f(\lambda)\, P(d\lambda).$

$P(\cdot)$ is called the spectral measure of $A$.


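One useful formula for the spectral measure is given in terms of the jump of
the resolvent $R_\lambda = (\lambda - A)^{-1}$ across the real axis. We have the following.

Proposition 1.10. For bounded, continuous $f : \mathbb{R} \to \mathbb{C}$,

(1.40) $f(A)u = \lim_{\varepsilon \searrow 0} \frac{1}{2\pi i} \int_{-\infty}^{\infty} f(\lambda) \bigl[(\lambda - i\varepsilon - A)^{-1} - (\lambda + i\varepsilon - A)^{-1}\bigr] u\, d\lambda.$

Proof. Since $W^{-1} f(A) W$ is multiplication by $f(a(x))$, (1.40) follows from the
fact that

(1.41) $\frac{\varepsilon}{\pi} \int_{-\infty}^{\infty} \frac{f(\lambda)}{(\lambda - a(x))^2 + \varepsilon^2}\, d\lambda \longrightarrow f(a(x)),$

pointwise and boundedly, as $\varepsilon \searrow 0$.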
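The convergence in (1.41) is that of a Poisson-kernel average, and it can be observed numerically. The sketch below assumes the test function $f(\lambda) = 1/(1 + \lambda^2)$, for which the left side of (1.41) has the closed form $(1 + \varepsilon)/(a^2 + (1 + \varepsilon)^2)$ by the semigroup property of the Poisson kernel; the substitution $\lambda = a + \varepsilon \tan\theta$ and the midpoint rule are implementation choices made here for illustration.

```python
import math

def poisson_average(f, a, eps, steps=20000):
    """Approximate (eps/pi) * integral of f(lam) / ((lam - a)^2 + eps^2) d lam.

    With lam = a + eps * tan(theta), the integral becomes
    (1/pi) * integral of f(a + eps * tan(theta)) over (-pi/2, pi/2),
    which is evaluated here by the midpoint rule.
    """
    h = math.pi / steps
    total = 0.0
    for m in range(steps):
        theta = -0.5 * math.pi + (m + 0.5) * h
        total += f(a + eps * math.tan(theta))
    return total * h / math.pi

def f(lam):
    return 1.0 / (1.0 + lam * lam)

a = 0.5
for eps in (1.0, 0.1, 0.01):
    exact = (1.0 + eps) / (a * a + (1.0 + eps) ** 2)
    print(eps, poisson_average(f, a, eps), exact, f(a))
```

As $\varepsilon$ decreases, the computed average tracks the closed form and approaches $f(a)$, as (1.41) asserts.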


An important class of operators $f(A)$ are the fractional powers $f(A) = A^\alpha$,
$\alpha \in (0, \infty)$, defined by (1.34)-(1.35), with $f(\lambda) = \lambda^\alpha$, provided $A \ge 0$.
Note that if $g \in C([0, \infty))$ satisfies $g(0) = 1$, $g(\lambda) = O(\lambda^{-\alpha})$ as $\lambda \to \infty$, then,
for $u \in H$,

(1.42) $u \in D(A^\alpha) \iff \|A^\alpha g(\varepsilon A) u\|_H$ is bounded, for $\varepsilon \in (0, 1)$,

as follows easily from the characterization (1.35) and Fatou's lemma. We note
that Proposition 2.2 of Chap. 4 applies to $D(A^\alpha)$, describing it as an interpolation
space.

We particularly want to identify $D(A^{1/2})$, when $A$ is a positive, self-adjoint
operator on a Hilbert space $H$ constructed by the Friedrichs method, as described
in Proposition 8.7 of Appendix A. Recall that this arises as follows. One has a
Hilbert space $H_1$, a continuous injection $J : H_1 \to H$ with dense range, and one
defines $A$ by

(1.43) $(A(Ju), Jv)_H = (u, v)_{H_1},$

with

(1.44) $D(A) = \{Ju \in JH_1 \subset H : v \mapsto (u, v)_{H_1} \text{ is continuous in } Jv, \text{ in the } H\text{-norm}\}.$

We establish the following.

Proposition 1.11. If $A$ is obtained by the Friedrichs extension method (1.43)-(1.44),
then

(1.45) $D(A^{1/2}) = J(H_1) \subset H.$


Proof. $D(A^{1/2})$ consists of elements of $H$ that are limits of sequences in $D(A)$,
in the norm $\|A^{1/2}u\|_H + \|u\|_H$. As shown in the proof of Proposition 8.7 in
Appendix A, $D(A) = R(JJ^*)$ and $A^{-1} = JJ^*$. Now

(1.46) $\|A^{1/2} JJ^* f\|_H^2 = (JJ^* f, f)_H = \|J^* f\|_{H_1}^2.$

Thus a sequence $(JJ^* f_n)$ converges in the $D(A^{1/2})$-norm (to an element $g$) if
and only if $(J^* f_n)$ converges in the $H_1$-norm (to an element $u$), in which case
$g = Ju$. Since $J^* : H \to H_1$ has dense range, precisely all $u \in H_1$ arise as limits
of such $(J^* f_n)$, so the proposition is proved.


Exercises 


1. The map $f \mapsto f(A)$ given by (1.10) defines a linear transformation $a : \mathcal{S}(\mathbb{R}) \to \mathcal{L}(H)$.
Show that
$f \in \mathcal{S}(\mathbb{R}),\ f \ge 0 \Longrightarrow f(A) \ge 0.$
Deduce that
$f \in \mathcal{S}(\mathbb{R}) \Longrightarrow \|f(A)\| \le \sup |f|.$

2. Let $I = [-\|A\|, \|A\|]$. Show that
$f = 0 \text{ on } I \Longrightarrow f(A) = 0.$
Deduce that $a$ has a unique continuous linear extension $a : C(I) \to \mathcal{L}(H)$, satisfying
$f \in C(I) \Longrightarrow \|f(A)\| \le \sup_I |f|,$
$f \ge 0 \text{ on } I \Longrightarrow f(A) \ge 0.$

3. Let $g : \mathbb{R} \to \mathbb{R}$ be a bounded function and suppose there exist $f_k \in C(\mathbb{R})$ such that
$f_k(\lambda) \to g(\lambda),\ \forall\, \lambda \in \mathbb{R}, \quad |f_k(\lambda)| \le M,\ \forall\, k, \lambda.$
Show that
$f_k(A) \to g(A)$, strongly, on $H$,
where $g(A)$ is defined via a spectral representation of $A$. Deduce that $g(A)$ is well
defined, independently of the spectral representation chosen.

4. Let $K \subset \mathbb{R}$ be compact. Show that Exercise 3 applies to $g(\lambda) = \chi_K(\lambda)$, and hence
$P(K)$ is well defined independent of the spectral representation of $A$, where $P$ is the
spectral measure, defined in (1.37).

5. Given $v \in H$, consider the positive Borel measure $\mu_v$ on $\mathbb{R}$, defined by
$\mu_v(S) = (P(S)v, v).$
Since each finite Borel measure on $\mathbb{R}$ is regular, deduce that
$\mu_v(S) = \sup\{\mu_v(K) : K \subset S,\ K \text{ compact}\}$
is independent of the spectral representation of $A$.

6. Now set
$\mu_{u,v}(S) = (P(S)u, v),$
for $u, v \in H$, $S \subset \mathbb{R}$ a Borel set. Show that $\mu_{u,v}$ is a complex measure. Show that
$\mu_{u+v,u+v} = \mu_{u,u} + \mu_{v,v} + 2 \operatorname{Re} \mu_{u,v},$
and derive a similar formula for $\operatorname{Im} \mu_{u,v}$. Deduce that $P(S)$ is well defined,
independent of the choice of spectral representation of $A$.

7. Given a bounded Borel function $F$ on $\mathbb{R}$, and a spectral representation as in Theorem
1.1, define $F(A) \in \mathcal{L}(H)$ by
$F(A)u = W F(a(x)) W^{-1} u, \quad u \in H.$
Show that, with $\mu_{u,v}$ as in Exercise 6,
$(F(A)u, v) = \int_{\mathbb{R}} F\, d\mu_{u,v}, \quad \forall\, u, v \in H,$
and deduce that $F(A)$ is well defined, independent of the choice of spectral
representation of $A$.

8. Take $H = L^2(\mathbb{R})$, $D = (1/i)\, d/dx$, so $e^{itD} u(x) = u(x + t)$. Show that
$\mathcal{F} e^{itD} u(\xi) = e^{it\xi} \mathcal{F} u(\xi), \quad f(D) = \mathcal{F}^{-1} M_f \mathcal{F},$
where $\mathcal{F}$ is the Fourier transform. Then show that if we pick $v \in L^2(\mathbb{R})$, the formula
(1.17) for $Wf$ becomes
$Wf = f(D)v = \mathcal{F}^{-1}(f \hat{v}).$
Deduce directly that
$g(D) = \mathcal{F}^{-1} M_g \mathcal{F} \Longrightarrow g(D)W(f) = W(gf).$
Show that
$H_v = L^2(\mathbb{R}) \iff \hat{v}(\xi) \ne 0$, for a.e. $\xi \in \mathbb{R}$.


2. Self-adjoint differential operators 


In this section we present some examples of differential operators on a manifold $\Omega$
which, with appropriately specified domains, give unbounded, self-adjoint operators
on $L^2(\Omega, dV)$, $dV$ typically being the volume element determined by a
Riemannian metric on $\Omega$.

We begin with self-adjoint operators arising from the Laplacian, making use of
material developed in Chap. 5. Let $\overline{\Omega}$ be a smooth, compact Riemannian manifold
with boundary, or more generally the closure of an open subset $\Omega$ of a compact
manifold $M$ without boundary. Then, as shown in Chap. 5,

(2.1) $I - \Delta : H_0^1(\Omega) \longrightarrow H_0^1(\Omega)^*$

is bijective, with an inverse that we denote $T$; if we restrict $T$ to $L^2(\Omega)$,

(2.2) $T : L^2(\Omega) \longrightarrow L^2(\Omega)$ is compact and self-adjoint.

Denote by $R(T)$ the image of $L^2(\Omega)$ under $T$. We can apply Proposition 8.2 of
Appendix A to deduce the following.

Proposition 2.1. If $\Omega$ is a region in a compact Riemannian manifold $M$, then $\Delta$
is self-adjoint on $L^2(\Omega)$, with domain $D(\Delta) = R(T) \subset H_0^1(\Omega)$ described above.

For a further description of $D(\Delta)$, note that

(2.3) $D(\Delta) = \{u \in H_0^1(\Omega) : \Delta u \in L^2(\Omega)\}.$

If $\partial\Omega$ is smooth, we can apply the regularity theory of Chap. 5 to obtain

(2.4) $D(\Delta) = H^2(\Omega) \cap H_0^1(\Omega).$

Instead of relying on Proposition 8.2, we could use the Friedrichs construction,
given in Proposition 9.7 of Appendix A. This construction can be applied more
generally. Let $\Omega$ be any Riemannian manifold, with Laplace operator $\Delta$. We can
define $H_0^1(\Omega)$ to be the closure of $C_0^\infty(\Omega)$ in the space
$\{u \in L^2(\Omega) : du \in L^2(\Omega, \Lambda^1)\}$. The inner product on $H_0^1(\Omega)$ is

(2.5) $(u, v)_1 = (u, v)_{L^2} + (du, dv)_{L^2}.$

We have a natural inclusion $H_0^1(\Omega) \hookrightarrow L^2(\Omega)$, and the Friedrichs method gives a
self-adjoint operator $A$ on $L^2(\Omega)$ such that

(2.6) $(Au, v)_{L^2} = (u, v)_1$, for $u \in D(A)$, $v \in H_0^1(\Omega)$,

with

(2.7) $D(A) = \{u \in H_0^1(\Omega) : v \mapsto (u, v)_1 \text{ extends from } H_0^1(\Omega) \to \mathbb{C} \text{ to a continuous linear functional } L^2(\Omega) \to \mathbb{C}\},$

that is,

(2.8) $D(A) = \{u \in H_0^1(\Omega) : \exists\, f \in L^2(\Omega) \text{ such that } (u, v)_1 = (f, v)_{L^2},\ \forall\, v \in H_0^1(\Omega)\}.$

Integrating (2.5) by parts for $v \in C_0^\infty(\Omega)$, we see that $A = I - \Delta$ on $D(A)$, so
we have a self-adjoint extension of $\Delta$ in this general setting, with domain again
described by (2.3).

The process above gives one self-adjoint extension of $\Delta$, initially defined on
$C_0^\infty(\Omega)$. It is not always the only self-adjoint extension. For example, suppose $\overline{\Omega}$
is compact with smooth boundary; consider $H^1(\Omega)$, with inner product (2.5), and
apply the Friedrichs extension procedure. Again we have a self-adjoint operator
$A$, extending $I - \Delta$, with (2.8) replaced by

(2.9) $D(A) = \{u \in H^1(\Omega) : \exists\, f \in L^2(\Omega) \text{ such that } (u, v)_1 = (f, v)_{L^2},\ \forall\, v \in H^1(\Omega)\}.$

In this case, Proposition 7.2 of Chap. 5 yields the following.

Proposition 2.2. If $\overline{\Omega}$ is a smooth, compact manifold with boundary and $\Delta$ the
self-adjoint extension just described, then

(2.10) $D(\Delta) = \{u \in H^2(\Omega) : \partial_\nu u = 0 \text{ on } \partial\Omega\}.$

In case (2.10), we say $D(\Delta)$ is given by the Neumann boundary condition,
while in case (2.4) we say $D(\Delta)$ is given by the Dirichlet boundary condition.

In both cases covered by Propositions 2.1 and 2.2, $(-\Delta)^{1/2}$ is defined as a
self-adjoint operator. We can specify its domain using Proposition 1.11, obtaining
the next result:

Proposition 2.3. In case (2.3), $D((-\Delta)^{1/2}) = H_0^1(\Omega)$; in case (2.10),
$D((-\Delta)^{1/2}) = H^1(\Omega)$.
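A one-dimensional worked example may make the difference between the two boundary conditions concrete. The following is an illustrative sketch only, assuming $\Omega = (0, \pi)$ (an interval chosen here, not an example from the text), where both eigenfunction expansions are explicit.

```latex
% Illustrative assumption: \Omega = (0, \pi), \Delta = d^2/dx^2.
\begin{align*}
\text{Dirichlet:}\quad & -\Delta \sin kx = k^2 \sin kx,\ k \ge 1; \qquad
\text{Neumann:}\quad -\Delta \cos kx = k^2 \cos kx,\ k \ge 0. \\
\text{For } u = \sum_{k \ge 1} c_k \sin kx:\quad
& u \in D((-\Delta)^{1/2}) \iff \sum_k k^2 |c_k|^2 < \infty, \qquad
  u \in D(\Delta) \iff \sum_k k^4 |c_k|^2 < \infty. \\
\text{For } u \equiv 1:\quad
& c_k = \frac{2}{\pi} \int_0^\pi \sin kx\, dx = \frac{2\,(1 - (-1)^k)}{\pi k}, \qquad
  \sum_k k^2 |c_k|^2 = \infty .
\end{align*}
% So the constant function 1 is not in H_0^1(0, \pi), the Dirichlet form domain,
% while in the Neumann case it is the k = 0 eigenfunction and lies in D(\Delta).
```

This is consistent with Proposition 2.3: the form domain is $H_0^1(\Omega)$ in the Dirichlet case and the larger space $H^1(\Omega)$ in the Neumann case.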


Though $\Delta$ on $C_0^\infty(\Omega)$ has several self-adjoint extensions when $\Omega$ has a boundary,
it has only one when $\Omega$ is a complete Riemannian manifold. This is a classical
result, due to Roelcke; we present an elegant proof due to Chernoff [Chn]. When
an unbounded operator $A_0$ on a Hilbert space $H$, with domain $D_0$, has exactly
one self-adjoint extension, namely the closure of $A_0$, we say $A_0$ is essentially
self-adjoint on $D_0$.

Proposition 2.4. If $\Omega$ is a complete Riemannian manifold, then $\Delta$ is essentially
self-adjoint on $C_0^\infty(\Omega)$. Thus the self-adjoint extension with domain given by (2.3)
is the closure of $\Delta$ on $C_0^\infty(\Omega)$.

Proof. We will obtain this as a consequence of Proposition 9.6 of Appendix A,
which states the following. Let $U(t) = e^{itA}$ be a unitary group on a Hilbert space
$H$ which leaves invariant a dense linear space $\mathcal{D}$; $U(t)\mathcal{D} \subset \mathcal{D}$. If $A$ is an extension
of $A_0$ and $A_0 : \mathcal{D} \to \mathcal{D}$, then $A_0$ and all its powers are essentially self-adjoint
on $\mathcal{D}$.

In this case, $U(t)$ will be the solution operator for a wave equation, and we will
exploit finite propagation speed. Set

(2.11) $iA_0 = \begin{pmatrix} 0 & I \\ \Delta - 1 & 0 \end{pmatrix}, \quad D(A_0) = C_0^\infty(\Omega) \oplus C_0^\infty(\Omega).$

The group $U(t)$ will be the solution operator for the wave equation

(2.12) $U(t) \begin{pmatrix} f \\ g \end{pmatrix} = \begin{pmatrix} u(t) \\ u_t(t) \end{pmatrix},$

where $u(t, x)$ is determined by

$\dfrac{\partial^2 u}{\partial t^2} - (\Delta - 1)u = 0; \quad u(0, x) = f,\ u_t(0, x) = g.$

It was shown in §2 of Chap. 6 that $U(t)$ is a unitary group on $H = H_0^1(\Omega) \oplus L^2(\Omega)$;
its generator is an extension of (2.11), and finite propagation speed implies
that $U(t)$ preserves $C_0^\infty(\Omega) \oplus C_0^\infty(\Omega)$ for all $t$, provided $\Omega$ is complete. Thus each
$A_0^k$ is essentially self-adjoint on this space. Since

(2.13) $-A_0^2 = \begin{pmatrix} \Delta - 1 & 0 \\ 0 & \Delta - 1 \end{pmatrix},$

we have the proof of Proposition 2.4. Considering $A_0^{2k}$, we deduce furthermore
that each power $\Delta^k$ is essentially self-adjoint on $C_0^\infty(\Omega)$, when $\Omega$ is complete.
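The structure of (2.11)-(2.13) can be seen explicitly on a single Fourier mode. The following worked computation is only an illustration, under the assumption (made here, not in the text) that $-\Delta$ acts as multiplication by $k^2$ on the mode in question, as it does for $e^{ik \cdot x}$ on a flat torus; $\omega$ and $M$ are notation introduced for this sketch.

```latex
% Illustrative assumption: -\Delta acts as k^2 on the mode; set \omega = (k^2 + 1)^{1/2}.
\begin{align*}
M &= iA_0 = \begin{pmatrix} 0 & 1 \\ -\omega^2 & 0 \end{pmatrix}, \qquad
M^2 = -\omega^2 I \quad (\text{compare (2.13)}), \\
e^{tM} &= \cos \omega t\, I + \frac{\sin \omega t}{\omega}\, M
  = \begin{pmatrix} \cos \omega t & \omega^{-1} \sin \omega t \\ -\omega \sin \omega t & \cos \omega t \end{pmatrix}, \\
u(t) &= f \cos \omega t + \frac{g}{\omega} \sin \omega t, \qquad
u_t(t) = -\omega f \sin \omega t + g \cos \omega t, \\
\omega^2 |u(t)|^2 + |u_t(t)|^2 &= \omega^2 |f|^2 + |g|^2 .
\end{align*}
% The conserved quantity is the squared norm of (u, u_t) in H_0^1 \oplus L^2 restricted to
% this mode, since (u, u)_1 = (1 + k^2)|u|^2 there by (2.5); this is the unitarity of U(t).
```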


Though $\Delta$ is not essentially self-adjoint on $C_0^\infty(\Omega)$ when $\overline{\Omega}$ is compact, we do
have such results as the following:

Proposition 2.5. If $\overline{\Omega}$ is a smooth, compact manifold with boundary, then $\Delta$ is
essentially self-adjoint on

(2.14) $\{u \in C^\infty(\overline{\Omega}) : u = 0 \text{ on } \partial\Omega\},$

its closure having domain described by (2.3). Also, $\Delta$ is essentially self-adjoint on

(2.15) $\{u \in C^\infty(\overline{\Omega}) : \partial_\nu u = 0 \text{ on } \partial\Omega\},$

its closure having domain described by (2.10).

Proof. It suffices to note the simple facts that the closure of (2.14) in $H^2(\Omega)$ is
(2.3) and the closure of (2.15) in $H^2(\Omega)$ is (2.10).

We note that when $\overline{\Omega}$ is a smooth, compact Riemannian manifold with boundary,
and $D(\Delta)$ is given by the Dirichlet boundary condition, then

(2.16) $\bigcap_{j=1}^{\infty} D(\Delta^j) = \{u \in C^\infty(\overline{\Omega}) : \Delta^k u = 0 \text{ on } \partial\Omega,\ k = 0, 1, 2, \dots\},$

and when $D(\Delta)$ is given by the Neumann boundary condition, then

(2.17) $\bigcap_{j=1}^{\infty} D(\Delta^j) = \{u \in C^\infty(\overline{\Omega}) : \partial_\nu(\Delta^k u) = 0 \text{ on } \partial\Omega,\ k \ge 0\}.$


We now derive a result that to some degree amalgamates Propositions 2.4 and
2.5. Let $\overline{\Omega}$ be a smooth Riemannian manifold with boundary, and set

(2.18) $C_c^\infty(\overline{\Omega}) = \{u \in C^\infty(\overline{\Omega}) : \operatorname{supp} u \text{ is compact in } \overline{\Omega}\};$

we do not require elements of this space to vanish on $\partial\Omega$. We say that $\overline{\Omega}$ is
complete if it is complete as a metric space.

Proposition 2.6. If $\overline{\Omega}$ is a smooth Riemannian manifold with boundary which is
complete, then $\Delta$ is essentially self-adjoint on

(2.19) $\{u \in C_c^\infty(\overline{\Omega}) : u = 0 \text{ on } \partial\Omega\}.$

In this case, the closure has domain given by (2.3).

Proof. Consider the following linear subspace of (2.19):

(2.20) $\mathcal{D}_0 = \{u \in C_c^\infty(\overline{\Omega}) : \Delta^j u = 0 \text{ on } \partial\Omega \text{ for } j = 0, 1, 2, \dots\}.$

Let $U(t)$ be the unitary group on $H_0^1(\Omega) \oplus L^2(\Omega)$ defined as in (2.12), with $u$
also satisfying the Dirichlet boundary condition, $u(t, x) = 0$ for $x \in \partial\Omega$. Then,
by finite propagation speed, $U(t)$ preserves $\mathcal{D}_0 \oplus \mathcal{D}_0$, provided $\overline{\Omega}$ is complete, so
as in the proof of Proposition 2.4, we deduce that $\Delta$ is essentially self-adjoint on
$\mathcal{D}_0$; a fortiori it is essentially self-adjoint on the space (2.19).

By similar reasoning, we can show that if $\overline{\Omega}$ is complete, then $\Delta$ is essentially
self-adjoint on

(2.21) $\{u \in C_c^\infty(\overline{\Omega}) : \partial_\nu u = 0 \text{ on } \partial\Omega\}.$

The results of this section so far have involved only the Laplace operator $\Delta$. It
is also of interest to look at Schrödinger operators, of the form $-\Delta + V$, where the
“potential” $V(x)$ is a real-valued function. In this section we will restrict attention
to the case $V \in C^\infty(\Omega)$ and we will also suppose that $V$ is bounded from below.
By adding a constant to $-\Delta + V$, we may as well suppose

(2.22) $V(x) \ge 1$ on $\Omega$.

We can define a Hilbert space $H_{V0}^1(\Omega)$ to be the closure of $C_0^\infty(\Omega)$ in the space

(2.23) $H_V^1(\Omega) = \{u \in L^2(\Omega) : du \in L^2(\Omega, \Lambda^1),\ V^{1/2} u \in L^2(\Omega)\},$

with inner product

(2.24) $(u, v)_{1,V} = (du, dv)_{L^2} + (Vu, v)_{L^2}.$

Then there is a natural injection $H_{V0}^1(\Omega) \hookrightarrow L^2(\Omega)$, and the Friedrichs extension
method provides a self-adjoint operator $A$. Integration by parts in (2.24), with
$v \in C_0^\infty(\Omega)$, shows that such $A$ is an extension of $-\Delta + V$. For this self-adjoint
extension, we have

(2.25) $D(A^{1/2}) = H_{V0}^1(\Omega).$

In case $\overline{\Omega}$ is a smooth, compact Riemannian manifold with boundary and
$V \in C^\infty(\overline{\Omega})$, one clearly has $H_{V0}^1(\Omega) = H_0^1(\Omega)$. In such a case, we have an
immediate extension of Proposition 2.1, including the characterization (2.4) of
$D(-\Delta + V)$. One can also easily extend Proposition 2.2 to $-\Delta + V$ in this case.
It is of substantial interest that Proposition 2.4 also extends, as follows:

Proposition 2.7. If $\Omega$ is a complete Riemannian manifold and the function
$V \in C^\infty(\Omega)$ satisfies $V \ge 1$, then $-\Delta + V$ is essentially self-adjoint on $C_0^\infty(\Omega)$.

Proof. We can modify the proof of Proposition 2.4; replace $\Delta - 1$ by $\Delta - V$ in
(2.11) and (2.12). Then $U(t)$ gives a unitary group on $H_{V0}^1(\Omega) \oplus L^2(\Omega)$, and the
finite propagation speed argument given there goes through. As before, all powers
of $-\Delta + V$ are essentially self-adjoint on $C_0^\infty(\Omega)$.


Some important classes of potentials V have singularities and are not bounded 
below. In §7 we return to this, in a study of the quantum mechanical Coulomb 
problem. 

We record here an important compactness property when V € C%(Q) tends 
to +00 at infinity in Q 


Proposition 2.8. If the Friedrichs extension method described above is used to 
construct the self-adjoint operator —A + V for smooth V > 1, as above, and if 
V > +00 at infinity (i.e., for each N < wo, Qn = {4 € 2: V(x) < N} is 
compact), then —A + V has compact resolvent. 


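Proof. Given (2.25), it suffices to prove that the injection $H^1_{V0}(\Omega) \to L^2(\Omega)$
is compact, under the current hypotheses on $V$. Indeed, if $\{u_n\}$ is bounded in
$H^1_{V0}(\Omega)$, with inner product (2.24), then $\{du_n\}$ and $\{V^{1/2}u_n\}$ are bounded in
$L^2(\Omega)$. By Rellich's theorem and a diagonal argument, one has a subsequence
$\{u_{n_k}\}$ whose restriction to each $\Omega_N$ converges in $L^2(\Omega_N)$-norm. Since $V > N$ on
$\Omega \setminus \Omega_N$, we have $\int_{\Omega \setminus \Omega_N} |u_n|^2\,dV \le N^{-1}\|V^{1/2}u_n\|^2_{L^2(\Omega)}$, uniformly in $n$. The
boundedness of $\{V^{1/2}u_n\}$ in $L^2(\Omega)$ then gives convergence of this subsequence in
$L^2(\Omega)$-norm, proving the proposition.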
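As a numerical illustration of Proposition 2.8, here is a minimal sketch for the model case $\Omega = \mathbb{R}$, $V(x) = 1 + x^2$, where $-\Delta + V$ is the harmonic oscillator shifted by 1 and its eigenvalues are known to be $2, 4, 6, \ldots$. The truncation to a finite interval with Dirichlet condition, the second-order finite-difference scheme, and all function names below are assumptions made for this example; they are not part of the text.

```python
def count_below(diag, off, x):
    """Sturm count: number of eigenvalues < x of a symmetric tridiagonal matrix."""
    count, q = 0, 1.0
    for i, d in enumerate(diag):
        q = d - x - (off[i - 1] ** 2 / q if i > 0 else 0.0)
        if abs(q) < 1e-290:
            q = -1e-290  # nudge off an (almost) zero pivot
        if q < 0.0:
            count += 1
    return count


def kth_eigenvalue(diag, off, k, lo, hi, iters=60):
    """Bisection for the k-th smallest eigenvalue (k = 0, 1, ...) inside [lo, hi]."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if count_below(diag, off, mid) > k:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


def oscillator_eigenvalues(num=3, half_width=8.0, h=0.02):
    """Lowest eigenvalues of -d^2/dx^2 + (1 + x^2), discretized on (-half_width, half_width)."""
    n = int(round(2 * half_width / h)) - 1
    xs = [-half_width + (i + 1) * h for i in range(n)]
    diag = [2.0 / h**2 + 1.0 + x * x for x in xs]
    off = [-1.0 / h**2] * (n - 1)
    upper = max(diag) + 2.0 / h**2  # Gershgorin bound on the spectrum
    return [kth_eigenvalue(diag, off, k, 0.0, upper) for k in range(num)]


if __name__ == "__main__":
    print(oscillator_eigenvalues())
```

The computed values should lie close to 2, 4, 6. This shows a discrete spectrum in one example only and proves nothing in general.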


The following result extends Proposition 2.4 of Chap. 5.


Proposition 2.9. Assume that $\Omega$ is connected and that either $\Omega$ is compact or
$V \to +\infty$ at infinity. Denote by $\lambda_0$ the first eigenvalue of $-\Delta + V$. Then
a $\lambda_0$-eigenfunction of $-\Delta + V$ is nowhere vanishing on $\Omega$. Consequently, the
$\lambda_0$-eigenspace is one-dimensional.


118 8. Spectral Theory 


Proof. Let $u$ be a $\lambda_0$-eigenfunction of $-\Delta + V$. As in the proof of Proposition 2.4
of Chap. 5, we can write $u = u^+ + u^-$, where $u^+(x) = u(x)$ for $u(x) > 0$
and $u^-(x) = u(x)$ for $u(x) \le 0$, and the variational characterization of the
$\lambda_0$-eigenspace implies that $u^\pm$ are eigenfunctions (if nonzero). Hence it suffices
to prove that if $u$ is a $\lambda_0$-eigenfunction and $u(x) \ge 0$ on $\Omega$, then $u(x) > 0$ on $\Omega$.
To this end, write


$u(x) = e^{t(\Delta - V + \lambda_0)} u(x) = \int_\Omega p_t(x,y)\,u(y)\,dV(y)$.


We see that this forces $p_t(x,y) = 0$ for all $t > 0$, when
$x \in \Sigma = \{x : u(x) = 0\}$, $y \in \mathcal{O}$, $\mathcal{O} = \{x : u(x) > 0\}$,


since $p_t(x,y)$ is smooth and $\ge 0$. The strong maximum principle (see Exercise 3
in §1 of Chap. 6) forces $\Sigma = \emptyset$.


Exercises 


1. Let $H^1_V(\Omega)$ be the space (2.23). If $V \ge 1$ belongs to $C^\infty(\Omega)$, show that the Friedrichs
extension also defines a self-adjoint operator $A_1$, equal to $-\Delta + V$ on $C_0^\infty(\Omega)$, such
that $\mathcal{D}(A_1^{1/2}) = H^1_V(\Omega)$. If $\Omega$ is complete, show that this operator coincides with the
extension $A$ defined in (2.25). Conclude that, in this case, $H^1_V(\Omega) = H^1_{V0}(\Omega)$.

2. Let $\Omega$ be complete, $V \ge 1$ smooth. Show that if $A$ is the self-adjoint extension of
$-\Delta + V$ described in Proposition 2.7, then


(2.26) $\mathcal{D}(A) = \{u \in L^2(\Omega) : -\Delta u + Vu \in L^2(\Omega)\}$,


where a priori we regard $-\Delta u + Vu$ as an element of $\mathcal{D}'(\Omega)$.
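3. Define $T : L^2(\Omega) \to L^2(\Omega, \Lambda^1) \oplus L^2(\Omega)$ by $\mathcal{D}(T) = H^1_{V0}(\Omega)$, $Tu = (du, V^{1/2}u)$.
Show that


(2.27) $\mathcal{D}(T^*) = \{(v_1, v_2) \in L^2(\Omega, \Lambda^1) \oplus L^2(\Omega) : \delta v_1 \in L^2(\Omega),\ V^{1/2} v_2 \in L^2(\Omega)\}$.


Show that $T^*T$ is equal to the self-adjoint extension $A$ of $-\Delta + V$ defined by the
Friedrichs extension, as in (2.25).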

4. If $\Omega$ is complete, show that the self-adjoint extension $A$ of $-\Delta + V$ in Proposition 2.7
satisfies


(2.28) $\mathcal{D}(A) = \{u \in L^2(\Omega) : \Delta u \in L^2(\Omega),\ Vu \in L^2(\Omega)\}$.


(Hint: Denote the right side by $W$. Use Exercise 3 and $A = T^*T$ to show that
$\mathcal{D}(A) \subset W$. Use Exercise 2 to show that $W \subset \mathcal{D}(A)$.)

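5. Let $D = -i\,d/dx$ on $C^\infty(\mathbb{R})$, and let $B(x) \in C^\infty(\mathbb{R})$ be real-valued. Define the
unbounded operator $L$ on $L^2(\mathbb{R})$ by


(2.29) $\mathcal{D}(L) = \{u \in L^2(\mathbb{R}) : Du \in L^2(\mathbb{R}),\ Bu \in L^2(\mathbb{R})\}$, $Lu = Du + iB(x)u$.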


3. Heat asymptotics and eigenvalue asymptotics 119 


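Show that $L^* = D - iB$, with
$\mathcal{D}(L^*) = \{u \in L^2(\mathbb{R}) : Du - iBu \in L^2(\mathbb{R})\}$.
Deduce that $A_0 = L^*L$ is given by $A_0 u = D^2u + B^2u + B'(x)u$ on
$\mathcal{D}(A_0) = \{u \in L^2(\mathbb{R}) : Du \in L^2(\mathbb{R}),\ Bu \in L^2(\mathbb{R}),\ D^2u + B^2u + B'(x)u \in L^2(\mathbb{R})\}$.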


6. Suppose that $|B'(x)| \le \theta B(x)^2 + C$, for some $\theta < 1$, $C < \infty$. Show that


$\mathcal{D}(A_0) = \{u \in L^2(\mathbb{R}) : D^2u + (B^2 + B')u \in L^2(\mathbb{R})\}$.


(Hint: Apply Exercise 2 to $D^2 + (B^2 + B') = A$, and show that $\mathcal{D}(A^{1/2})$ is given by
$\mathcal{D}(L)$, defined in (2.29).)

7. In the setting of Exercise 6, show that the operator $L$ of Exercise 5 is closed.
(Hint: $L^*L = A$ is a self-adjoint extension of $D^2 + (B^2 + B')$. Show that $\mathcal{D}(A^{1/2})
= \mathcal{D}(L)$ and also $= \mathcal{D}(\overline{L})$.) Also show that $\mathcal{D}(L^*) = \mathcal{D}(L)$ in this case.


3. Heat asymptotics and eigenvalue asymptotics 


In this section we will study the asymptotic behavior of the eigenvalues of the 
Laplace operator on a compact Riemannian manifold, with or without boundary. 

We begin with the boundaryless case. Let $M$ be a compact Riemannian manifold
without boundary, of dimension $n$. In §13 of Chap. 7 we have constructed a
parametrix for the solution operator $e^{t\Delta}$ of the heat equation


(3.1) $\left(\frac{\partial}{\partial t} - \Delta\right)u = 0$ on $\mathbb{R}^+ \times M$, $u(0,x) = f(x)$,


and deduced that

(3.2) $\operatorname{Tr} e^{t\Delta} \sim t^{-n/2}\,(a_0 + a_1 t + a_2 t^2 + \cdots)$, $t \searrow 0$,
for certain constants $a_j$. In particular,

(3.3) $a_0 = (4\pi)^{-n/2} \operatorname{vol} M$.


This is related to the behavior of the eigenvalues of $\Delta$ as follows. Let the eigenvalues
of $-\Delta$ be $0 = \lambda_0 \le \lambda_1 \le \lambda_2 \le \cdots \nearrow \infty$. Then (3.2) is equivalent to


(3.4) $\sum_{j=0}^{\infty} e^{-t\lambda_j} \sim t^{-n/2}\,(a_0 + a_1 t + a_2 t^2 + \cdots)$, $t \searrow 0$.


We will relate this to the counting function 


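(3.5) $N(\lambda) = \#\{\lambda_j : \lambda_j \le \lambda\}$,
establishing the following:


Theorem 3.1. The eigenvalues $\{\lambda_j\}$ of $-\Delta$ on the compact Riemannian manifold
$M$ have the behavior


(3.6) $N(\lambda) \sim C(M)\,\lambda^{n/2}$, $\lambda \to +\infty$,
with
(3.7) $C(M) = \frac{a_0}{\Gamma(\frac{n}{2} + 1)} = \frac{\operatorname{vol} M}{\Gamma(\frac{n}{2} + 1)\,(4\pi)^{n/2}}$.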
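As a sanity check of (3.6) and (3.7), here is a minimal sketch for one explicit example that is not taken from the text: the flat torus $\mathbb{R}^2/(2\pi\mathbb{Z})^2$, whose eigenvalues are $k_1^2 + k_2^2$ with $(k_1, k_2) \in \mathbb{Z}^2$ and whose volume is $(2\pi)^2$. The function names are hypothetical.

```python
import math


def counting_function(lam: int) -> int:
    """N(lam) = #{(k1, k2) in Z^2 : k1^2 + k2^2 <= lam} for the torus R^2/(2 pi Z)^2."""
    r = math.isqrt(lam)
    total = 0
    for k1 in range(-r, r + 1):
        # number of integers k2 with k2^2 <= lam - k1^2
        total += 2 * math.isqrt(lam - k1 * k1) + 1
    return total


def weyl_constant(n: int, volume: float) -> float:
    """C(M) as in (3.7)."""
    return volume / (math.gamma(n / 2 + 1) * (4 * math.pi) ** (n / 2))


if __name__ == "__main__":
    c = weyl_constant(2, (2 * math.pi) ** 2)
    for lam in (100, 10_000, 1_000_000):
        print(lam, counting_function(lam) / (c * lam))
```

For this torus the constant reduces to $\pi$, the area of the unit disk, and the printed ratios should approach 1 as $\lambda$ grows.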


That (3.6) follows from (3.4) is a special case of a result known as Karamata's
Tauberian theorem. The following neat proof follows one in [Si3]. Let $\mu$ be
a positive (locally finite) Borel measure on $[0,\infty)$; in the example above,


$\mu([0,\lambda]) = N(\lambda)$.


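Proposition 3.2. If $\mu$ is a positive measure on $[0,\infty)$, $\alpha \in (0,\infty)$, then


(3.8) $\int_0^\infty e^{-t\lambda}\,d\mu(\lambda) \sim a\,t^{-\alpha}$, $t \searrow 0$,
implies

(3.9) $\int_0^x d\mu(\lambda) \sim b\,x^{\alpha}$, $x \nearrow \infty$,
with

(3.10) $b = \frac{a}{\Gamma(\alpha + 1)}$.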


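Proof. Let $d\mu_t$ be the measure given by $\mu_t(A) = t^{\alpha}\mu(t^{-1}A)$, and let $d\nu(\lambda)
= \alpha\lambda^{\alpha-1}\,d\lambda$; then $\nu_t = \nu$. The hypothesis (3.8) becomes


(3.11) $\lim_{t \to 0} \int e^{-\lambda}\,d\mu_t(\lambda) = b \int e^{-\lambda}\,d\nu(\lambda)$,
with $b$ given by (3.10), and the desired conclusion becomes
(3.12) $\lim_{t \to 0} \int \chi(\lambda)\,d\mu_t(\lambda) = b \int \chi(\lambda)\,d\nu(\lambda)$


when $\chi$ is the characteristic function of $[0,1]$. It would suffice to show that (3.12)
holds for all continuous $\chi(\lambda)$ with compact support in $[0,\infty)$.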




From (3.11) we deduce that the measures $e^{-\lambda}\,d\mu_t$ are uniformly bounded, for
$t \in (0,1]$. Thus (3.12) follows if we can establish


(3.13) $\lim_{t \to 0} \int g(\lambda)e^{-\lambda}\,d\mu_t(\lambda) = b \int g(\lambda)e^{-\lambda}\,d\nu(\lambda)$,


for $g$ in a dense subspace of $C_o(\mathbb{R}^+)$, the space of continuous functions on $[0,\infty)$
that vanish at infinity. Indeed, the hypothesis implies that (3.13) holds for all $g$
in $\mathcal{A}$, the space of finite linear combinations of functions of $\lambda \in [0,\infty)$ of the
form $\varphi_s(\lambda) = e^{-s\lambda}$, $s \in (0,\infty)$, as can be seen by dilating the variables in
(3.11). By the Stone-Weierstrass theorem, $\mathcal{A}$ is dense in $C_o(\mathbb{R}^+)$, so the proof is
complete.
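The relation between (3.8) and (3.9) can be checked numerically on a simple example chosen for this sketch: $\mu$ is the counting measure of the squares $j^2$, $j \ge 1$ (the Dirichlet eigenvalues of $-d^2/dx^2$ on $(0,\pi)$). Here $\alpha = 1/2$ and $a = \sqrt{\pi}/2$, so (3.10) predicts $b = 1$, in agreement with $\mu([0,x]) = \lfloor\sqrt{x}\rfloor$. The function names are hypothetical.

```python
import math


def laplace_transform(t: float) -> float:
    """int e^{-t lam} d mu(lam) = sum_{j >= 1} exp(-t j^2), summed until terms are negligible."""
    total, j = 0.0, 1
    while True:
        term = math.exp(-t * j * j)
        if term < 1e-18:
            return total
        total += term
        j += 1


def counting_measure(x: int) -> int:
    """mu([0, x]) = #{j >= 1 : j^2 <= x}."""
    return math.isqrt(x)


def tauberian_constant(a: float, alpha: float) -> float:
    """b as in (3.10)."""
    return a / math.gamma(alpha + 1)


if __name__ == "__main__":
    alpha, a = 0.5, math.sqrt(math.pi) / 2
    for t in (1e-2, 1e-3, 1e-4):
        print(t, t**alpha * laplace_transform(t), a)
    print(counting_measure(10**8) / (10**8) ** alpha, tauberian_constant(a, alpha))
```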


We next want to establish similar results on $N(\lambda)$ for the Laplace operator $\Delta$
on a compact manifold $\overline{\Omega}$ with boundary, with Dirichlet boundary condition. At
the end of §13 in Chap. 7 we sketched a construction of a parametrix for $e^{t\Delta}$ in
this case which, when carried out, would yield an expansion


(3.14) $\operatorname{Tr} e^{t\Delta} \sim t^{-n/2}\,(a_0 + a_{1/2}t^{1/2} + a_1 t + \cdots)$, $t \searrow 0$,


extending (3.2). However, we will be able to verify the hypothesis of Proposition
3.2 with less effort than it would take to carry out the details of this construction,
and for a much larger class of domains.

For simplicity, we will restrict attention to bounded domains in $\mathbb{R}^n$ and to the
flat Laplacian, though more general cases can be handled similarly. Now, let $\Omega$ be
an arbitrary bounded, open subset of $\mathbb{R}^n$, with closure $\overline{\Omega}$. The Laplace operator
on $\Omega$, with Dirichlet boundary condition, was studied in §5 of Chap. 5.


Lemma 3.3. For any bounded, open $\Omega \subset \mathbb{R}^n$, $\Delta$ with Dirichlet boundary
condition, $e^{t\Delta}$ is trace class for all $t > 0$.


Proof. Let $\Omega \subset B$, a large open ball. Then the variational characterization of
eigenvalues shows that the eigenvalues $\lambda_j(\Omega)$ of $-\Delta$ on $\Omega$ and $\lambda_j(B)$ of $L = -\Delta$
on $B$, both arranged in increasing order, have the relation


(3.15) $\lambda_j(\Omega) \ge \lambda_j(B)$.


But we know that $e^{-tL}$ has integral kernel in $C^\infty(\overline{B} \times \overline{B})$ for each $t > 0$, hence is
trace class. Since $e^{-t\lambda_j(\Omega)} \le e^{-t\lambda_j(B)}$, this implies that the positive self-adjoint
operator $e^{t\Delta}$ is also trace class.


Limiting arguments, which we leave to the reader, allow one to show that, even in
this generality, if $H(t,x,y) \in C^\infty(\Omega \times \Omega)$ is, for fixed $t > 0$, the integral kernel
of $e^{t\Delta}$ on $L^2(\Omega)$, then


(3.16) $\operatorname{Tr} e^{t\Delta} = \int_\Omega H(t,x,x)\,dx$.




See Exercises 1-5 at the end of this section.


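Proposition 3.4. If $\Omega$ is a bounded, open subset of $\mathbb{R}^n$ and $\Delta$ has the Dirichlet
boundary condition, then


(3.17) $\operatorname{Tr} e^{t\Delta} \sim (4\pi t)^{-n/2} \operatorname{vol} \Omega$, $t \searrow 0$.

Proof. We will compare $H(t,x,y)$ with $H_0(t,x,y) = (4\pi t)^{-n/2}e^{-|x-y|^2/4t}$, the
free-space heat kernel. Let $E(t,x,y) = H_0(t,x,y) - H(t,x,y)$. Then, for fixed
$y \in \Omega$,


(3.18) $\frac{\partial E}{\partial t} - \Delta_x E = 0$ on $\mathbb{R}^+ \times \Omega$, $E(0,x,y) = 0$,
and
(3.19) $E(t,x,y) = H_0(t,x,y)$, for $x \in \partial\Omega$.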


To make simple sense out of (3.19), one might assume that every point of $\partial\Omega$
is a regular boundary point, though a further limiting argument can be made to
lift such a restriction. The maximum principle for solutions to the heat equation
implies


(3.20) $0 \le E(t,x,y) \le \sup_{0 \le s \le t,\ z \in \partial\Omega} H_0(s,z,y) \le \sup_{0 \le s \le t} (4\pi s)^{-n/2} e^{-\delta(y)^2/4s}$,


where $\delta(y) = \operatorname{dist}(y, \partial\Omega)$. Now the function
$\psi_\delta(s) = (4\pi s)^{-n/2}e^{-\delta^2/4s}$


on $(0,\infty)$ vanishes at 0 and $\infty$ and has a unique maximum at $s = \delta^2/2n$; we
have $\psi_\delta(\delta^2/2n) = C_n\delta^{-n}$. Thus


(3.21) $0 \le E(t,x,y) \le \max\big((4\pi t)^{-n/2}e^{-\delta(y)^2/4t},\ C_n\,\delta(y)^{-n}\big)$.
Of course, $E(t,x,y) \le H_0(t,x,y)$ also.
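The claim about the maximum of $\psi_\delta$ is a one-line calculus exercise; the sketch below checks it numerically. The explicit value $C_n = (n/2\pi e)^{n/2}$ used here is obtained by substituting $s = \delta^2/2n$ into $\psi_\delta$. It is not stated in the text and should be read as a derived assumption, as should the function names.

```python
import math


def psi(s: float, delta: float, n: int) -> float:
    """psi_delta(s) = (4 pi s)^{-n/2} exp(-delta^2 / (4 s))."""
    return (4 * math.pi * s) ** (-n / 2) * math.exp(-delta**2 / (4 * s))


def c_n(n: int) -> float:
    """Value of psi_delta at its maximum, divided by delta^{-n} (derived, see lead-in)."""
    return (n / (2 * math.pi * math.e)) ** (n / 2)


if __name__ == "__main__":
    n, delta = 3, 0.7
    s_star = delta**2 / (2 * n)
    print(psi(s_star, delta, n), c_n(n) * delta ** (-n))
    print(psi(0.9 * s_star, delta, n), psi(1.1 * s_star, delta, n))
```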

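Now, let $\mathcal{O} \subset\subset \Omega$ be such that $\operatorname{vol}(\Omega \setminus \mathcal{O}) < \varepsilon$. For $t$ small enough, namely
for $t \le \delta_1^2/2n$ where $\delta_1 = \operatorname{dist}(\mathcal{O}, \partial\Omega)$, we have


(3.22) $0 \le E(t,x,x) \le (4\pi t)^{-n/2}e^{-\delta_1^2/4t}$, for $x \in \mathcal{O}$,


while of course $0 \le E(t,x,x) \le (4\pi t)^{-n/2}$, for $x \in \Omega \setminus \mathcal{O}$. Therefore,


(3.23) $\limsup_{t \to 0}\ (4\pi t)^{n/2} \int_\Omega E(t,x,x)\,dx \le \varepsilon$,


so

(3.24) $\operatorname{vol} \Omega - \varepsilon \le \liminf_{t \to 0}\ (4\pi t)^{n/2} \int_\Omega H(t,x,x)\,dx
 \le \limsup_{t \to 0}\ (4\pi t)^{n/2} \int_\Omega H(t,x,x)\,dx \le \operatorname{vol} \Omega$.


As $\varepsilon$ can be taken arbitrarily small, we have a proof of (3.17).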


Corollary 3.5. If $\Omega$ is a bounded, open subset of $\mathbb{R}^n$, $N(\lambda)$ the counting function
of the eigenvalues of $-\Delta$, with Dirichlet boundary condition, then (3.6) holds.
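To see (3.17) at work on a domain where everything is explicit, the following sketch takes $\Omega = (0,\pi)^2$, an example chosen here and not in the text. Its Dirichlet eigenvalues are $j^2 + k^2$ with $j, k \ge 1$, so the heat trace factors as a square of a one-dimensional sum. The function names are hypothetical.

```python
import math


def interval_heat_trace(t: float) -> float:
    """Tr e^{t Delta} for the Dirichlet Laplacian on (0, pi): sum_{j >= 1} exp(-t j^2)."""
    total, j = 0.0, 1
    while True:
        term = math.exp(-t * j * j)
        if term < 1e-18:
            return total
        total += term
        j += 1


def normalized_trace(t: float) -> float:
    """(4 pi t)^{n/2} Tr e^{t Delta} / vol(Omega) for Omega = (0, pi)^2, n = 2."""
    trace = interval_heat_trace(t) ** 2
    return (4 * math.pi * t) * trace / math.pi**2


if __name__ == "__main__":
    for t in (1e-2, 1e-3, 1e-4):
        print(t, normalized_trace(t))
```

The printed ratios should stay below 1 and increase toward 1 as $t \searrow 0$: the trace is never larger than the free-space value $(4\pi t)^{-n/2}\operatorname{vol}\Omega$, and the deficit comes from the boundary.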


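Note that if $\mathcal{O}_\varepsilon$ is the set of points in $\Omega$ of distance $\ge \varepsilon$ from $\partial\Omega$ and we define
$v(\varepsilon) = \operatorname{vol}(\Omega \setminus \mathcal{O}_\varepsilon)$, then the estimate (3.24) can be given the more precise
reformulation


(3.25) $0 \le \operatorname{vol} \Omega - (4\pi t)^{n/2} \operatorname{Tr} e^{t\Delta} \le \omega(\sqrt{2nt})$,
where
(3.26) $\omega(\varepsilon) = v(\varepsilon) + \int_\varepsilon^\infty e^{-ns^2/2\varepsilon^2}\,dv(s)$.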


The fact that such a crude argument works, and works so generally, is a special
property of the Dirichlet problem. If one uses the Neumann boundary condition,
then for bounded $\Omega \subset \mathbb{R}^n$ with nasty boundary, $\Delta$ need not even have compact
resolvent. However, Theorem 3.1 does extend to the Neumann boundary condition
provided $\partial\Omega$ is smooth. One can do this via the sort of parametrix for
boundary problems sketched in §13 of Chap. 7.

We now look at the heat kernel $H(t,x,y)$ on the complement of a smooth,
bounded region $K \subset \mathbb{R}^n$. We impose the Dirichlet boundary condition on $\partial K$.
As before, $0 \le H(t,x,y) \le H_0(t,x,y)$, where $H_0(t,x,y)$ is the free-space heat
kernel. We can extend $H(t,x,y)$ to be Lipschitz continuous on $(0,\infty) \times \mathbb{R}^n \times \mathbb{R}^n$
by setting $H(t,x,y) = 0$ when either $x \in K$ or $y \in K$. We now estimate
$E(t,x,y) = H_0(t,x,y) - H(t,x,y)$. Suppose $K$ is contained in the open ball of
radius $R$ centered at the origin.


Lemma 3.6. For $|x - y| \le |y| - R$, we have
(3.27) $E(t,x,y) \le C\,t^{-1/2}e^{-(|y|-R)^2/4t}$.


Proof. With $y \in \Omega = \mathbb{R}^n \setminus K$, write


(3.28) $H(t,x,y) = (4\pi t)^{-1/2} \int_{-\infty}^{\infty} e^{-s^2/4t} \cos s\Lambda\ \delta_y(x)\,ds$,


where $\Lambda = \sqrt{-\Delta}$ and $\Delta$ is the Laplace operator on $\Omega$, with the Dirichlet boundary
condition. We have a similar formula for $H_0(t,x,y)$, using instead $\Lambda_0 = \sqrt{-\Delta_0}$,
with $\Delta_0$ the free-space Laplacian. Now, by finite propagation speed,


$\cos s\Lambda\ \delta_y(x) = \cos s\Lambda_0\ \delta_y(x)$,
provided
$|s| \le d = \operatorname{dist}(y, \partial K)$, and $|x - y| \le d$.


Thus, as long as $|x - y| \le d$, we have


(3.29) $E(t,x,y) = (4\pi t)^{-1/2} \int_{|s| \ge d} e^{-s^2/4t}\,[\cos s\Lambda_0\ \delta_y(x) - \cos s\Lambda\ \delta_y(x)]\,ds$.

Then the estimate (3.27) follows easily, along the same lines as estimates on heat
kernels discussed in Chap. 6, §2.


When we combine (3.27) with the obvious inequality
(3.30) $0 \le E(t,x,y) \le H_0(t,x,y) = (4\pi t)^{-n/2}e^{-|x-y|^2/4t}$,


we see that, for each $t > 0$, $E(t,x,y)$ is rapidly decreasing as $|x| + |y| \to \infty$.
Using this and appropriate estimates on derivatives, we can show that $E(t,x,y)$
is the integral kernel of a trace class operator on $L^2(\mathbb{R}^n)$. We can write


(3.31) $\operatorname{Tr}\,(e^{t\Delta_0} - e^{t\Delta}P) = \int_{\mathbb{R}^n} E(t,x,x)\,dx$,

where $P$ is the projection of $L^2(\mathbb{R}^n)$ onto $L^2(\Omega)$ defined by restriction to $\Omega$. Now,
as $t \searrow 0$, $(4\pi t)^{n/2}E(t,x,x)$ approaches 1 on $K$ and 0 on $\mathbb{R}^n \setminus K$. Together with
the estimates (3.27) and (3.30), this implies


(3.32) $(4\pi t)^{n/2} \int_{\mathbb{R}^n} E(t,x,x)\,dx \to \operatorname{vol} K$,

as $t \searrow 0$. This establishes the following:


Proposition 3.7. If $K$ is a closed, bounded set in $\mathbb{R}^n$, $\Delta$ is the Laplacian on
$L^2(\mathbb{R}^n \setminus K)$, with Dirichlet boundary condition, and $\Delta_0$ is the Laplacian on
$L^2(\mathbb{R}^n)$, then $e^{t\Delta_0} - e^{t\Delta}P$ is trace class for each $t > 0$ and


(3.33) $\operatorname{Tr}\,(e^{t\Delta_0} - e^{t\Delta}P) \sim (4\pi t)^{-n/2} \operatorname{vol} K$,


as $t \searrow 0$.


This result will be of use in the study of scattering by an obstacle $K$, in Chap. 9.
It is also valid for the Neumann boundary condition if $\partial K$ is smooth.


4. The Laplace operator on S” 125 


Exercises 


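In Exercises 1-4, let $\Omega \subset \mathbb{R}^n$ be a bounded, open set and let $\mathcal{O}_j$ be open with smooth
boundary such that


$\mathcal{O}_1 \subset\subset \mathcal{O}_2 \subset\subset \cdots \subset\subset \mathcal{O}_j \subset\subset \mathcal{O}_{j+1} \subset\subset \cdots \nearrow \Omega$.


Let $L_j$ be $-\Delta$ on $\mathcal{O}_j$, with Dirichlet boundary condition; the corresponding operator
on $\Omega$ is simply denoted $-\Delta$.
1. Using material developed in §5 of Chap. 5, show that, for any $t > 0$, $f \in L^2(\Omega)$,


$e^{-tL_j}P_j f \to e^{t\Delta}f$ strongly in $L^2(\Omega)$,


as $j \to \infty$, where $P_j$ is multiplication by the characteristic function of $\mathcal{O}_j$.
Don't peek at Lemma 3.4 in Chap. 11!
2. If $\lambda_\nu(\mathcal{O}_j)$ are the eigenvalues of $L_j$, arranged in increasing order for each $j$, show that,
for each $\nu$,
$\lambda_\nu(\mathcal{O}_j) \searrow \lambda_\nu(\Omega)$, as $j \to \infty$.
3. Show that, for each $t > 0$,
$\operatorname{Tr} e^{-tL_j} \nearrow \operatorname{Tr} e^{t\Delta}$.
4. Let $H_j(t,x,y)$ be the heat kernel on $\mathbb{R}^+ \times \mathcal{O}_j \times \mathcal{O}_j$. Extend $H_j$ to $\mathbb{R}^+ \times \Omega \times \Omega$ so
as to vanish if $x$ or $y$ belongs to $\Omega \setminus \mathcal{O}_j$. Show that, for each $x \in \Omega$, $y \in \Omega$, $t > 0$,


$H_j(t,x,y) \nearrow H(t,x,y)$, as $j \to \infty$.


Deduce that, for each $t > 0$,


$\int_{\mathcal{O}_j} H_j(t,x,x)\,dx \nearrow \int_\Omega H(t,x,x)\,dx$.


5. Using Exercises 1-4, give a detailed proof of (3.16) for general bounded $\Omega \subset \mathbb{R}^n$.
6. Give an example of a bounded, open, connected set $\Omega \subset \mathbb{R}^2$ (with rough boundary)
such that $\Delta$, with Neumann boundary condition, does not have compact resolvent.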


4. The Laplace operator on $S^n$


A key tool in the analysis of the Laplace operator $\Delta_S$ on $S^n$ is the formula for
the Laplace operator on $\mathbb{R}^{n+1}$ in polar coordinates:

(4.1) $\Delta = \frac{\partial^2}{\partial r^2} + \frac{n}{r}\frac{\partial}{\partial r} + \frac{1}{r^2}\Delta_S$.

In fact, this formula is simultaneously the main source of interest in $\Delta_S$ and the
best source of information about it.

The fact that we have already developed so many tools from functional analysis,
elliptic operator theory, and spectral theory allows us to give a relatively
compact account of basic results of the spectral theory of $\Delta_S$. For an account that
makes use only of basic results of advanced calculus, see Chapter 7 of [T6].




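To begin, we consider the Dirichlet problem for the unit ball in Euclidean
space, $B = \{x \in \mathbb{R}^{n+1} : |x| < 1\}$:


(4.2) $\Delta u = 0$ in $B$, $u = f$ on $S^n = \partial B$,


given $f \in \mathcal{D}'(S^n)$. In Chap. 5 we obtained the Poisson integral formula for the
solution:


(4.3) $u(x) = \frac{1 - |x|^2}{A_n} \int_{S^n} \frac{f(y)}{|x - y|^{n+1}}\,dS(y)$,


where $A_n$ is the volume of $S^n$. Equivalently, if we set $x = r\omega$ with $r = |x|$, $\omega \in S^n$,


(4.4) $u(r\omega) = \frac{1 - r^2}{A_n} \int_{S^n} \frac{f(\omega')}{(1 - 2r\,\omega\cdot\omega' + r^2)^{(n+1)/2}}\,dS(\omega')$.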
sn 


Now we can derive an alternative formula for the solution of (4.2) if we use
(4.1) and regard $\Delta u = 0$ as an operator-valued ODE in $r$; it is an Euler equation,
with solution


(4.5) $u(r\omega) = r^{A - (n-1)/2} f(\omega)$, $r \le 1$,
where $A$ is an operator on $\mathcal{D}'(S^n)$, defined by


(4.6) $A = \Big(-\Delta_S + \frac{(n-1)^2}{4}\Big)^{1/2}$.


If we set $r = e^{-t}$ and compare (4.5) and (4.4), we obtain a formula for the
semigroup $e^{-tA}$ as follows. Let $\theta(\omega,\omega')$ denote the geodesic distance on $S^n$ from
$\omega$ to $\omega'$, so $\cos\theta(\omega,\omega') = \omega\cdot\omega'$. We can rewrite (4.4) as


(4.7) $u(r\omega) = \frac{2}{A_n} \sinh(\log r^{-1})\ r^{-(n-1)/2} \int_{S^n} \frac{f(\omega')}{[2\cosh(\log r^{-1}) - 2\cos\theta(\omega,\omega')]^{(n+1)/2}}\,dS(\omega')$.

In other words, by (4.5),

(4.8) $e^{-tA} f(\omega) = \frac{2}{A_n} \sinh t \int_{S^n} \frac{f(\omega')}{(2\cosh t - 2\cos\theta(\omega,\omega'))^{(n+1)/2}}\,dS(\omega')$.

Identifying an operator on $\mathcal{D}'(S^n)$ with its Schwartz kernel in $\mathcal{D}'(S^n \times S^n)$, we
write


(4.9) $e^{-tA} = \frac{2}{A_n}\,\frac{\sinh t}{(2\cosh t - 2\cos\theta)^{(n+1)/2}}$, $t > 0$.


Note that the integration of (4.9) from $t$ to $\infty$ produces the formula
(4.10) $A^{-1}e^{-tA} = 2C_n\,(2\cosh t - 2\cos\theta)^{-(n-1)/2}$, $t > 0$,
provided $n \ge 2$, where


$C_n = \frac{1}{(n-1)A_n} = \frac{1}{4}\,\pi^{-(n+1)/2}\,\Gamma\Big(\frac{n-1}{2}\Big)$.
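A quick consistency check of (4.9) and of the two expressions for $C_n$ can be sketched as follows. On the circle $S^1$ (the case $n = 1$, where $A = (-\Delta_S)^{1/2}$ acts on $e^{ik\theta}$ as multiplication by $|k|$) the kernel of $e^{-tA}$ is the classical Fourier series $\frac{1}{2\pi}\sum_k e^{-t|k|}e^{ik\theta}$, which should agree with the closed form. The formula $A_n = 2\pi^{(n+1)/2}/\Gamma(\frac{n+1}{2})$ for the volume of $S^n$ is a standard fact assumed here, and the function names are hypothetical.

```python
import math


def sphere_area(n: int) -> float:
    """A_n = volume of the unit sphere S^n in R^{n+1} (standard formula, assumed)."""
    return 2 * math.pi ** ((n + 1) / 2) / math.gamma((n + 1) / 2)


def poisson_closed_form(t: float, theta: float) -> float:
    """Right side of (4.9) for n = 1."""
    return (2 / sphere_area(1)) * math.sinh(t) / (2 * math.cosh(t) - 2 * math.cos(theta))


def poisson_series(t: float, theta: float) -> float:
    """(1 / 2 pi) * sum over k in Z of exp(-t |k|) exp(i k theta), summed as a cosine series."""
    total, k = 1.0, 1
    while True:
        weight = math.exp(-t * k)
        if weight < 1e-17:
            return total / (2 * math.pi)
        total += 2 * weight * math.cos(k * theta)
        k += 1


if __name__ == "__main__":
    print(poisson_closed_form(0.3, 1.0), poisson_series(0.3, 1.0))
    for n in range(2, 6):
        print(n, 1 / ((n - 1) * sphere_area(n)),
              0.25 * math.pi ** (-(n + 1) / 2) * math.gamma((n - 1) / 2))
```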


With the exact formula (4.9) for the semigroup $e^{-tA}$, we can proceed to give
formulas for fundamental solutions to various important PDE, particularly


(4.11) $\frac{\partial^2 u}{\partial t^2} - Lu = 0$ (wave equation)
and

(4.12) $\frac{\partial u}{\partial t} - Lu = 0$ (heat equation),
where

(4.13) $L = \Delta_S - \frac{(n-1)^2}{4} = -A^2$.


If we prescribe Cauchy data $u(0) = f$, $u_t(0) = g$ for (4.11), the solution is
(4.14) $u(t) = (\cos tA) f + A^{-1}(\sin tA) g$.

Assume $n \ge 2$. We obtain formulas for these terms by analytic continuation of
the formulas (4.9) and (4.10) to Re $t > 0$ and then passing to the limit $t \in i\mathbb{R}$.
This is parallel to the derivation of the fundamental solution to the wave equation
on Euclidean space in §5 of Chap. 3. We have


(4.15) $A^{-1}e^{(it-\varepsilon)A} = -2C_n\,[2\cosh(it - \varepsilon) - 2\cos\theta]^{-(n-1)/2}$,
 $e^{(it-\varepsilon)A} = -\frac{2}{A_n}\sinh(it - \varepsilon)\,[2\cosh(it - \varepsilon) - 2\cos\theta]^{-(n+1)/2}$.


Letting $\varepsilon \searrow 0$, we have

(4.16) $A^{-1}\sin tA = \lim_{\varepsilon \searrow 0}\ -2C_n\ \mathrm{Im}\,(2\cosh\varepsilon\cos t - 2i\sinh\varepsilon\sin t - 2\cos\theta)^{-(n-1)/2}$


and


(4.17) $\cos tA = \lim_{\varepsilon \searrow 0}\ -\frac{2}{A_n}\ \mathrm{Im}\,(\sin t)\,(2\cosh\varepsilon\cos t - 2i\sinh\varepsilon\sin t - 2\cos\theta)^{-(n+1)/2}$.
For example, on $S^2$ we have, for $0 \le t \le \pi$,


(4.18) $A^{-1}\sin tA = \begin{cases} 2C_2\,(2\cos\theta - 2\cos t)^{-1/2}, & \theta < |t|, \\ 0, & \theta > |t|, \end{cases}$


with an analogous expression for general $t$, determined by the identity
(4.19) $A^{-1}\sin(t + 2\pi)A = -A^{-1}\sin tA$ on $\mathcal{D}'(S^{2k})$,


plus the fact that $\sin tA$ is odd in $t$. The last line on the right in (4.18)
reflects the well-known finite propagation speed for solutions to the hyperbolic
equation (4.11).

To understand how the sign is determined in (4.19), note that, in (4.15),
with $\varepsilon > 0$, for $t = 0$ we have a real kernel, produced by taking the $-(n -
1)/2 = -k + 1/2$ power of a positive quantity. As $t$ runs from 0 to $2\pi$, the
quantity $2\cosh(it - \varepsilon) = 2\cosh\varepsilon\cos t - 2i\sinh\varepsilon\sin t$ moves once clockwise
around a circle of radius $2(\cosh^2\varepsilon + \sinh^2\varepsilon)^{1/2}$, centered at 0, so $2\cosh\varepsilon\cos t -
2i\sinh\varepsilon\sin t - 2\cos\theta$ describes a curve winding once clockwise about the origin
in $\mathbb{C}$. Thus taking a half-integral power of this gives one the negative sign in (4.19).

On the other hand, when $n$ is odd, the exponents on the right side of (4.15)-
(4.17) are integers. Thus


(4.20) $A^{-1}\sin(t + 2\pi)A = A^{-1}\sin tA$ on $\mathcal{D}'(S^{2k+1})$.


Also, in this case, the distributional kernel for $A^{-1}\sin tA$ must vanish for $|t| \ne \theta$.
In other words, the kernel is supported on the shell $\theta = |t|$. This is the generalization
to spheres of the strict Huygens principle.

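In case $n = 2k + 1$ is odd, we obtain from (4.16) and (4.17) that


(4.21) $A^{-1}\sin tA\ f(x) = \frac{1}{(2k-1)!!}\Big(\frac{1}{\sin s}\frac{\partial}{\partial s}\Big)^{k-1}\big(\sin^{2k-1}s\ \overline{f}(x,s)\big)\Big|_{s=t}$


and


(4.22) $\cos tA\ f(x) = \frac{1}{(2k-1)!!}\,\sin s\,\Big(\frac{1}{\sin s}\frac{\partial}{\partial s}\Big)^{k}\big(\sin^{2k-1}s\ \overline{f}(x,s)\big)\Big|_{s=t}$,


where, as in (5.66) of Chap. 3, $(2k - 1)!! = 3\cdot 5\cdots(2k - 1)$ and

(4.23) $\overline{f}(x,s)$ = mean value of $f$ on $\Sigma_s(x) = \{y \in S^n : \theta(x,y) = |s|\}$.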


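We can examine general functions of the operator $A$ by the functional calculus


(4.24) $g(A) = (2\pi)^{-1/2} \int_{-\infty}^{\infty} \hat{g}(t)e^{itA}\,dt = (2\pi)^{-1/2} \int_{-\infty}^{\infty} \hat{g}(t)\cos tA\,dt$,


where the last identity holds provided $g$ is an even function. We can rewrite this,
using the fact that $\cos tA$ has period $2\pi$ in $t$ on $\mathcal{D}'(S^n)$ for $n$ odd, period $4\pi$ for
$n$ even. In concert with (4.22), we have the following formula for the Schwartz
kernel of $g(A)$ on $\mathcal{D}'(S^{2k+1})$, for $g$ even:


(4.25) $g(A) = (2\pi)^{-1/2}\Big(-\frac{1}{2\pi}\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\Big)^{k} \sum_{\ell=-\infty}^{\infty} \hat{g}(\theta + 2\pi\ell)$.


As an example, we compute the heat kernel on odd-dimensional spheres. Take
$g(A) = e^{-tA^2}$. Then $\hat{g}(s) = (2t)^{-1/2}e^{-s^2/4t}$ and


(4.26) $(2\pi)^{-1/2}\sum_{\ell} \hat{g}(s + 2\pi\ell) = (4\pi t)^{-1/2}\sum_{\ell} e^{-(s+2\pi\ell)^2/4t} = \vartheta(s,t)$,


where $\vartheta(s,t)$ is a "theta function." Thus the kernel of $e^{-tA^2}$ on $S^{2k+1}$ is given by


(4.27) $e^{-tA^2} = \Big(-\frac{1}{2\pi}\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\Big)^{k}\vartheta(\theta,t)$.


A similar analysis on $S^{2k}$ gives an integral, with the theta function appearing in
the integrand.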
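The theta-function formula can be compared with the spectral sum on $S^3$. The sketch below assumes that on $S^3$ the eigenvalues of $A$ are $j = 1, 2, 3, \ldots$ and that the projection onto the $j$-eigenspace has kernel $\frac{j}{2\pi^2}\,\frac{\sin j\theta}{\sin\theta}$; both facts are the case $n = 3$ of results established later in this section and are used here only as assumptions. Taking $k = 1$, the kernel of $e^{-tA^2}$ should then equal $-\frac{1}{2\pi\sin\theta}\,\partial_\theta\vartheta(\theta,t)$. The function names are hypothetical.

```python
import math


def spectral_kernel(t: float, theta: float, terms: int = 200) -> float:
    """sum_j exp(-t j^2) * (j / (2 pi^2)) * sin(j theta) / sin(theta) on S^3."""
    total = 0.0
    for j in range(1, terms + 1):
        total += math.exp(-t * j * j) * j * math.sin(j * theta)
    return total / (2 * math.pi**2 * math.sin(theta))


def theta_kernel(t: float, theta: float, wraps: int = 50) -> float:
    """-(1 / (2 pi sin theta)) d/dtheta of (4 pi t)^{-1/2} sum_l exp(-(theta + 2 pi l)^2 / 4t)."""
    deriv = 0.0
    for l in range(-wraps, wraps + 1):
        s = theta + 2 * math.pi * l
        deriv += -(s / (2 * t)) * math.exp(-s * s / (4 * t))
    deriv /= math.sqrt(4 * math.pi * t)
    return -deriv / (2 * math.pi * math.sin(theta))


if __name__ == "__main__":
    for t, th in ((0.5, 1.0), (0.1, 2.0), (1.5, 0.3)):
        print(t, th, spectral_kernel(t, th), theta_kernel(t, th))
```

The two columns should agree to rounding error; the agreement is an instance of the Poisson summation formula.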

The operator $A$ has a compact resolvent on $L^2(S^n)$, and hence a discrete set
of eigenvalues, corresponding to an orthonormal basis of eigenfunctions. Indeed,
the spectrum of $A$ has the following description.


Proposition 4.1. The spectrum of the self-adjoint operator $A$ on $L^2(S^n)$ is

(4.28) $\operatorname{spec} A = \big\{\tfrac{1}{2}(n-1) + k : k = 0, 1, 2, \ldots\big\}$.


Proof. Since 0 is the smallest eigenvalue of $-\Delta_S$, the definition (4.6) shows that
$(n - 1)/2$ is the smallest eigenvalue of $A$. Also, (4.20) shows that all eigenvalues
of $A$ are integers if $n$ is odd, while (4.19) implies that all eigenvalues of $A$ are
(nonintegral) half-integers if $n$ is even. Thus spec $A$ is certainly contained in the
right side of (4.28).

Another way to see this containment is to note that since the function $u(x)$
given by (4.5) must be smooth at $x = 0$, the exponent of $r$ in that formula can
take only integer values.




Let $V_k$ denote the eigenspace of $A$ with eigenvalue $\nu_k = (n - 1)/2 + k$. We
want to show that $V_k \ne 0$ for $k = 0, 1, 2, \ldots$. Moreover, we want to identify
$V_k$. Now if $f \in V_k$, it follows that $u(x) = u(r\omega) = r^{A-(n-1)/2} f(\omega) = r^k f(\omega)$
is a harmonic function defined on all of $\mathbb{R}^{n+1}$, which, being homogeneous and
smooth at $x = 0$, must be a harmonic polynomial, homogeneous of degree $k$ in
$x$. If $\mathcal{H}_k$ denotes the space of harmonic polynomials, homogeneous of degree $k$,
restriction to $S^n \subset \mathbb{R}^{n+1}$ produces an isomorphism:


(4.29) $\rho : \mathcal{H}_k \overset{\approx}{\longrightarrow} V_k$.


To show that each $V_k \ne 0$, it suffices to show that each $\mathcal{H}_k \ne 0$.
Indeed, for $c = (c_1, \ldots, c_{n+1}) \in \mathbb{C}^{n+1}$, consider


$p_c(x) = (c_1 x_1 + \cdots + c_{n+1}x_{n+1})^k$.
A computation gives


$\Delta p_c(x) = k(k-1)\,(c,c)\,(c_1 x_1 + \cdots + c_{n+1}x_{n+1})^{k-2}$,
$(c,c) = c_1^2 + \cdots + c_{n+1}^2$.
Hence $\Delta p_c = 0$ whenever $(c,c) = 0$, so the proposition is proved.
We now want to specify the orthogonal projections $E_k$ of $L^2(S^n)$ on $V_k$. We
can attack this via (4.10), which implies


(4.30) $\sum_{k=0}^{\infty} \nu_k^{-1}e^{-t\nu_k}E_k(x,y) = 2C_n\,(2\cosh t - 2\cos\theta)^{-(n-1)/2}$,


where $\theta = \theta(x,y)$ is the geodesic distance from $x$ to $y$ in $S^n$. If we set $r = e^{-t}$
and use $\nu_k = (n - 1)/2 + k$, we get the generating function identity


(4.31) $\sum_{k=0}^{\infty} r^k\nu_k^{-1}E_k(x,y) = 2C_n\,(1 - 2r\cos\theta + r^2)^{-(n-1)/2} = \sum_{k=0}^{\infty} r^k p_k(\cos\theta)$;

in particular,
(4.32) $E_k(x,y) = \nu_k\,p_k(\cos\theta)$.

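These functions are polynomials in $\cos\theta$. To see this, set $t = \cos\theta$ and write


(4.33) $(1 - 2tr + r^2)^{-\alpha} = \sum_{k=0}^{\infty} C_k^{\alpha}(t)\,r^k$,


thus defining coefficients $C_k^{\alpha}(t)$. To compute these, use

$(1 - z)^{-\alpha} = \sum_{j=0}^{\infty}\binom{j+\alpha-1}{j}z^j$,


with $z = r(2t - r)$, to write the left side of (4.33) as


$\sum_{j=0}^{\infty}\binom{j+\alpha-1}{j}r^j(2t - r)^j = \sum_{j=0}^{\infty}\sum_{\ell=0}^{j}(-1)^{\ell}\binom{j+\alpha-1}{j}\binom{j}{\ell}(2t)^{j-\ell}\,r^{j+\ell}$.


Hence


(4.34) $C_k^{\alpha}(t) = \sum_{\ell=0}^{[k/2]}(-1)^{\ell}\binom{k-\ell+\alpha-1}{k-\ell}\binom{k-\ell}{\ell}(2t)^{k-2\ell}$.

These are called Gegenbauer polynomials. Therefore, we have the following: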


Proposition 4.2. The orthogonal projection of $L^2(S^n)$ onto $V_k$ has kernel

(4.35) $E_k(x,y) = 2C_n\nu_k\,C_k^{\alpha}(\cos\theta)$, $\alpha = \tfrac{1}{2}(n - 1)$,
with $C_n$ as in (4.10).
In the special case $n = 2$, we have $C_2 = 1/4\pi$, and $\nu_k = k + 1/2$; hence


(4.36) $E_k(x,y) = \frac{2k+1}{4\pi}\,C_k^{1/2}(\cos\theta) = \frac{2k+1}{4\pi}\,P_k(\cos\theta)$,


where $C_k^{1/2}(t) = P_k(t)$ are the Legendre polynomials.
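The coefficients $C_k^{\alpha}(t)$ can be computed directly from the defining expansion $(1 - 2tr + r^2)^{-\alpha} = \sum_k C_k^{\alpha}(t)r^k$ by substituting $z = r(2t - r)$ into the binomial series and collecting powers of $r$, which gives the explicit sum implemented below. The sketch checks that sum against the generating function and against the first Legendre polynomials for $\alpha = 1/2$. The function names are hypothetical, and the generalized binomial coefficient is evaluated as a finite product.

```python
import math


def gen_binom(a: float, j: int) -> float:
    """Generalized binomial coefficient a (a-1) ... (a-j+1) / j! for real a, integer j >= 0."""
    out = 1.0
    for i in range(j):
        out *= (a - i) / (i + 1)
    return out


def gegenbauer(k: int, alpha: float, t: float) -> float:
    """C_k^alpha(t) = sum_l (-1)^l binom(k-l+alpha-1, k-l) binom(k-l, l) (2t)^(k-2l)."""
    total = 0.0
    for l in range(k // 2 + 1):
        total += ((-1) ** l * gen_binom(k - l + alpha - 1, k - l)
                  * math.comb(k - l, l) * (2 * t) ** (k - 2 * l))
    return total


def generating_function(alpha: float, t: float, r: float, terms: int = 60) -> float:
    """Partial sum of sum_k C_k^alpha(t) r^k, to be compared with (1 - 2tr + r^2)^(-alpha)."""
    return sum(gegenbauer(k, alpha, t) * r**k for k in range(terms))


if __name__ == "__main__":
    t = 0.3
    print(gegenbauer(2, 0.5, t), (3 * t * t - 1) / 2)
    print(generating_function(1.5, t, 0.4), (1 - 2 * t * 0.4 + 0.4**2) ** -1.5)
```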
The trace of $E_k$ is easily obtained by integrating (4.35) over the diagonal, to
yield


(4.37) $\operatorname{Tr} E_k = 2C_nA_n\nu_k\,C_k^{\alpha}(1) = \frac{2k+n-1}{n-1}\,C_k^{\alpha}(1)$, $\alpha = \tfrac{1}{2}(n-1)$.


Setting $t = 1$ in (4.33), so $(1 - 2r + r^2)^{-\alpha} = (1 - r)^{-2\alpha}$, we obtain


(4.38) $C_k^{\alpha}(1) = \binom{k+2\alpha-1}{k} = \binom{k+n-2}{k}$.


Thus we have the dimensions of the eigenspaces $V_k$:




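Corollary 4.3. The eigenspace $V_k$ of $-\Delta_S$ on $S^n$, with eigenvalue


$\lambda_k = \nu_k^2 - \tfrac{1}{4}(n-1)^2 = k^2 + (n-1)k$,
satisfies

(4.39) $\dim V_k = \frac{2k+n-1}{n-1}\binom{k+n-2}{k} = \binom{k+n-2}{k-1} + \binom{k+n-1}{k}$.


In particular, on $S^2$ we have $\dim V_k = 2k + 1$.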
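The dimension count can be cross-checked against an independent count. The sketch below compares $\frac{2k+n-1}{n-1}\binom{k+n-2}{k}$ with the dimension of the space of degree-$k$ harmonic polynomials in $n+1$ variables, computed as $\binom{n+k}{k} - \binom{n+k-2}{k-2}$. That second expression is a standard fact (homogeneous polynomials of degree $k$ modulo $|x|^2$ times those of degree $k-2$) which is assumed here and not derived in the text. The function names are hypothetical.

```python
import math


def dim_formula(n: int, k: int) -> int:
    """dim V_k = (2k + n - 1)/(n - 1) * binom(k + n - 2, k), for n >= 2."""
    return (2 * k + n - 1) * math.comb(k + n - 2, k) // (n - 1)


def dim_harmonic(n: int, k: int) -> int:
    """Degree-k harmonic polynomials in n + 1 variables (standard count, assumed)."""
    dim = math.comb(n + k, k)
    if k >= 2:
        dim -= math.comb(n + k - 2, k - 2)
    return dim


def eigenvalue(n: int, k: int) -> int:
    """Eigenvalue k^2 + (n - 1) k of -Delta_S on V_k."""
    return k * k + (n - 1) * k


if __name__ == "__main__":
    for n in (2, 3, 4):
        print(n, [(eigenvalue(n, k), dim_formula(n, k)) for k in range(5)])
```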


Another natural approach to $E_k$ is via the wave equation. We have


(4.40) $E_k = \frac{1}{2T}\int_{-T}^{T} e^{-i\nu_k t}e^{itA}\,dt = \frac{1}{2T}\int_{-T}^{T} \cos t(A - \nu_k)\,dt$,


where $T = \pi$ or $2\pi$ depending on whether $n$ is odd or even. (In either case, one
can take $T = 2\pi$.) In the special case of $S^2$, when (4.18) is used, comparison of
(4.36) with the formula produced by this method produces the identity


(4.41) $P_k(\cos\theta) = \frac{1}{\pi}\int_{-\theta}^{\theta}\frac{\cos(k + \frac{1}{2})t}{(2\cos t - 2\cos\theta)^{1/2}}\,dt$,


for the Legendre polynomials, known as the Mehler-Dirichlet formula.
Wave equation techniques also provide a valuable tool for the study of pointwise
convergence of spherical harmonic expansions on $S^n$, in [PT].
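The Mehler-Dirichlet formula can be tested numerically. The integrand has an integrable square-root singularity at $t = \pm\theta$, so the sketch below uses the substitution $\sin(t/2) = \sin(\theta/2)\sin\varphi$, which turns the integral into $\frac{2}{\pi}\int_0^{\pi/2}\cos((k+\tfrac12)t)/\cos(t/2)\,d\varphi$ with a smooth integrand. This substitution, the midpoint rule, and the function names are choices made for this example only; the Legendre polynomials are evaluated by the standard three-term recurrence.

```python
import math


def legendre(k: int, x: float) -> float:
    """P_k(x) via (m + 1) P_{m+1} = (2m + 1) x P_m - m P_{m-1}."""
    if k == 0:
        return 1.0
    p_prev, p = 1.0, x
    for m in range(1, k):
        p_prev, p = p, ((2 * m + 1) * x * p - m * p_prev) / (m + 1)
    return p


def mehler_dirichlet(k: int, theta: float, steps: int = 2000) -> float:
    """(1/pi) int_{-theta}^{theta} cos((k + 1/2) t) (2 cos t - 2 cos theta)^(-1/2) dt, 0 < theta < pi."""
    s = math.sin(theta / 2)
    h = (math.pi / 2) / steps
    total = 0.0
    for i in range(steps):
        phi = (i + 0.5) * h                        # midpoint rule in phi
        t = 2 * math.asin(s * math.sin(phi))      # sin(t/2) = sin(theta/2) sin(phi)
        total += math.cos((k + 0.5) * t) / math.cos(t / 2)
    return (2 / math.pi) * total * h


if __name__ == "__main__":
    for k in range(4):
        print(k, mehler_dirichlet(k, 1.3), legendre(k, math.cos(1.3)))
```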


Exercises 


Exercises 1-5 deal with results that follow from symmetries of the sphere. The group
$SO(n + 1)$ acts as a group of isometries of $S^n \subset \mathbb{R}^{n+1}$, hence as a group of unitary
operators on $L^2(S^n)$. Each eigenspace $V_k$ of the Laplace operator is preserved by this
action. Fix $p = (0, \ldots, 0, 1) \in S^n$, regarded as the "north pole." The subgroup of
$SO(n + 1)$ fixing $p$ is a copy of $SO(n)$.

1. Show that each eigenspace $V_k$ has an element $u$ such that $u(p) \ne 0$. Conclude by
forming


$\int_{SO(n)} u(gx)\,dg$

that each eigenspace $V_k$ of $\Delta_S$ has an element $z_k \ne 0$ such that $z_k(x) = z_k(gx)$, for
all $g \in SO(n)$. Such a function is called a spherical function, or a zonal function.
2. Suppose $V_k$ has a proper subspace $W$ invariant under $SO(n + 1)$. (Hence $W^{\perp} \subset V_k$
is also invariant.) Show that $W$ must contain a nonzero spherical function.


Exercises 133 


3. Suppose $z_k$ and $y_k$ are two nonzero spherical functions in $V_k$. Show that they must
be multiples of each other. Hence the unique spherical functions (up to constant
multiples) are given by (4.35), with $y = p$. (Hint: $z_k$ and $y_k$ are eigenfunctions of $-\Delta_S$,
with eigenvalue $\lambda_k = k^2 + (n - 1)k$. Pick a sequence of surfaces


$\Sigma_j = \{x \in S^n : \theta(x,p) = \varepsilon_j\} \subset S^n$,


with $\varepsilon_j \to 0$, on which $z_k = \alpha_j \ne 0$. With $\beta_j = y_k|_{\Sigma_j}$, it follows that $\beta_j z_k - \alpha_j y_k$
is an eigenfunction of $-\Delta_S$ that vanishes on $\Sigma_j$. Show that, for $j$ large, this forces
$\beta_j z_k - \alpha_j y_k$ to be identically zero.)

4. Using Exercises 2 and 3, show that the action of $SO(n + 1)$ on each eigenspace $V_k$ is
irreducible, that is, $V_k$ has no proper invariant subspaces.

5. Show that each $V_k$ is equal to the linear span of the set of polynomials of the form
$p_c(x) = (c_1 x_1 + \cdots + c_{n+1}x_{n+1})^k$, with $(c,c) = 0$.
(Hint: Show that this linear span is invariant under $SO(n + 1)$.)

6. Using (4.9), show that


(4.42) $\operatorname{Tr} e^{-tA} = \frac{2\sinh t}{(2\cosh t - 2)^{(n+1)/2}}$.


Find the asymptotic behavior as $t \searrow 0$. Use Karamata's Tauberian theorem to determine
the asymptotic behavior of the eigenvalues of $A$, hence of $-\Delta_S$. Compare this
with the general results of §3 and also with the explicit results of Corollary 4.3.

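7. Using (4.27), show that, for $A$ on $S^n$ with $n = 2k + 1$,


(4.43) $\operatorname{Tr} e^{-tA^2} = \frac{A_{2k+1}}{\sqrt{4\pi t}}\Big(-\frac{1}{2\pi\sin\theta}\frac{\partial}{\partial\theta}\Big)^{k} e^{-\theta^2/4t}\Big|_{\theta=0} + O(t^{\infty})
 = (4\pi t)^{-n/2}A_{2k+1} + O(t^{-n/2+1})$,
as $t \searrow 0$. Compare the general results of §3.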
8. Show that
(4.44) $e^{\pm\pi i(A - (n-1)/2)} f(\omega) = f(-\omega)$, $f \in L^2(S^n)$.


(Hint: Check it for $f \in V_k$, the restriction to $S^n$ of a homogeneous harmonic
polynomial of degree $k$.)
Exercises 9-13 deal with analysis on $S^n$ when $n = 2$. When doing them, look for
generalizations to other values of $n$.

9. If $\Xi(A)$ has integral kernel $K_\Xi(x,y)$, show that when $n = 2$,


(4.45) $K_\Xi(x,y) = \frac{1}{4\pi}\sum_{\ell=0}^{\infty}(2\ell+1)\,\Xi\big(\ell + \tfrac{1}{2}\big)\,P_\ell(\cos\theta)$,


where $\cos\theta = x\cdot y$ and $P_\ell(t)$ are the Legendre polynomials.
10. Demonstrate the Rodrigues formula for the Legendre polynomials:


(4.46) $P_\ell(t) = \frac{1}{2^{\ell}\,\ell!}\Big(\frac{d}{dt}\Big)^{\ell}(t^2 - 1)^{\ell}$.


(Hint: Use Cauchy's formula to get


$P_\ell(t) = \frac{1}{2\pi i}\int_\gamma (1 - 2tz + z^2)^{-1/2}\,z^{-\ell-1}\,dz$


from (4.33); then use the change of variable $1 - uz = (1 - 2tz + z^2)^{1/2}$. Then appeal
to Cauchy's formula again, to analyze the resulting integral.)

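11. If $f \in L^2(S^2)$ has the form $f(x) = g(x\cdot y) = \sum_\ell \varphi_\ell P_\ell(x\cdot y)$, for some $y \in S^2$,
show that


(4.47) $\varphi_\ell = \frac{2\ell+1}{4\pi}\int_{S^2} f(z)P_\ell(y\cdot z)\,dS(z) = \big(\ell + \tfrac{1}{2}\big)\int_{-1}^{1} g(t)P_\ell(t)\,dt$.


(Hint: Use $\int_{S^2} E_k(x,z)E_\ell(z,y)\,dS(z) = \delta_{k\ell}E_\ell(x,y)$.) Conclude that $g(x\cdot y)$ is
the integral kernel of $\psi(A - 1/2)$, where


(4.48) $\psi(\ell) = \frac{4\pi}{2\ell+1}\,\varphi_\ell = 2\pi\int_{-1}^{1} g(t)P_\ell(t)\,dt$.


This result is known as the Funk-Hecke theorem.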
12. Show that, for $x, y \in S^2$,


(4.49) $e^{ikx\cdot y} = \sum_{\ell=0}^{\infty}(2\ell+1)\,i^{\ell}\,j_\ell(k)\,P_\ell(x\cdot y)$,
where
(4.50) $j_\ell(z) = \Big(\frac{\pi}{2z}\Big)^{1/2}J_{\ell+1/2}(z) = \frac{1}{2}\,\frac{1}{\ell!}\Big(\frac{z}{2}\Big)^{\ell}\int_{-1}^{1}(1 - t^2)^{\ell}e^{izt}\,dt$.


(Hint: Take $g(t) = e^{ikt}$ in Exercise 11, apply the Rodrigues formula, and integrate by
parts.) Thus $e^{ikx\cdot y}$ is the integral kernel of the operator


$4\pi\,e^{(1/2)\pi i(A - 1/2)}\,j_{A-1/2}(k)$.


For another approach, see Exercises 10 and 11 in §9 of Chap. 9.
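13. Demonstrate the identities


(4.51) $\Big[(1 - t^2)\frac{d}{dt} + \ell t\Big]P_\ell(t) = \ell P_{\ell-1}(t)$
and
(4.52) $\frac{d}{dt}\Big[(1 - t^2)\frac{d}{dt}P_\ell(t)\Big] + \ell(\ell + 1)P_\ell(t) = 0$.


Relate (4.52) to the statement that, for fixed $y \in S^2$, $\varphi(x) = P_\ell(x\cdot y)$ belongs to the
$\ell(\ell + 1)$-eigenspace of $-\Delta_S$.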


Exercises 14-19 deal with formulas for an orthogonal basis of $V_k$ (for $S^2$). We will
make use of the structure of irreducible unitary representations of $SO(3)$, obtained in
§9 of Appendix B, Manifolds, Vector Bundles, and Lie Groups.

We recall some of the results established there. The Lie algebra skew(3) of $SO(3)$ is
spanned by $X_1, X_2, X_3$, satisfying $[X_j, X_{j+1}] = X_{j+2}$, $j \in \mathbb{Z}/(3)$. If $\pi$ is a unitary
representation of $SO(3)$, we set




$L_j = d\pi(X_j)$, $L_\pm = L_2 \mp iL_3$.


It is shown in Appendix B that each irreducible unitary representation of $SO(3)$
is equivalent to a representation $D_k$ ($k \in \mathbb{Z}^+$) of $SO(3)$ on $\mathbb{C}^{2k+1} \approx \mathcal{P}_{2k}$,
the space of polynomials on $\mathbb{C}^2$, homogeneous of degree $2k$, having basis
$\varphi_{kj} = z_1^{2k-j}z_2^{j}$, $0 \le j \le 2k$, on which


$L_1\varphi_{kj} = i(-k + j)\varphi_{kj}$, $L_+\varphi_{kj} = -(2k - j)\varphi_{k,j+1}$, $L_-\varphi_{kj} = j\varphi_{k,j-1}$.


The element $w_k = \varphi_{k,2k}$, satisfying $L_1 w_k = ik\,w_k$, is (up to scaling) the highest
weight vector.
The relevance to the action of $SO(3)$ on $L^2(S^2) = \bigoplus_k V_k$ is that, by Exercise
4 above, the action on each eigenspace $V_k$ is irreducible, and, by Corollary 4.3,
$\dim V_k = 2k + 1$.

14. Show that the representation of $SO(3)$ on $V_k$ is equivalent to the representation $D_k$,
for each $k = 0, 1, 2, \ldots$.

15. Show that if we use coordinates $(\theta, \psi)$ on $S^2$, where $\theta$ is the geodesic distance from
$(1, 0, 0)$ and $\psi$ is the angular coordinate about the $x_1$-axis in $\mathbb{R}^3$, then


(4.53) $L_1 = \frac{\partial}{\partial\psi}$, $L_\pm = ie^{\pm i\psi}\Big[\pm\frac{\partial}{\partial\theta} + i\cot\theta\,\frac{\partial}{\partial\psi}\Big]$.
16. Set
(4.54) $w_k(x) = (x_2 + ix_3)^k = \sin^k\theta\ e^{ik\psi}$.


Show that $w_k \in V_k$ and that it is the highest-weight vector for the representation, so
$L_1 w_k = ik\,w_k$.


17. Show that an orthogonal basis of $V_k$ is given by


$w_k,\ L_- w_k,\ \ldots,\ L_-^{2k}w_k$.


18. Show that the functions $\zeta_{kj} = L_-^{k-j}w_k$, $j \in \{-k, -k+1, \ldots, k-1, k\}$, listed in
Exercise 17 coincide, up to nonzero constant factors, with $z_{kj}$, given by


$z_{k0} = z_k$,
the spherical function considered in Exercises 1-3, and, for $1 \le j \le k$,
$z_{k,-j} = L_-^{j}z_k$, $z_{kj} = L_+^{j}z_k$.
19. Show that the functions $z_{kj}$ coincide, up to nonzero constant factors, with
(4.55) $e^{ij\psi}P_k^{j}(\cos\theta)$, $-k \le j \le k$,


where $P_k^{j}(t)$, called associated Legendre functions, are defined by

(4.56) $P_k^{j}(t) = (-1)^{j}(1 - t^2)^{|j|/2}\Big(\frac{d}{dt}\Big)^{|j|}P_k(t)$.

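20. For a formula for the projections alternative to (4.35), replace (4.30)-(4.31) by the
following consequence of (4.4)-(4.5):


$\sum_{k=0}^{\infty} r^k E_k f(\omega) = \frac{1 - r^2}{A_n}\int_{S^n}\frac{f(\omega')}{(1 - 2r\,\omega\cdot\omega' + r^2)^{(n+1)/2}}\,dS(\omega')$,

to obtain
$E_k f(\omega) = \frac{1}{A_n}\int_{S^n}\big[C_k^{(n+1)/2}(\omega\cdot\omega') - C_{k-2}^{(n+1)/2}(\omega\cdot\omega')\big]\,f(\omega')\,dS(\omega')$.


By convention $C_k^{\alpha} = 0$ for $k < 0$. Sum over $0 \le k \le N$ to get


$S_N f(\omega) = \frac{1}{A_n}\int_{S^n}\big[C_N^{(n+1)/2}(\omega\cdot\omega') + C_{N-1}^{(n+1)/2}(\omega\cdot\omega')\big]\,f(\omega')\,dS(\omega')$,


where $S_N = E_0 + E_1 + \cdots + E_N$.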


5. The Laplace operator on hyperbolic space 


The hyperbolic space $\mathcal{H}^n$ shares with the sphere $S^n$ the property of having
constant sectional curvature, but for $\mathcal{H}^n$ it is $-1$. One way to describe $\mathcal{H}^n$ is as a
set of vectors with square length 1 in $\mathbb{R}^{n+1}$ not for a Euclidean metric, but rather
for a Lorentz metric


(5.1) $\langle v, v\rangle = -v_1^2 - \cdots - v_n^2 + v_{n+1}^2$,
namely,
(5.2) $\mathcal{H}^n = \{v \in \mathbb{R}^{n+1} : \langle v, v\rangle = 1,\ v_{n+1} > 0\}$,


with metric tensor induced from (5.1). The connected component $G$ of the identity
of the group $O(n, 1)$ of linear transformations preserving the quadratic form
(5.1) acts transitively on $\mathcal{H}^n$, as a group of isometries. In fact, $SO(n)$, acting on
$\mathbb{R}^n \subset \mathbb{R}^{n+1}$, leaves invariant $p = (0, \ldots, 0, 1) \in \mathcal{H}^n$ and acts transitively on the
unit sphere in $T_p\mathcal{H}^n$. Also, if $A(u_1, \ldots, u_n, u_{n+1})^t = (0, \ldots, 0, u_{n+1}, u_n)^t$, then
$e^{tA}$ is a one-parameter subgroup of $SO(n, 1)$ taking $p$ to the curve


$\gamma = \{(0, \ldots, 0, x_n, x_{n+1}) : x_{n+1}^2 - x_n^2 = 1,\ x_{n+1} > 0\}$.


Together these facts imply that $\mathcal{H}^n$ is a homogeneous space.
There is a map of $\mathcal{H}^n$ onto the unit ball in $\mathbb{R}^n$, defined in a fashion similar to
the stereographic projection of $S^n$. The map


(5.3) $s : \mathcal{H}^n \to B^n = \{x \in \mathbb{R}^n : |x| < 1\}$
is defined by


(5.4) $s(x, x_{n+1}) = (1 + x_{n+1})^{-1}x$.


5. The Laplace operator on hyperbolic space 137 


The metric on $\mathcal{H}^n$ defined above then yields the following metric tensor on $B^n$:
(5.5) $ds^2 = 4(1 - |x|^2)^{-2}\sum_{j=1}^{n} dx_j^2$.


Another useful representation of hyperbolic space is as the upper half space
$\mathbb{R}^n_+ = \{x \in \mathbb{R}^n : x_n > 0\}$, with a metric we will specify shortly. In fact, with
$e_n = (0, \ldots, 0, 1)$,


(5.6) $\tau(x) = |x + e_n|^{-2}(x + e_n) - \tfrac{1}{2}e_n$


defines a map of the unit ball $B^n$ onto $\mathbb{R}^n_+$, taking the metric (5.5) to


(5.7) $ds^2 = x_n^{-2}\sum_{j=1}^{n} dx_j^2$.

The Laplace operator for the metric (5.7) has the form


(5.8) $\Delta u = \sum_{j=1}^{n} x_n^{n}\,\partial_j(x_n^{2-n}\,\partial_j u) = x_n^2\sum_{j=1}^{n}\partial_j^2 u + (2 - n)x_n\,\partial_n u$,

which is convenient for a number of computations, such as (5.9) in the following:


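Proposition 5.1. If $\Delta$ is the Laplace operator on $\mathcal{H}^n$, then $\Delta$ is essentially self-adjoint on $C_0^\infty(\mathcal{H}^n)$, and its natural self-adjoint extension has the property

(5.9) $$\operatorname{spec}(-\Delta) \subset \bigl[\tfrac{1}{4}(n - 1)^2, \infty\bigr).$$

Proof. Since $\mathcal{H}^n$ is a complete Riemannian manifold, the essential self-adjointness on $C_0^\infty(\mathcal{H}^n)$ follows from Proposition 2.4. To establish (5.9), it suffices to show that

$$(-\Delta u, u)_{L^2(\mathcal{H}^n)} \ge \frac{(n-1)^2}{4}\, \|u\|^2_{L^2(\mathcal{H}^n)},$$

for all $u \in C_0^\infty(\mathcal{H}^n)$. Now the volume element on $\mathcal{H}^n$, identified with the upper half-space with the metric (5.7), is $x_n^{-n}\, dx_1 \cdots dx_n$, so for such $u$ we have

(5.10) $$\Bigl(\bigl(-\Delta - \tfrac{(n-1)^2}{4}\bigr)u, u\Bigr) = \int \Bigl[(\partial_n u)^2 - \Bigl(\frac{n-1}{2x_n}\Bigr)^2 u^2\Bigr] x_n^{2-n}\, dx_1 \cdots dx_n + \sum_{j=1}^{n-1} \int (\partial_j u)^2\, x_n^{2-n}\, dx_1 \cdots dx_n.$$

Now, by an integration by parts, the first integral on the right is equal to

(5.11) $$\int_{\mathbb{R}^n_+} \Bigl[\partial_n\bigl(x_n^{(1-n)/2} u\bigr)\Bigr]^2 x_n\, dx_1 \cdots dx_n.$$

Thus the expression (5.10) is $\ge 0$, and (5.9) is proved.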
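As a small numerical illustration of the half-space formula (5.8), the following Python sketch applies it to functions of the form $u(x) = x_n^s$, which depend on $x_n$ alone. A direct calculation gives $-\Delta x_n^s = s(n-1-s)\,x_n^s$, and for real $s$ the factor $s(n-1-s)$ never exceeds $(n-1)^2/4$, the threshold appearing in (5.9). These functions are not in $L^2$, so this is only a consistency check on the formula, not a proof. The function names are illustrative and not taken from the text.

```python
def apply_hyperbolic_laplacian(s, n, x_n, eps=1e-4):
    """Apply the operator (5.8) to u(x) = x_n**s using central differences.

    Since u depends only on x_n, only the j = n terms of (5.8) contribute:
    Delta u = x_n^2 * u'' + (2 - n) * x_n * u'.
    """
    u = lambda y: y ** s
    d1 = (u(x_n + eps) - u(x_n - eps)) / (2 * eps)
    d2 = (u(x_n + eps) - 2 * u(x_n) + u(x_n - eps)) / eps ** 2
    return x_n ** 2 * d2 + (2 - n) * x_n * d1


def formal_eigenvalue(s, n):
    """The number lam with -Delta(x_n**s) = lam * x_n**s, namely s(n - 1 - s)."""
    return s * (n - 1 - s)


if __name__ == "__main__":
    n, s, x_n = 3, 0.4, 1.5
    print(-apply_hyperbolic_laplacian(s, n, x_n), formal_eigenvalue(s, n) * x_n ** s)
    print(formal_eigenvalue((n - 1) / 2, n), (n - 1) ** 2 / 4)
```

The maximum of $s(n-1-s)$ is attained at $s = (n-1)/2$, which is the exponent used in the substitution $x_n^{(1-n)/2}u$ in (5.11), up to sign.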


We next describe how to obtain the fundamental solution to the wave equation on $\mathcal{H}^n$. This will be obtained from the formula for $S^n$, via an analytic continuation in the metric tensor. Let $p$ be a fixed point (e.g., the north pole) in $S^n$, taken to be the origin in geodesic normal coordinates. Consider the one-parameter family of metrics given by dilating the sphere, which has constant curvature $K = 1$. Spheres dilated to have radius $> 1$ have constant curvature $K \in (0, 1)$. On such a space, the fundamental kernel $A^{-1} \sin tA\, \delta_p(x)$, with

(5.12) $$A = \Bigl(-\Delta + \frac{K}{4}(n - 1)^2\Bigr)^{1/2},$$

can be obtained explicitly from that on the unit sphere by a change of scale. The explicit representation so obtained continues analytically to all real values of $K$ and at $K = -1$ gives a formula for the wave kernel,

(5.13) $$A^{-1} \sin tA\, \delta_p(x) = R(t, p, x), \qquad A = \Bigl(-\Delta - \frac{1}{4}(n - 1)^2\Bigr)^{1/2}.$$

We have

(5.14) $$R(t, p, x) = \lim_{\varepsilon \searrow 0} -2C_n\, \operatorname{Im} \bigl[2\cos(it - \varepsilon) - 2\cosh r\bigr]^{-(n-1)/2},$$

where $r = r(p, x)$ is the geodesic distance from $p$ to $x$. Here, as in (4.10), $C_n = 1/(n - 1)A_n$. This exhibits several properties similar to those in the case of $S^n$ discussed in §4. Of course, for $r > |t|$, the limit vanishes, exhibiting the finite propagation speed phenomenon. Also, if $n$ is odd, the exponent $(n - 1)/2$ is an integer, which implies that (5.14) is supported on the shell $r = |t|$.

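In case $n = 3$, a use of the Plemelj formula similar to that done in Chapter 3, §5 yields the formula

(5.15) $$\frac{\sin tA}{A}\, \delta_p(x) = \frac{1}{4\pi}\, \frac{\delta(r - t)}{\sinh r}.$$

In analogy with (4.25), we have the following formula for $g(A)\delta_p(x)$, for $g \in \mathcal{S}(\mathbb{R})$, when acting on $L^2(\mathcal{H}^n)$, with $n = 2k + 1$:

(5.16) $$g(A) = (2\pi)^{-1/2} \Bigl(-\frac{1}{2\pi \sinh r}\, \frac{\partial}{\partial r}\Bigr)^k \hat{g}(r).$$

If $n = 2k$, we have

(5.17) $$g(A) = \frac{1}{\pi^{1/2}} \int_r^\infty \Bigl(-\frac{1}{2\pi \sinh s}\, \frac{\partial}{\partial s}\Bigr)^k \hat{g}(s)\, (\cosh s - \cosh r)^{-1/2} \sinh s\, ds.$$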
Tv i 


REMARK. It is also of interest to have formulas for the Schwartz kernel of $(\sin t\sqrt{-\Delta})/\sqrt{-\Delta}$ on hyperbolic space. How to obtain such formulas from (5.13)-(5.14) follows from a general result relating wave equations and shifted wave equations, treated in Appendix B to this chapter.


Exercises 


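1. If $n = 2k + 1$, show that the Schwartz kernel of $(-\Delta - (n - 1)^2/4 - z^2)^{-1}$ on $\mathcal{H}^n$, for $\operatorname{Im} z > 0$, is

(5.18) $$G_z(x, y) = -\frac{1}{2iz} \Bigl(-\frac{1}{2\pi \sinh r}\, \frac{\partial}{\partial r}\Bigr)^k e^{izr},$$

where $r = r(x, y)$ is geodesic distance, and the integral kernel of $e^{t(\Delta + (n-1)^2/4)}$, for $t > 0$, is

$$H_t(x, y) = \frac{1}{\sqrt{4\pi t}} \Bigl(-\frac{1}{2\pi \sinh r}\, \frac{\partial}{\partial r}\Bigr)^k e^{-r^2/4t}.$$

REMARK. The formula (5.18) leads to an analysis of the $L^p$-spectrum of $\Delta$ on $\mathcal{H}^n$, showing a dependence on $p$. This analysis is pursued further in [DST], yielding results on the $L^p$-spectrum of $\Delta$ on a class of quotients of hyperbolic space.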


6. The harmonic oscillator 


We consider the differential operator $H = -\Delta + |x|^2$ on $L^2(\mathbb{R}^n)$. By Proposition 2.7, $H$ is essentially self-adjoint on $C_0^\infty(\mathbb{R}^n)$. Furthermore, as a special case of Proposition 2.8, we know that $H$ has compact resolvent, so $L^2(\mathbb{R}^n)$ has an orthonormal basis of eigenfunctions of $H$. To work out the spectrum, it suffices to work with the case $n = 1$, so we consider $H = D^2 + x^2$, where $D = -i\, d/dx$.




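The spectral analysis follows by some simple algebraic relations, involving the operators

(6.1) $$a = D - ix = \frac{1}{i}\Bigl(\frac{d}{dx} + x\Bigr), \qquad a^+ = D + ix = \frac{1}{i}\Bigl(\frac{d}{dx} - x\Bigr).$$

Note that on $\mathcal{D}'(\mathbb{R})$,

(6.2) $$H = aa^+ - I = a^+a + I,$$

and

(6.3) $$[H, a] = -2a, \qquad [H, a^+] = 2a^+.$$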


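Suppose that $u_j \in C^\infty(\mathbb{R})$ is an eigenfunction of $H$, that is,

(6.4) $$u_j \in \mathcal{D}(H), \qquad Hu_j = \lambda_j u_j.$$

Now, by material developed in §2,

(6.5) $$\mathcal{D}(H^{1/2}) = \{u \in L^2(\mathbb{R}) : Du \in L^2(\mathbb{R}),\ xu \in L^2(\mathbb{R})\}, \qquad \mathcal{D}(H) = \{u \in L^2(\mathbb{R}) : D^2u + x^2u \in L^2(\mathbb{R})\}.$$

Since certainly each $u_j$ belongs to $\mathcal{D}(H^{1/2})$, it follows that $au_j$ and $a^+u_j$ belong to $L^2(\mathbb{R})$. By (6.3), we have

(6.6) $$H(au_j) = (\lambda_j - 2)\, au_j, \qquad H(a^+u_j) = (\lambda_j + 2)\, a^+u_j.$$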

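It follows that $au_j$ and $a^+u_j$ belong to $\mathcal{D}(H)$ and are eigenfunctions. Hence, if

(6.7) $$\operatorname{Eigen}(\lambda, H) = \{u \in \mathcal{D}(H) : Hu = \lambda u\},$$

we have, for all $\lambda \in \mathbb{R}$,

(6.8) $$a^+ : \operatorname{Eigen}(\lambda, H) \to \operatorname{Eigen}(\lambda + 2, H), \qquad a : \operatorname{Eigen}(\lambda + 2, H) \to \operatorname{Eigen}(\lambda, H).$$

From (6.2) it follows that $(Hu, u) \ge \|u\|^2_{L^2}$, for all $u \in C_0^\infty(\mathbb{R})$; hence, in view of essential self-adjointness,

(6.9) $$\operatorname{spec} H \subset [1, \infty), \quad \text{for } n = 1.$$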


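Now each space $\operatorname{Eigen}(\lambda, H)$ is a finite-dimensional subspace of $C^\infty(\mathbb{R})$, and, by (6.2), we conclude that, in (6.8), $a^+$ is an isomorphism of $\operatorname{Eigen}(\lambda_j, H)$ onto $\operatorname{Eigen}(\lambda_j + 2, H)$, for each $\lambda_j \in \operatorname{spec} H$. Also, $a$ is an isomorphism of $\operatorname{Eigen}(\lambda_j, H)$ onto $\operatorname{Eigen}(\lambda_j - 2, H)$, for all $\lambda_j > 1$. On the other hand, $a$ must annihilate $\operatorname{Eigen}(\lambda_0, H)$ when $\lambda_0$ is the smallest element of $\operatorname{spec} H$, so

(6.10) $$u_0 \in \operatorname{Eigen}(\lambda_0, H) \Longrightarrow u_0'(x) = -x\, u_0(x) \Longrightarrow u_0(x) = K e^{-x^2/2}.$$

Thus

(6.11) $$\lambda_0 = 1, \qquad \operatorname{Eigen}(1, H) = \operatorname{span}\bigl(e^{-x^2/2}\bigr).$$

Since $e^{-x^2/2}$ spans the null space of $a$, acting on $C^\infty(\mathbb{R})$, and since each nonzero space $\operatorname{Eigen}(\lambda_j, H)$ is mapped by some power of $a$ to this null space, it follows that, for $n = 1$,

(6.12) $$\operatorname{spec} H = \{2k + 1 : k = 0, 1, 2, \dots\}$$

and

(6.13) $$\operatorname{Eigen}(2k + 1, H) = \operatorname{span}\Bigl(\Bigl(\frac{\partial}{\partial x} - x\Bigr)^k e^{-x^2/2}\Bigr).$$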
x 


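One also writes

(6.14) $$\Bigl(\frac{\partial}{\partial x} - x\Bigr)^k e^{-x^2/2} = (-1)^k H_k(x)\, e^{-x^2/2},$$

where $H_k(x)$ are the Hermite polynomials, given by

(6.15) $$H_k(x) = (-1)^k e^{x^2} \Bigl(\frac{d}{dx}\Bigr)^k e^{-x^2} = \sum_{j=0}^{[k/2]} (-1)^j \frac{k!}{j!\,(k - 2j)!}\, (2x)^{k-2j}.$$

We define eigenfunctions of $H$:

(6.16) $$h_k(x) = c_k \Bigl(\frac{\partial}{\partial x} - x\Bigr)^k e^{-x^2/2} = (-1)^k c_k H_k(x)\, e^{-x^2/2},$$

where $c_k$ is the unique positive number such that $\|h_k\|_{L^2(\mathbb{R})} = 1$. To evaluate $c_k$, note that

(6.17) $$\|a^+ h_k\|^2_{L^2} = (aa^+ h_k, h_k)_{L^2} = 2(k + 1)\, \|h_k\|^2_{L^2}.$$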


Thus, if $\|h_k\|_{L^2} = 1$, in order for $h_{k+1} = \gamma_k a^+ h_k$ to have unit norm, we need $\gamma_k = (2k + 2)^{-1/2}$. Hence

(6.18) $$c_k = \bigl[\pi^{1/2}\, 2^k\, (k!)\bigr]^{-1/2}.$$
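The normalization (6.18) and the spectrum (6.12) can be checked numerically. The Python sketch below builds $c_k H_k(x) e^{-x^2/2}$ from the standard three-term recurrence $H_{j+1} = 2xH_j - 2jH_{j-1}$ (a well-known identity, not derived in the text), approximates the $L^2$ inner products by the trapezoid rule, and applies $H = -d^2/dx^2 + x^2$ by finite differences. The function names, the integration window $[-12, 12]$, and the step sizes are illustrative choices; the overall sign $(-1)^k$ of the eigenfunction plays no role in these checks.

```python
import math


def hermite(k, x):
    """Hermite polynomial H_k(x) via H_{j+1} = 2x H_j - 2j H_{j-1}."""
    prev, cur = 0.0, 1.0  # H_{-1} := 0, H_0 = 1
    for j in range(k):
        prev, cur = cur, 2.0 * x * cur - 2.0 * j * prev
    return cur


def c(k):
    """Normalizing constant (6.18): [pi^{1/2} 2^k k!]^{-1/2}."""
    return (math.sqrt(math.pi) * 2.0 ** k * math.factorial(k)) ** -0.5


def hermite_function(k, x):
    """c_k H_k(x) exp(-x^2/2), a unit-norm eigenfunction of H."""
    return c(k) * hermite(k, x) * math.exp(-0.5 * x * x)


def inner(j, k, lo=-12.0, hi=12.0, steps=4800):
    """Trapezoid-rule approximation of the L^2(R) inner product."""
    dx = (hi - lo) / steps
    f = lambda x: hermite_function(j, x) * hermite_function(k, x)
    total = 0.5 * (f(lo) + f(hi))
    for i in range(1, steps):
        total += f(lo + i * dx)
    return total * dx


def apply_H(k, x, eps=1e-4):
    """(-d^2/dx^2 + x^2) applied to the k-th Hermite function, by central differences."""
    f = lambda y: hermite_function(k, y)
    second = (f(x + eps) - 2.0 * f(x) + f(x - eps)) / eps ** 2
    return -second + x * x * f(x)


if __name__ == "__main__":
    print(inner(2, 2), inner(1, 3))
    print(apply_H(3, 0.7), 7 * hermite_function(3, 0.7))
```

The last line compares $Hh_3$ with $(2\cdot 3 + 1)h_3$ at a sample point, in line with (6.12).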


Of course, given the analysis above of $H$ on $L^2(\mathbb{R})$, then for $H = -\Delta + |x|^2$ on $L^2(\mathbb{R}^n)$, we have

(6.19) $$\operatorname{spec} H = \{2k + n : k = 0, 1, 2, \dots\}.$$

In this case, an orthonormal basis of $\operatorname{Eigen}(2k + n, H)$ is given by

(6.20) $$c_{k_1} \cdots c_{k_n}\, H_{k_1}(x_1) \cdots H_{k_n}(x_n)\, e^{-|x|^2/2}, \qquad k_1 + \cdots + k_n = k,$$

where $k_\nu \in \{0, \dots, k\}$, the $H_{k_\nu}(x_\nu)$ are the Hermite polynomials, and the $c_{k_\nu}$ are given by (6.18). The dimension of this eigenspace is the same as the dimension of the space of homogeneous polynomials of degree $k$ in $n$ variables.

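We now want to derive a formula for the semigroup $e^{-tH}$, $t > 0$, called the Hermite semigroup. Again it suffices to treat the case $n = 1$. To some degree paralleling the analysis of the eigenfunctions above, we can produce this formula via some commutator identities, involving the operators

(6.21) $$X = D^2 = -\partial_x^2, \qquad Y = x^2, \qquad Z = x\partial_x + \partial_x x = 2x\partial_x + 1.$$

Note that $H = X + Y$. The commutator identities are

(6.22) $$[X, Y] = -2Z, \qquad [X, Z] = 4X, \qquad [Y, Z] = -4Y.$$

Thus, $X$, $Y$, and $Z$ span a three-dimensional, real Lie algebra. This is isomorphic to $sl(2, \mathbb{R})$, the Lie algebra consisting of $2 \times 2$ real matrices of trace zero, spanned by

(6.23) $$n_+ = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \qquad n_- = \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}, \qquad \alpha = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}.$$

We have

(6.24) $$[n_+, n_-] = \alpha, \qquad [n_+, \alpha] = -2n_+, \qquad [n_-, \alpha] = 2n_-.$$

The isomorphism is implemented by

(6.25) $$X \mapsto 2n_+, \qquad Y \mapsto 2n_-, \qquad Z \mapsto -2\alpha.$$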


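Now we will be able to write

(6.26) $$e^{-t(2n_+ + 2n_-)} = e^{-2\sigma_1(t) n_+}\, e^{-2\sigma_3(t)\alpha}\, e^{-2\sigma_2(t) n_-},$$

as we will see shortly, and, once this is accomplished, we will be motivated to suspect that also

(6.27) $$e^{-tH} = e^{-\sigma_1(t) X}\, e^{\sigma_3(t) Z}\, e^{-\sigma_2(t) Y}.$$

To achieve (6.26), write

(6.28) $$e^{-2\sigma_1 n_+}\, e^{-2\sigma_3 \alpha}\, e^{-2\sigma_2 n_-} = \begin{pmatrix} 1 & x \\ 0 & 1 \end{pmatrix} \begin{pmatrix} y & 0 \\ 0 & y^{-1} \end{pmatrix} \begin{pmatrix} 1 & 0 \\ z & 1 \end{pmatrix}, \qquad x = -2\sigma_1,\ y = e^{-2\sigma_3},\ z = -2\sigma_2,$$

and

(6.29) $$e^{-2t(n_+ + n_-)} = \begin{pmatrix} \cosh 2t & -\sinh 2t \\ -\sinh 2t & \cosh 2t \end{pmatrix} = \begin{pmatrix} u & v \\ v & u \end{pmatrix}.$$

Then (6.26) holds if and only if

(6.30) $$y = \frac{1}{u} = \frac{1}{\cosh 2t}, \qquad x = z = \frac{v}{u} = -\tanh 2t,$$

so the quantities $\sigma_j(t)$ are given by

(6.31) $$\sigma_1(t) = \sigma_2(t) = \tfrac{1}{2} \tanh 2t, \qquad e^{2\sigma_3(t)} = \cosh 2t.$$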
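The matrix factorization behind (6.26) is easy to confirm numerically. The following Python sketch multiplies out the three $2 \times 2$ factors, using $\sigma_1 = \sigma_2 = \tfrac{1}{2}\tanh 2t$ and $e^{2\sigma_3} = \cosh 2t$ as in (6.31), and compares the product with the closed form of $e^{-2t(n_+ + n_-)}$ in (6.29). It assumes the middle factor is $e^{-2\sigma_3\alpha} = \operatorname{diag}(e^{-2\sigma_3}, e^{2\sigma_3})$; the helper names are illustrative.

```python
import math


def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def exp_minus_2t_sum(t):
    """Closed form (6.29) of exp(-2t(n_+ + n_-))."""
    ch, sh = math.cosh(2 * t), math.sinh(2 * t)
    return [[ch, -sh], [-sh, ch]]


def factored(t):
    """exp(-2 s1 n_+) exp(-2 s3 alpha) exp(-2 s2 n_-) with the sigmas of (6.31)."""
    s1 = s2 = 0.5 * math.tanh(2 * t)
    s3 = 0.5 * math.log(math.cosh(2 * t))
    upper = [[1.0, -2 * s1], [0.0, 1.0]]  # n_+ squares to zero, so exp is I - 2 s1 n_+
    diag = [[math.exp(-2 * s3), 0.0], [0.0, math.exp(2 * s3)]]
    lower = [[1.0, 0.0], [-2 * s2, 1.0]]  # likewise for n_-
    return matmul(matmul(upper, diag), lower)


def max_error(t):
    """Largest entrywise difference between the two sides of (6.26)."""
    a, b = exp_minus_2t_sum(t), factored(t)
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))


if __name__ == "__main__":
    for t in (0.1, 0.5, 1.3):
        print(t, max_error(t))
```

The error is at rounding level for each sample value of $t$, which is all the sketch is meant to show.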


Now we can compute the right side of (6.27). Note that

(6.32) $$e^{-\sigma_1 X} u(x) = (4\pi\sigma_1)^{-1/2} \int e^{-(x-y)^2/4\sigma_1}\, u(y)\, dy, \qquad e^{-\sigma_2 Y} u(x) = e^{-\sigma_2 x^2} u(x), \qquad e^{\sigma_3 Z} u(x) = e^{\sigma_3}\, u(e^{2\sigma_3} x).$$

Upon composing these operators we find that, for $n = 1$,

(6.33) $$e^{-tH} u(x) = \int K_t(x, y)\, u(y)\, dy,$$

with

(6.34) $$K_t(x, y) = \frac{\exp\bigl\{\bigl[-\tfrac{1}{2}(\cosh 2t)(x^2 + y^2) + xy\bigr] / \sinh 2t\bigr\}}{(2\pi \sinh 2t)^{1/2}}.$$

This is known as Mehler's formula for the Hermite semigroup. Clearly, for general $n$, we have

(6.35) $$e^{-tH} u(x) = \int K_n(t, x, y)\, u(y)\, dy,$$

with

(6.36) $$K_n(t, x, y) = K_t(x_1, y_1) \cdots K_t(x_n, y_n).$$
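Mehler's formula (6.34) can be compared with the spectral expansion of the same kernel, $\sum_k e^{-(2k+1)t} h_k(x) h_k(y)$, which follows from (6.12) and the orthonormal basis $\{h_k\}$. The Python sketch below does this for $n = 1$. To avoid overflow it generates the normalized Hermite functions by the recurrence $h_{k+1} = \sqrt{2/(k+1)}\, x\, h_k - \sqrt{k/(k+1)}\, h_{k-1}$, which is the usual Hermite recurrence rescaled by the constants (6.18). The truncation at 60 terms and the function names are illustrative assumptions.

```python
import math


def mehler_kernel(t, x, y):
    """Right side of (6.34)."""
    sh, ch = math.sinh(2 * t), math.cosh(2 * t)
    return math.exp((-0.5 * ch * (x * x + y * y) + x * y) / sh) / math.sqrt(2 * math.pi * sh)


def hermite_functions(count, x):
    """Values h_0(x), ..., h_{count-1}(x) of the normalized Hermite functions."""
    vals = [math.pi ** -0.25 * math.exp(-0.5 * x * x)]
    if count > 1:
        vals.append(math.sqrt(2.0) * x * vals[0])
    for k in range(1, count - 1):
        vals.append(math.sqrt(2.0 / (k + 1)) * x * vals[k]
                    - math.sqrt(k / (k + 1)) * vals[k - 1])
    return vals[:count]


def spectral_kernel(t, x, y, terms=60):
    """Truncated sum of exp(-(2k+1)t) h_k(x) h_k(y) over k < terms."""
    hx, hy = hermite_functions(terms, x), hermite_functions(terms, y)
    return sum(math.exp(-(2 * k + 1) * t) * hx[k] * hy[k] for k in range(terms))


if __name__ == "__main__":
    t, x, y = 0.5, 0.3, -0.8
    print(mehler_kernel(t, x, y), spectral_kernel(t, x, y))
```

For moderate $t$ the series converges geometrically, so a few dozen terms already agree with the closed form to many digits; for very small $t$ more terms would be needed.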


The idea behind passing from (6.26) to (6.27) is that the Lie algebra homomorphism defined by (6.25) should give rise to a Lie group homomorphism from (perhaps a covering group $G$ of) $SL(2, \mathbb{R})$ into a group of operators. Since this involves an infinite-dimensional representation of $G$ (not necessarily by bounded operators here, since $e^{-tH}$ is bounded only for $t \ge 0$), there are analytical problems that must be overcome to justify this reasoning. Rather than take the space to develop such analysis here, we will instead just give a direct justification of (6.33)-(6.34).

Indeed, let $v(t, x)$ denote the right side of (6.33), with $u \in L^2(\mathbb{R})$ given. The rapid decrease of $K_t(x, y)$ as $|x| + |y| \to \infty$, for $t > 0$, makes it easy to show that

(6.37) $$u \in L^2(\mathbb{R}) \Longrightarrow v \in C^\infty\bigl((0, \infty), \mathcal{S}(\mathbb{R})\bigr).$$

Also, it is routine to verify that

(6.38) $$\frac{\partial v}{\partial t} = -Hv.$$

Simple estimates yielding uniqueness then imply that, for each $s > 0$,

(6.39) $$v(t + s, \cdot) = e^{-tH} v(s, \cdot).$$

Indeed, if $w(t, \cdot)$ denotes the difference between the two sides of (6.39), then we have $w(0) = 0$, $w \in C(\mathbb{R}^+, \mathcal{D}(H))$, $\partial w/\partial t \in C(\mathbb{R}^+, L^2(\mathbb{R}))$, and

$$\frac{d}{dt} \|w(t)\|^2_{L^2} = -2(Hw, w) \le 0,$$

so $w(t) = 0$, for all $t \ge 0$.
Finally, as $t \searrow 0$, we see from (6.31) that each $\sigma_j(t) \searrow 0$. Since $v(t, x)$ is also given by the right side of (6.27), we conclude that

(6.40) $$v(t, \cdot) \to u \text{ in } L^2(\mathbb{R}), \quad \text{as } t \searrow 0.$$

Thus we can let $s \searrow 0$ in (6.39), obtaining a complete proof that $e^{-tH}u$ is given by (6.33) when $n = 1$.

It is useful to write down the formula for $e^{-tH}$ using the Weyl calculus, introduced in §14 of Chap. 7. We recall that it associates to $a(x, \xi)$ the operator

(6.41) $$a(X, D)u = (2\pi)^{-n} \int \hat{a}(q, p)\, e^{i(q\cdot X + p\cdot D)} u(x)\, dq\, dp = (2\pi)^{-n} \int a\Bigl(\frac{x + y}{2}, \xi\Bigr) e^{i(x-y)\cdot\xi}\, u(y)\, dy\, d\xi.$$

In other words, the operator $a(X, D)$ has integral kernel $K_a(x, y)$, for which

$$a(X, D)u(x) = \int K_a(x, y)\, u(y)\, dy,$$

given by

$$K_a(x, y) = (2\pi)^{-n} \int a\Bigl(\frac{x + y}{2}, \xi\Bigr) e^{i(x-y)\cdot\xi}\, d\xi.$$

Recovery of $a(x, \xi)$ from $K_a(x, y)$ is an exercise in Fourier analysis. When it is applied to the formulas (6.33)-(6.36), this exercise involves computing a Gaussian integral, and we obtain the formula

(6.42) $$e^{-tH} = h_t(X, D)$$

on $L^2(\mathbb{R}^n)$, with

(6.43) $$h_t(x, \xi) = (\cosh t)^{-n}\, e^{-(\tanh t)(|x|^2 + |\xi|^2)}.$$


It is interesting that this formula, while equivalent to (6.33)-(6.36), has a simpler and more symmetrical appearance.

In fact, the formula (6.43) was derived in §15 of Chap. 7, by a different method, which we briefly recall here. For reasons of symmetry, involving the identity (14.19), one can write

(6.44) $$h_t(x, \xi) = g(t, Q), \qquad Q(x, \xi) = |x|^2 + |\xi|^2.$$

Note that (6.42) gives $\partial_t h_t(X, D) = -H h_t(X, D)$. Now the composition formula for the Weyl calculus implies that $h_t(x, \xi)$ satisfies the following evolution equation:

(6.45) $$\frac{\partial}{\partial t} h_t(x, \xi) = -(Q \circ h_t)(x, \xi) = -Q(x, \xi)\, h_t(x, \xi) - \frac{1}{2}\{Q, h_t\}_2(x, \xi) = -(|x|^2 + |\xi|^2)\, h_t(x, \xi) + \frac{1}{4} \sum_k \bigl(\partial_{x_k}^2 + \partial_{\xi_k}^2\bigr) h_t(x, \xi).$$

Given (6.44), we have for $g(t, Q)$ the equation

(6.46) $$\frac{\partial g}{\partial t} = -Qg + Q\, \frac{\partial^2 g}{\partial Q^2} + n\, \frac{\partial g}{\partial Q}.$$

It is easy to verify that (6.43) solves this evolution equation, with $h_0(x, \xi) = 1$.
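The verification just mentioned can be spot-checked numerically. Substituting $h_t = g(t, Q)$ with $Q = |x|^2 + |\xi|^2$ into the last line of (6.45) gives $\partial_t g = -Qg + Q\,\partial_Q^2 g + n\,\partial_Q g$, since $\sum_k(\partial_{x_k}^2 + \partial_{\xi_k}^2)\, g(Q) = 4Q g'' + 4n g'$. The Python sketch below evaluates the residual of this equation for $g(t, Q) = (\cosh t)^{-n} e^{-(\tanh t) Q}$ by central differences. The step size and function names are illustrative choices.

```python
import math


def g(t, q, n):
    """Symbol (6.43) written as a function of t and Q = |x|^2 + |xi|^2."""
    return math.cosh(t) ** (-n) * math.exp(-math.tanh(t) * q)


def residual(t, q, n, eps=1e-4):
    """Finite-difference residual of  g_t = -Q g + Q g_QQ + n g_Q."""
    g_t = (g(t + eps, q, n) - g(t - eps, q, n)) / (2 * eps)
    g_q = (g(t, q + eps, n) - g(t, q - eps, n)) / (2 * eps)
    g_qq = (g(t, q + eps, n) - 2 * g(t, q, n) + g(t, q - eps, n)) / eps ** 2
    return g_t - (-q * g(t, q, n) + q * g_qq + n * g_q)


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(n, residual(0.7, 1.3, n), g(0.0, 1.3, n))
```

The residual is at the level of the discretization error, and $g(0, Q) = 1$ matches the initial condition $h_0 = 1$.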
We can obtain a formula for

(6.47) $$e^{-tQ(X, D)} = h_t^Q(X, D),$$

for a general positive-definite quadratic form $Q(x, \xi)$. First, in the case

(6.48) $$Q(x, \xi) = \sum_{j=1}^{n} \mu_j (x_j^2 + \xi_j^2), \qquad \mu_j > 0,$$

it follows easily from (6.43) and multiplicativity, as in (6.36), that

(6.49) $$h_t^Q(x, \xi) = \prod_{j=1}^{n} (\cosh t\mu_j)^{-1} \exp\Bigl\{-\sum_{j=1}^{n} (\tanh t\mu_j)(x_j^2 + \xi_j^2)\Bigr\}.$$

Now any positive quadratic form $Q(x, \xi)$ can be put in the form (6.48) via a linear symplectic transformation, so to get the general formula we need only rewrite (6.49) in a symplectically invariant fashion. This is accomplished using the "Hamilton map" $F_Q$, a skew-symmetric transformation on $\mathbb{R}^{2n}$ defined by

(6.50) $$Q(u, v) = \sigma(u, F_Q v), \qquad u, v \in \mathbb{R}^{2n},$$

where $Q(u, v)$ is the bilinear form polarizing $Q$, and $\sigma$ is the symplectic form on $\mathbb{R}^{2n}$; $\sigma(u, v) = x\cdot\xi' - x'\cdot\xi$ if $u = (x, \xi)$, $v = (x', \xi')$. When $Q$ has the form (6.48), $F_Q$ is a sum of $2 \times 2$ blocks $\begin{pmatrix} 0 & -\mu_j \\ \mu_j & 0 \end{pmatrix}$, and we have

(6.51) $$\prod_{j=1}^{n} (\cosh t\mu_j)^{-1} = \bigl(\det \cosh itF_Q\bigr)^{-1/2}.$$

Passing from $F_Q$ to

(6.52) $$A_Q = (-F_Q^2)^{1/2},$$

the unique positive-definite square root, means passing to blocks $\begin{pmatrix} \mu_j & 0 \\ 0 & \mu_j \end{pmatrix}$, and when $Q$ has the form (6.48), then

(6.53) $$\sum_{j=1}^{n} (\tanh t\mu_j)(x_j^2 + \xi_j^2) = tQ\bigl(\vartheta(tA_Q)\zeta, \zeta\bigr),$$

where $\zeta = (x, \xi)$ and

(6.54) $$\vartheta(t) = \frac{\tanh t}{t}.$$

Thus the general formula for (6.47) is

(6.55) $$h_t^Q(x, \xi) = \bigl(\det \cosh tA_Q\bigr)^{-1/2}\, e^{-tQ(\vartheta(tA_Q)\zeta, \zeta)}.$$


Exercises 


1. Define an unbounded operator $A$ on $L^2(\mathbb{R})$ by

$$\mathcal{D}(A) = \{u \in L^2(\mathbb{R}) : Du \in L^2(\mathbb{R}),\ xu \in L^2(\mathbb{R})\}, \qquad Au = Du - ixu.$$

Show that $A$ is closed and that the self-adjoint operator $H$ satisfies

$$H = A^*A + I = AA^* - I.$$

(Hint: Note Exercises 5-7 of §2.)




2. If $H_k(x)$ are the Hermite polynomials, show that there is the generating function identity

$$\sum_{k=0}^{\infty} \frac{1}{k!} H_k(x)\, s^k = e^{2xs - s^2}.$$

(Hint: Use the first identity in (6.15).)
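3. Show that Mehler's formula (6.34) is equivalent to the identity

$$\sum_{j=0}^{\infty} h_j(x) h_j(y)\, s^j = \pi^{-1/2} (1 - s^2)^{-1/2} \exp\bigl\{(1 - s^2)^{-1} \bigl[2xys - (x^2 + y^2)s^2\bigr]\bigr\} \cdot e^{-(x^2 + y^2)/2},$$

for $0 \le s < 1$. Deduce that

$$\sum_{k=0}^{\infty} \frac{s^k}{2^k k!} H_k(x)^2 = (1 - s^2)^{-1/2}\, e^{2sx^2/(1+s)}, \qquad |s| < 1.$$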


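4. Using

$$H^{-s} = \frac{1}{\Gamma(s)} \int_0^\infty e^{-tH}\, t^{s-1}\, dt, \qquad \operatorname{Re} s > 0,$$

find the integral kernel $A_s(x, y)$ such that

$$H^{-s} u(x) = \int A_s(x, y)\, u(y)\, dy.$$

Writing $\operatorname{Tr} H^{-s} = \int A_s(x, x)\, dx$, $\operatorname{Re} s > 1$, $n = 1$, show that

$$\zeta(s) = \frac{1}{\Gamma(s)} \int_0^\infty \frac{y^{s-1}}{e^y - 1}\, dy.$$

See [Ing], pp. 41-44, for a derivation of the functional equation for the Riemann zeta function, using this formula.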
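5. Let $H_\omega = -d^2/dx^2 + \omega^2 x^2$. Show that $e^{-tH_\omega}$ has integral kernel

$$K_t^\omega(x, y) = (4\pi t)^{-1/2}\, \gamma(2\omega t)^{1/2}\, e^{-\gamma(2\omega t)\left[(\cosh 2\omega t)(x^2 + y^2) - 2xy\right]/4t},$$

where

$$\gamma(z) = \frac{z}{\sinh z}.$$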

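6. Consider the operator

$$Q(X, D) = -\Bigl(\frac{\partial}{\partial x_1} - i\omega x_2\Bigr)^2 - \Bigl(\frac{\partial}{\partial x_2} + i\omega x_1\Bigr)^2.$$

Note that $Q(x, \xi)$ is nonnegative, but not definite. Study the integral kernel $K_t^Q(x, y)$ of $e^{-tQ(X, D)}$. Show that

$$K_t^Q(x, 0) = (4\pi t)^{-1}\, \gamma(2\omega t)\, e^{-T(2\omega t)|x|^2/4t},$$

where

$$T(z) = z \coth z.$$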


7. Let $(\omega_{jk})$ be an invertible, $n \times n$, skew-symmetric matrix of real numbers (so $n$ must be even). Suppose


Evaluate the integral kernel $K_t^Q(x, y)$, particularly at $y = 0$.
8. In terms of the operators $a$, $a^+$ given by (6.1) and the basis of $L^2(\mathbb{R})$ given by (6.16)-(6.18), show that

$$a^+ h_k = \sqrt{2k + 2}\; h_{k+1}, \qquad a h_k = \sqrt{2k}\; h_{k-1}.$$


7. The quantum Coulomb problem 
In this section we examine the operator

(7.1) $$Hu = -\Delta u - K|x|^{-1} u,$$

acting on functions on $\mathbb{R}^3$. Here, $K$ is a positive constant.

This provides a quantum mechanical description of the Coulomb force between 
two charged particles. It is the first step toward a quantum mechanical description 
of the hydrogen atom, and it provides a decent approximation to the observed 
behavior of such an atom, though it leaves out a number of features. The most 
important omitted feature is the spin of the electron (and of the nucleus). Giving 
rise to further small corrections are the nonzero size of the proton, and relativis- 
tic effects, which confront one with great subtleties since relativity forces one 
to treat the electromagnetic field quantum mechanically. We refer to texts on 
quantum physics, such as [Mes], [Ser], [BLP], and [IZ], for work on these more 
sophisticated models of the hydrogen atom. 

We want to define a self-adjoint operator via the Friedrichs method. Thus we want to work with a Hilbert space

(7.2) $$\mathcal{H} = \Bigl\{u \in L^2(\mathbb{R}^3) : \nabla u \in L^2(\mathbb{R}^3),\ \int |x|^{-1} |u(x)|^2\, dx < \infty\Bigr\},$$

with inner product

(7.3) $$(u, v)_{\mathcal{H}} = (\nabla u, \nabla v)_{L^2} + A(u, v)_{L^2} - K \int |x|^{-1} u(x) \overline{v(x)}\, dx,$$

where $A$ is a sufficiently large, positive constant. We must first show that $A$ can be picked to make this inner product positive-definite. In fact, we have the following:

Lemma 7.1. For all $\varepsilon \in (0, 1]$, there exists $C(\varepsilon) < \infty$ such that

(7.4) $$\int |x|^{-1} |u(x)|^2\, dx \le \varepsilon \|\nabla u\|^2_{L^2} + C(\varepsilon) \|u\|^2_{L^2},$$

for all $u \in H^1(\mathbb{R}^3)$.

Proof. Here and below we will use the inclusion

(7.5) $$H^s(\mathbb{R}^n) \subset L^p(\mathbb{R}^n), \qquad \forall\, p \in \Bigl[2, \frac{2n}{n - 2s}\Bigr), \quad 0 \le s < \frac{n}{2},$$

from (2.42) of Chap. 4. In Chap. 13 we will establish the sharper result that $H^s(\mathbb{R}^n) \subset L^{2n/(n-2s)}(\mathbb{R}^n)$; for example, $H^1(\mathbb{R}^3) \subset L^6(\mathbb{R}^3)$. We will also cite this stronger result in some arguments below, though that could be avoided.

We also use the fact that (if $B = \{|x| < 1\}$ and $\chi_B(x)$ is its characteristic function),

$$\chi_B V \in L^q(\mathbb{R}^3), \quad \text{for all } q < 3.$$

Here and below we will use $V(x) = |x|^{-1}$. Thus the left side of (7.4) is bounded by

(7.6) $$\|\chi_B V\|_{L^q}\, \|u\|^2_{L^{2q'}} + \|u\|^2_{L^2} \le C\|u\|^2_{H^\sigma(\mathbb{R}^3)} + \|u\|^2_{L^2(\mathbb{R}^3)},$$

where we can take any $q' > 3/2$; take $q' \in (3/2, 3)$. Then (7.6) holds for some $\sigma < 1$, for which $L^{2q'}(\mathbb{R}^3) \supset H^\sigma(\mathbb{R}^3)$. From this, (7.4) follows immediately.


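Thus the Hilbert space $\mathcal{H}$ in (7.2) is simply $H^1(\mathbb{R}^3)$, and we see that indeed, for some $A > 0$, (7.3) defines an inner product equivalent to the standard one on $H^1(\mathbb{R}^3)$. The Friedrichs method then defines a positive, self-adjoint operator $H + AI$, for which

(7.7) $$\mathcal{D}\bigl((H + AI)^{1/2}\bigr) = H^1(\mathbb{R}^3).$$

Then

(7.8) $$\mathcal{D}(H) = \{u \in H^1(\mathbb{R}^3) : -\Delta u - K|x|^{-1}u \in L^2(\mathbb{R}^3)\},$$

where $-\Delta u - K|x|^{-1}u$ is a priori regarded as an element of $H^{-1}(\mathbb{R}^3)$ if $u \in H^1(\mathbb{R}^3)$. Since $H^2(\mathbb{R}^3) \subset L^\infty(\mathbb{R}^3)$, we have

(7.9) $$u \in H^2(\mathbb{R}^3) \Longrightarrow |x|^{-1}u \in L^2(\mathbb{R}^3),$$

so

(7.10) $$\mathcal{D}(H) \supset H^2(\mathbb{R}^3).$$

Indeed, we have:

Proposition 7.2. For the self-adjoint extension $H$ of $-\Delta - K|x|^{-1}$ defined above,

(7.11) $$\mathcal{D}(H) = H^2(\mathbb{R}^3).$$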

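Proof. Pick $\lambda$ in the resolvent set of $H$; for instance, $\lambda \in \mathbb{C} \setminus \mathbb{R}$. If $u \in \mathcal{D}(H)$ and $(H - \lambda)u = f \in L^2(\mathbb{R}^3)$, we have

(7.12) $$u - KR_\lambda Vu = R_\lambda f = g_\lambda,$$

where $V(x) = |x|^{-1}$ and $R_\lambda = (-\Delta - \lambda)^{-1}$. Now the operator of multiplication by $V(x) = |x|^{-1}$ has the property

(7.13) $$M_V : H^1(\mathbb{R}^3) \longrightarrow L^{2-\varepsilon}(\mathbb{R}^3),$$

for all $\varepsilon > 0$, since $H^1(\mathbb{R}^3) \subset L^6(\mathbb{R}^3) \cap L^2(\mathbb{R}^3)$ and $V \in L^{3-\varepsilon}$ on $|x| < 1$. Hence

$$M_V : H^1(\mathbb{R}^3) \longrightarrow H^{-\varepsilon}(\mathbb{R}^3),$$

for all $\varepsilon > 0$. Let us apply this to (7.12). We know that $u \in \mathcal{D}(H) \subset \mathcal{D}(H^{1/2}) = H^1(\mathbb{R}^3)$, so $KR_\lambda Vu \in H^{2-\varepsilon}(\mathbb{R}^3)$. Thus $u \in H^{2-\varepsilon}(\mathbb{R}^3)$, for all $\varepsilon > 0$. But, for $\varepsilon > 0$ small enough,

(7.14) $$M_V : H^{2-\varepsilon}(\mathbb{R}^3) \longrightarrow L^2(\mathbb{R}^3),$$

so then $u = KR_\lambda(Vu) + R_\lambda f \in H^2(\mathbb{R}^3)$. This proves that $\mathcal{D}(H) \subset H^2(\mathbb{R}^3)$ and gives (7.11).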


Since $H$ is self-adjoint, its spectrum is a subset of the real axis, $(-\infty, \infty)$. We next show that there is only point spectrum in $(-\infty, 0)$.

Proposition 7.3. The part of $\operatorname{spec} H$ lying in $\mathbb{C} \setminus [0, \infty)$ is a bounded, discrete subset of $(-\infty, 0)$, consisting of eigenvalues of finite multiplicity and having at most $\{0\}$ as an accumulation point.

Proof. Consider the equation $(H - \lambda)u = f \in L^2(\mathbb{R}^3)$, that is,

(7.15) $$(-\Delta - \lambda)u - KVu = f,$$

with $V(x) = |x|^{-1}$ as before. Applying $R_\lambda = (-\Delta - \lambda)^{-1}$ to both sides, we again obtain (7.12):

(7.16) $$(I - KR_\lambda M_V)u = g_\lambda = R_\lambda f.$$

Note that $R_\lambda$ is a holomorphic function of $\lambda \in \mathbb{C} \setminus [0, \infty)$, with values in $\mathcal{L}(L^2(\mathbb{R}^3), H^2(\mathbb{R}^3))$. A key result in the analysis of (7.16) is the following:

Lemma 7.4. For $\lambda \in \mathbb{C} \setminus [0, \infty)$,

(7.17) $$R_\lambda M_V \in \mathcal{K}(L^2(\mathbb{R}^3)),$$

where $\mathcal{K}$ is the space of compact operators.

We will establish this via the following basic tool. For $\lambda \in \mathbb{C} \setminus [0, \infty)$, $\varphi \in C_0(\mathbb{R}^3)$, the space of continuous functions vanishing at infinity, we have

(7.18) $$M_\varphi R_\lambda \in \mathcal{K}(L^2) \quad \text{and} \quad R_\lambda M_\varphi \in \mathcal{K}(L^2).$$

To see this, note that, for $\varphi \in C_0^\infty(\mathbb{R}^3)$, the first inclusion in (7.18) follows from Rellich's theorem. Then this inclusion holds for uniform limits of such $\varphi$, hence for $\varphi \in C_0(\mathbb{R}^3)$. Taking adjoints yields the rest of (7.18).

Now, to establish (7.17), write

(7.19) $$V = V_1 + V_2,$$

where $V_1 = \psi V$, $\psi \in C_0^\infty(\mathbb{R}^3)$, $\psi(x) = 1$ for $|x| \le 1$. Then $V_2 \in C_0(\mathbb{R}^3)$, so $R_\lambda M_{V_2} \in \mathcal{K}$. We have $V_1 \in L^q(\mathbb{R}^3)$, for all $q \in [1, 3)$, so, taking $q = 2$, we have

(7.20) $$M_{V_1} : L^2(\mathbb{R}^3) \longrightarrow L^1(\mathbb{R}^3) \subset H^{-3/2-\varepsilon}(\mathbb{R}^3),$$

for all $\varepsilon > 0$, hence

(7.21) $$R_\lambda M_{V_1} : L^2(\mathbb{R}^3) \longrightarrow H^{1/2-\varepsilon}(\mathbb{R}^3) \subset L^2(\mathbb{R}^3).$$

Given $V_1$ supported on a ball $B_R$, the operator norm in (7.21) is bounded by a constant times $\|V_1\|_{L^2}$. You can approximate $V_1$ in $L^2$-norm by a sequence $w_j \in C_0^\infty(\mathbb{R}^3)$. It follows that $R_\lambda M_{V_1}$ is a norm limit of a sequence of compact operators on $L^2(\mathbb{R}^3)$, so it is also compact, and (7.17) is established.

The proof of Proposition 7.3 is finished by the following result, which can be found as Proposition 7.4 in Chap. 9.

Proposition 7.5. Let $O$ be a connected, open set in $\mathbb{C}$. Suppose $C(\lambda)$ is a compact-operator-valued holomorphic function of $\lambda \in O$. If $I - C(\lambda)$ is invertible at one point $p \in O$, then it is invertible except at most on a discrete set in $O$, and $(I - C(\lambda))^{-1}$ is meromorphic on $O$.

This applies to our situation, with $C(\lambda) = KR_\lambda M_V$; we know that $I - C(\lambda)$ is invertible for all $\lambda \in \mathbb{C} \setminus \mathbb{R}$ in this case.

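One approach to analyzing the negative eigenvalues of $H$ is to use polar coordinates. If $-K|x|^{-1}$ is replaced by any radial potential $V(|x|)$, the eigenvalue equation $Hu = -Eu$ becomes

(7.22) $$\frac{\partial^2 u}{\partial r^2} + \frac{2}{r} \frac{\partial u}{\partial r} + \frac{1}{r^2} \Delta_S u - V(r)u = Eu.$$

We can use separation of variables, writing $u(r\theta) = v(r)\varphi(\theta)$, where $\varphi$ is an eigenfunction of $\Delta_S$, the Laplace operator on $S^2$,

(7.23) $$\Delta_S \varphi = -\lambda\varphi, \qquad \lambda = \Bigl(k + \frac{1}{2}\Bigr)^2 - \frac{1}{4} = k^2 + k, \quad k = 0, 1, 2, \dots.$$

Then we obtain for $v(r)$ the ODE

(7.24) $$v''(r) + \frac{2}{r} v'(r) + f(r)v(r) = 0, \qquad f(r) = -E - \frac{\lambda}{r^2} - V(r).$$

One can eliminate the term involving $v'$ by setting

(7.25) $$w(r) = r\, v(r).$$

Then

(7.26) $$w''(r) + f(r)w(r) = 0.$$

For the Coulomb problem, this becomes

(7.27) $$w''(r) + \Bigl[-E + \frac{K}{r} - \frac{\lambda}{r^2}\Bigr] w(r) = 0.$$

Setting $W(z) = w(\beta z)$ with $\beta = 1/(2\sqrt{E})$, we obtain a form of Whittaker's equation:

(7.28) $$W''(z) + \Bigl[-\frac{1}{4} + \frac{\varkappa}{z} + \frac{\frac{1}{4} - \mu^2}{z^2}\Bigr] W(z) = 0,$$

with

(7.29) $$\varkappa = \frac{K}{2\sqrt{E}}, \qquad \mu^2 = \lambda + \frac{1}{4} = \Bigl(k + \frac{1}{2}\Bigr)^2.$$

This in turn can be converted to the confluent hypergeometric equation

(7.30) $$z\psi''(z) + (b - z)\psi'(z) - a\psi(z) = 0$$

upon setting

(7.31) $$W(z) = z^{\mu + 1/2}\, e^{-z/2}\, \psi(z),$$

with

(7.32) $$a = \mu - \varkappa + \frac{1}{2} = k + 1 - \frac{K}{2\sqrt{E}}, \qquad b = 2\mu + 1 = 2k + 2.$$

Note that $\psi$ and $v$ are related by

(7.33) $$v(r) = (2\sqrt{E})^{k+1}\, r^k\, e^{-\sqrt{E}\, r}\, \psi(2\sqrt{E}\, r).$$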

Looking at (7.28), we see that there are two independent solutions, one behaving roughly like $e^{-z/2}$ and the other like $e^{z/2}$, as $z \to +\infty$. Equivalently, (7.30) has two linearly independent solutions, a “good” one growing more slowly than exponentially and a “bad” one growing like $e^z$, as $z \to +\infty$. Of course, for a solution to give rise to an eigenfunction, we need $v \in L^2(\mathbb{R}^+, r^2\, dr)$, that is, $w \in L^2(\mathbb{R}^+, dr)$. We need to have simultaneously $w(z) \sim ce^{-z/2}$ (roughly) as $z \to +\infty$ and $w$ square integrable near $z = 0$. In view of (7.8), we also need $v' \in L^2(\mathbb{R}^+, r^2\, dr)$.

To examine the behavior near $z = 0$, note that the Euler equation associated with (7.28) is

(7.34) $$z^2 W''(z) + \Bigl(\frac{1}{4} - \mu^2\Bigr) W(z) = 0,$$

with solutions $z^{1/2+\mu}$ and $z^{1/2-\mu}$, i.e., $z^{k+1}$ and $z^{-k}$, $k = 0, 1, 2, \dots$. If $k = 0$, both are square integrable near 0, but for $k \ge 1$ only one is. Going to the confluent hypergeometric equation (7.30), we see that two linearly independent solutions behave respectively like $z^0$ and $z^{-2\mu} = z^{-2k-1}$ as $z \to 0$.

As a further comment on the case $k = 0$, note that a solution $W$ behaving like $z^0$ at $z = 0$ gives rise to $v(r) \sim C/r$ as $r \to 0$, with $C \ne 0$, hence $v'(r) \sim -C/r^2$. This is not square integrable near $r = 0$, with respect to $r^2\, dr$, so also this case does not produce an eigenfunction of $H$.

If $b \notin \{0, -1, -2, \dots\}$, which certainly holds here, the solution to (7.30) that is “good” near $z = 0$ is given by the confluent hypergeometric function

(7.35) $${}_1F_1(a; b; z) = \sum_{n=0}^{\infty} \frac{(a)_n}{(b)_n} \frac{z^n}{n!},$$

an entire function of $z$. Here, $(a)_n = a(a + 1) \cdots (a + n - 1)$; $(a)_0 = 1$. If also $a \notin \{0, -1, -2, \dots\}$, it can be shown that

(7.36) $${}_1F_1(a; b; z) \sim \frac{\Gamma(b)}{\Gamma(a)}\, e^z z^{a-b}, \qquad z \to +\infty.$$

See the exercises below for a proof of this. Thus the “good” solution near $z = 0$ is “bad” as $z \to +\infty$, unless $a$ is a nonpositive integer, say $a = -j$. In that case, as is clear from (7.35), ${}_1F_1(-j; b; z)$ is a polynomial in $z$, thus “good” as $z \to +\infty$. Thus the negative eigenvalues of $H$ are given by $-E$, with

(7.37) $$\frac{K}{2\sqrt{E}} = j + k + 1 = n,$$

that is, by

(7.38) $$E = \frac{K^2}{4n^2}, \qquad n = 1, 2, 3, \dots.$$
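The quantization argument can be illustrated by summing the series (7.35) directly. The Python sketch below evaluates ${}_1F_1$ term by term, shows that the series terminates when $a = -j$, compares a generic case with the asymptotic form (7.36), and checks that $E = K^2/4n^2$ makes the parameter $a$ of (7.32) equal to $k + 1 - n$. The function names, the 200-term cutoff, and the sample values are illustrative assumptions; the series summation is only adequate for moderate $z$.

```python
import math


def hyp1f1(a, b, z, terms=200):
    """Partial sum of the series (7.35) for 1F1(a; b; z)."""
    total, term = 0.0, 1.0
    for n in range(terms):
        total += term
        # ratio of consecutive terms: (a + n) / (b + n) * z / (n + 1)
        term *= (a + n) / (b + n) * z / (n + 1)
        if term == 0.0:  # happens exactly when a is a nonpositive integer
            break
    return total


def energy(K, n):
    """E = K^2 / (4 n^2) from (7.38); the eigenvalue of H is -E."""
    return K * K / (4.0 * n * n)


def a_parameter(K, E, k):
    """The parameter a = k + 1 - K / (2 sqrt(E)) from (7.32)."""
    return k + 1 - K / (2.0 * math.sqrt(E))


if __name__ == "__main__":
    # a = -2, b = 2: the series reduces to the polynomial 1 - z + z^2/6.
    print(hyp1f1(-2, 2, 3.0), 1 - 3.0 + 3.0 ** 2 / 6)
    # Generic a: ratio to the leading asymptotic term in (7.36) at z = 40.
    a, b, z = 0.5, 2.0, 40.0
    leading = math.gamma(b) / math.gamma(a) * math.exp(z) * z ** (a - b)
    print(hyp1f1(a, b, z) / leading)
    print(a_parameter(2.0, energy(2.0, 3), 1))
```

The ratio printed in the second line approaches 1 only slowly as $z$ grows, consistent with (7.36) being a leading-order statement.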


Note that, for each value of $n$, one can write $n = j + k + 1$ using $n$ choices of $k \in \{0, 1, 2, \dots, n - 1\}$. For each such $k$, the $(k^2 + k)$-eigenspace of $\Delta_S$ has dimension $2k + 1$, as established in Corollary 4.3. Thus the eigenvalue $-E = -K^2/4n^2$ of $H$ has multiplicity

(7.39) $$\sum_{k=0}^{n-1} (2k + 1) = n^2.$$
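The count in (7.39) can be spelled out by enumeration. This short Python sketch lists, for a given $n$, the pairs $(j, k)$ with $j + k + 1 = n$ and adds up the dimensions $2k + 1$ of the corresponding spherical harmonic spaces. The function names are illustrative.

```python
def decompositions(n):
    """All pairs (j, k) of nonnegative integers with j + k + 1 = n."""
    return [(n - k - 1, k) for k in range(n)]


def multiplicity(n):
    """Sum of 2k + 1 over the admissible k, as in (7.39)."""
    return sum(2 * k + 1 for _, k in decompositions(n))


if __name__ == "__main__":
    for n in range(1, 5):
        print(n, decompositions(n), multiplicity(n))
```

For $n = 3$, for example, the pairs are $(2, 0)$, $(1, 1)$, $(0, 2)$, with dimensions $1 + 3 + 5 = 9$.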


Let us denote by $V_n$ the $n^2$-dimensional eigenspace of $H$, associated to the eigenvalue $\lambda_n = -K^2/4n^2$.

The rotation group $SO(3)$ acts on each $V_n$, via

$$\rho(g) f(x) = f(g^{-1}x), \qquad g \in SO(3),\ x \in \mathbb{R}^3.$$

By the analysis leading to (7.39), this action on $V_n$ is not irreducible, but rather has $n$ irreducible components. This suggests that there is an extra symmetry, and indeed, as W. Pauli discovered early in the history of quantum mechanics, there is one, arising via the Lenz vector (briefly introduced in §16 of Chap. 1), which we proceed to define.

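The angular momentum vector $L = x \times p$, with $p$ replaced by the vector operator $(\partial/\partial x_1, \partial/\partial x_2, \partial/\partial x_3)$, commutes with $H$ as a consequence of the rotational invariance of $H$. The components of $L$ are

(7.40) $$L_\ell = x_j \frac{\partial}{\partial x_k} - x_k \frac{\partial}{\partial x_j},$$

where $(j, k, \ell)$ is a cyclic permutation of $(1, 2, 3)$. Then the Lenz vector is defined by

(7.41) $$B = \frac{1}{K}(L \times p - p \times L) - \frac{x}{r},$$

with components $B_j$, $1 \le j \le 3$, each of which is a second-order differential operator, given explicitly by

(7.42) $$B_j = \frac{1}{K}\bigl(L_k \partial_\ell + \partial_\ell L_k - L_\ell \partial_k - \partial_k L_\ell\bigr) - \frac{x_j}{r},$$

where $(j, k, \ell)$ is a cyclic permutation of $(1, 2, 3)$. A calculation gives

(7.43) $$[H, B_j] = 0,$$

in the sense that these operators commute on $C^\infty(\mathbb{R}^3 \setminus 0)$.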

It follows that if $u \in V_n$, then $B_j u$ is annihilated by $H - \lambda_n$, on $\mathbb{R}^3 \setminus 0$. Now, we have just gone through an argument designed to glean from all functions that are so annihilated, those that are actually eigenfunctions of $H$. In view of that, it is important to establish the next lemma.

Lemma 7.6. We have

(7.44) $$B_j : V_n \longrightarrow V_n.$$

Proof. Let $u \in V_n$. We know that $u \in \mathcal{D}(H) = H^2(\mathbb{R}^3)$. Also, from the analysis of the ODE (7.28), we know that $u(x)$ decays as $|x| \to \infty$, roughly like $e^{-|\lambda_n|^{1/2}|x|}$. It follows from (7.42) that $B_j u \in L^2(\mathbb{R}^3)$. It will be useful to obtain a bit more regularity, using $V_n \subset \mathcal{D}(H^2)$ together with the following.


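Proposition 7.7. If $u \in \mathcal{D}(H^2)$, then, for all $\varepsilon > 0$,

(7.45) $$u \in H^{5/2-\varepsilon}(\mathbb{R}^3).$$

Furthermore,

(7.46) $$g \in \mathcal{S}(\mathbb{R}^3),\ g(0) = 0 \Longrightarrow gu \in H^{7/2-\varepsilon}(\mathbb{R}^3).$$

Proof. We proceed along the lines of the proof of Proposition 7.2, using (7.12), i.e.,

(7.47) $$u = KR_\lambda Vu + R_\lambda f,$$

where $f = (H - \lambda)u$, with $\lambda$ chosen in $\mathbb{C} \setminus \mathbb{R}$. We know that $f = (H - \lambda)u$ belongs to $\mathcal{D}(H)$, so $R_\lambda f \in H^4(\mathbb{R}^3)$. We know that $u \in H^2(\mathbb{R}^3)$. Parallel to (7.13), we can show that, for all $\varepsilon > 0$,

(7.48) $$M_V : H^2(\mathbb{R}^3) \longrightarrow H^{1/2-\varepsilon}(\mathbb{R}^3),$$

so $KR_\lambda Vu \in H^{5/2-\varepsilon}(\mathbb{R}^3)$. This gives (7.45).

Now, multiply (7.47) by $g$ and write

(7.49) $$gu = KR_\lambda gVu + K[M_g, R_\lambda]Vu + gR_\lambda f.$$

This time we have

$$M_{gV} : H^2(\mathbb{R}^3) \longrightarrow H^{3/2-\varepsilon}(\mathbb{R}^3),$$

so $R_\lambda gVu \in H^{7/2-\varepsilon}(\mathbb{R}^3)$. Furthermore,

(7.50) $$[M_g, R_\lambda] = -R_\lambda [\Delta, M_g] R_\lambda : H^s(\mathbb{R}^3) \longrightarrow H^{s+3}(\mathbb{R}^3),$$

so $[M_g, R_\lambda]Vu \in H^{7/2-\varepsilon}(\mathbb{R}^3)$. This establishes (7.46).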


We can now finish the proof of Lemma 7.6. Note that the second-order derivatives in $B_j$ have a coefficient vanishing at 0. Keep in mind the known exponential decay of $u \in V_n$. Also note that $M_{x_j/r} : H^2(\mathbb{R}^3) \to H^{3/2-\varepsilon}(\mathbb{R}^3)$. Therefore,

(7.51) $$u \in V_n \Longrightarrow B_j u \in H^{3/2-\varepsilon}(\mathbb{R}^3).$$

Consequently,

(7.52) $$\Delta(B_j u) \in H^{-1/2-\varepsilon}(\mathbb{R}^3), \quad \text{and} \quad V(B_j u) \in L^1(\mathbb{R}^3) + L^2(\mathbb{R}^3).$$

Thus $(H - \lambda_n)(B_j u)$, which we know vanishes on $\mathbb{R}^3 \setminus 0$, must vanish completely, since (7.52) does not allow for a nonzero quantity supported on $\{0\}$. Using (7.8), we conclude that $B_j u \in \mathcal{D}(H)$, and the lemma is proved.

With Lemma 7.6 established, we can proceed to study the action of $B_j$ and $L_j$ on $V_n$. When $(j, k, \ell)$ is a cyclic permutation of $(1, 2, 3)$, we have

(7.53) $$[L_j, L_k] = L_\ell,$$

and, after a computation,

(7.54) $$[L_j, B_k] = B_\ell, \qquad [B_j, B_k] = -\frac{4}{K^2} H L_\ell.$$

Of course, (7.53) is the statement that $L_j$ span the Lie algebra $so(3)$ of $SO(3)$. The identities (7.54), when $L_j$ and $B_j$ act on $V_n$, can be rewritten as

(7.55) $$[L_j, A_k] = A_\ell, \qquad [A_j, A_k] = L_\ell, \qquad A_j = \frac{K}{2\sqrt{-\lambda_n}}\, B_j.$$

If we set

(7.56) $$M = \frac{1}{2}(L + A), \qquad N = \frac{1}{2}(L - A),$$

we get, for cyclic permutations $(j, k, \ell)$ of $(1, 2, 3)$,

(7.57) $$[M_j, M_k] = M_\ell, \qquad [N_j, N_k] = N_\ell, \qquad [M_j, N_k] = 0,$$

which is clearly the set of commutation relations for the Lie algebra $so(3) \oplus so(3)$. We next aim to show that this produces an irreducible representation of $SO(4)$ on $V_n$, and to identify this representation. A priori, of course, one certainly has a representation of $SU(2) \times SU(2)$ on $V_n$.

We now examine the behavior on $V_n$ of the Casimir operators $M^2 = M_1^2 + M_2^2 + M_3^2$ and $N^2$. A calculation using the definitions gives $B \cdot L = 0$, hence $A \cdot L = 0$, so, on $V_n$,

(7.58) $$M^2 = N^2 = \frac{1}{4}(A^2 + L^2) = \frac{1}{4}\Bigl(L^2 - \frac{K^2}{4\lambda_n} B^2\Bigr).$$

We also have the following key identity:

(7.59) $$K^2(B^2 - I) = 4H(L^2 + I),$$

which follows from the definitions by a straightforward computation. If we compare (7.58) and (7.59) on $V_n$, where $H = \lambda_n$, we get

(7.60) $$4M^2 = 4N^2 = -\Bigl(1 + \frac{K^2}{4\lambda_n}\Bigr) I \quad \text{on } V_n.$$

Now the representation $\sigma_n$ we get of $SU(2) \times SU(2)$ on $V_n$ is a direct sum (possibly with only one summand) of representations $D_{j/2} \otimes D_{j/2}$, where $D_{j/2}$ is the standard irreducible representation of $SU(2)$ on $\mathbb{C}^{j+1}$, defined in §9 of Appendix B. The computation (7.60) implies that all the copies in this sum are isomorphic, that is, for some $j = j(n)$,

(7.61) $$\sigma_n = \bigoplus_{\ell=1}^{\mu} D_{j(n)/2} \otimes D_{j(n)/2}.$$

A dimension count gives $\mu\,(j(n) + 1)^2 = n^2$. Note that on $D_{j/2} \otimes D_{j/2}$, we have $M^2 = N^2 = (j/2)(j/2 + 1)$. Thus (7.60) implies $j(j + 2) = -1 - K^2/4\lambda_n$, or

(7.62) $$\lambda_n = -\frac{K^2}{4(j + 1)^2}, \qquad j = j(n).$$

Comparing (7.38), we have $(j + 1)^2 = n^2$, that is,

(7.63) $$j(n) = n - 1.$$

Since we know that $\dim V_n = n^2$, this implies that there is just one summand in (7.61), so

(7.64) $$\sigma_n = D_{(n-1)/2} \otimes D_{(n-1)/2}.$$

This is an irreducible representation of $SU(2) \times SU(2)$, which is a double cover of $SO(4)$,

$$\kappa : SU(2) \times SU(2) \longrightarrow SO(4).$$

It is clear that $\sigma_n$ is the identity operator on both elements in $\ker \kappa$, and so $\sigma_n$ actually produces an irreducible representation of $SO(4)$.

Let $\rho_n$ denote the restriction to $V_n$ of the representation $\rho$ of $SO(3)$ on $L^2(\mathbb{R}^3)$, described above. If we regard this as a representation of $SU(2)$, it is clear that $\rho_n$ is the composition of $\sigma_n$ with the diagonal map $SU(2) \to SU(2) \times SU(2)$. Results established in §9 of Appendix B imply that such a tensor-product representation of $SU(2)$ has the decomposition into irreducible representations:

(7.65) $$\rho_n \approx \bigoplus_{k=0}^{n-1} D_k.$$

This is also precisely the description of $\rho_n$ given by the analysis leading to (7.39).




There are a number of other group-theoretic perspectives on the quantum Coulomb problem, which can be found in [Eng] and [GS2]. See also [Ad] and [Cor], Vol. 2.


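Exercises

1. For $H = -\Delta - K|x|^{-1}$ with domain given by (7.8), show that

(7.66) $$\mathcal{D}(H) = \{u \in L^2(\mathbb{R}^3) : -\Delta u - K|x|^{-1}u \in L^2(\mathbb{R}^3)\},$$

where a priori, if $u \in L^2(\mathbb{R}^3)$, then $\Delta u \in H^{-2}(\mathbb{R}^3)$ and $|x|^{-1}u \in L^1(\mathbb{R}^3) + L^2(\mathbb{R}^3) \subset H^{-2}(\mathbb{R}^3)$.

(Hint: Parallel the proof of Proposition 7.2. If $u$ belongs to the right side of (7.66), and if you pick $\lambda \in \mathbb{C} \setminus \mathbb{R}$, then, as in (7.12),

(7.67) $$u - KR_\lambda Vu = R_\lambda f \in H^2(\mathbb{R}^3).$$

Complement (7.13) with

(7.68) $$M_V : L^2(\mathbb{R}^3) \longrightarrow \bigcap_{\varepsilon > 0} H^{-3/2-\varepsilon}(\mathbb{R}^3), \qquad M_V : \bigcap_{\varepsilon > 0} H^{1/2-\varepsilon}(\mathbb{R}^3) \longrightarrow \bigcap_{\delta > 0} H^{-3/4-\delta}(\mathbb{R}^3).$$

(Indeed, sharper results can be obtained.) Then deduce from (7.67) first that $u \in H^{1/2-\varepsilon}(\mathbb{R}^3)$ and then that $u \in H^{5/4-\delta}(\mathbb{R}^3) \subset H^1(\mathbb{R}^3)$.)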
2. As a variant of (7.4), show that, for $u \in H^1(\mathbb{R}^3)$,

(7.69) $$\int |x|^{-2} |u(x)|^2\, dx \le 4 \int |\nabla u(x)|^2\, dx.$$

Show that 4 is the best possible constant on the right. (Hint: Use the Mellin transform to show that the spectrum of $r\, d/dr - 1/2$ on $L^2(\mathbb{R}^+, r^{-1}dr)$ (which coincides with the spectrum of $r\, d/dr$ on $L^2(\mathbb{R}^+, dr)$) is $\{is - 1/2 : s \in \mathbb{R}\}$, hence

(7.70) $$\int_0^\infty |u(r)|^2\, dr \le 4 \int_0^\infty |u'(r)|^2 r^2\, dr.)$$

This is sometimes called an “uncertainty principle” estimate. Why might that be? (Cf. [RS], Vol. 2, p. 169.)


3. Show that $H = -\Delta - K/|x|$ has no non-negative eigenvalues, i.e., only continuous
spectrum in $[0, \infty)$. (Hint: Study the behavior as $r \to +\infty$ of solutions to the ODE
(7.28), when $-E$ is replaced by $+E \in [0, \infty)$. Consult [Olv] for techniques. See also
[RS], Vol. 4, for general results.)


4. Generalize the propositions of this section, with modifications as needed, to other
classes of potentials $V(x)$, such as

$V \in L^2 + \varepsilon L^\infty,$

the set of functions $V$ such that, for each $\varepsilon > 0$, one can write $V = V_1 + V_2$, $V_1 \in L^2$, $\|V_2\|_{L^\infty} \le \varepsilon$. Consult [RS], Vols. 2-4, for further generalizations.




Exercises on the confluent hypergeometric function 


1. Taking (7.35) as the definition of ${}_1F_1(a; b; z)$, show that

(7.71)  ${}_1F_1(a; b; z) = \frac{\Gamma(b)}{\Gamma(a)\Gamma(b-a)} \int_0^1 e^{zt}\, t^{a-1}(1-t)^{b-a-1}\,dt, \qquad \mathrm{Re}\, b > \mathrm{Re}\, a > 0.$

(Hint: Use the beta function identity, (A.23)-(A.24) of Chap. 3.) Show that (7.71)
implies the asymptotic behavior (7.36), provided $\mathrm{Re}\, b > \mathrm{Re}\, a > 0$, but that this is
insufficient for making the deduction (7.37).


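Exercises 2-5 deal with the analytic continuation of (7.71) in $a$ and $b$, and a
complete justification of (7.36). To begin, write

(7.72)  ${}_1F_1(a;b;z) = \frac{\Gamma(b)}{\Gamma(b-a)}\,A_\psi(a,-z) + \frac{\Gamma(b)}{\Gamma(a)}\,A_\varphi(b-a,z)\,e^z,$

where, for $\mathrm{Re}\, c > 0$, $\psi \in C^\infty([0, 1/2])$, we set

(7.73)  $A_\psi(c, z) = \frac{1}{\Gamma(c)} \int_0^{1/2} e^{-zt}\,\psi(t)\,t^{c-1}\,dt,$

and, in (7.72),

$\psi(t) = (1-t)^{b-a-1}, \qquad \varphi(t) = (1-t)^{a-1}.$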
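2. Given $\mathrm{Re}\, c > 0$, show that

(7.74)  $A_\psi(c,z) \sim \psi(0)\,z^{-c}, \qquad z \to +\infty,$

and

(7.75)  $A_\psi(c,-z) \sim \frac{\psi(1/2)}{\Gamma(c)}\,2^{1-c}\,z^{-1}e^{z/2}, \qquad z \to +\infty.$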
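3. For $j = 0,1,2,\dots$, set

(7.76)  $A_j(c,z) = \frac{1}{\Gamma(c)}\int_0^{1/2} e^{-zt}\,t^{j+c-1}\,dt,$

so $A_j(c, z) = A_\psi(c, z)$, with $\psi(t) = t^j$. Show that

$A_j(c,z) = \frac{\Gamma(c+j)}{\Gamma(c)}\,z^{-c-j} - \frac{1}{\Gamma(c)}\int_{1/2}^\infty e^{-zt}\,t^{c+j-1}\,dt$

for $\mathrm{Re}\, z > 0$. Deduce that $A_j(c, z)$ is an entire function of $c$, for $\mathrm{Re}\, z > 0$, and that

$A_j(c,z) \sim \frac{\Gamma(c+j)}{\Gamma(c)}\,z^{-c-j}, \qquad z \to +\infty,$

if $c \notin \{0,-1,-2,\dots\}$.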
4. Given $k = 1,2,3,\dots$, write

$\psi(t) = a_0 + a_1 t + \cdots + a_{k-1}t^{k-1} + \psi_k(t)\,t^k, \qquad \psi_k \in C^\infty([0, \tfrac12]).$

Thus

(7.77)  $A_\psi(c,z) = \sum_{j=0}^{k-1} a_j A_j(c,z) + \frac{1}{\Gamma(c)}\int_0^{1/2} e^{-zt}\,\psi_k(t)\,t^{k+c-1}\,dt.$

Deduce that $A_\psi(c, z)$ can be analytically continued to $\mathrm{Re}\, c > -k$ when $\mathrm{Re}\, z > 0$
and that (7.74) continues to hold if $c \notin \{0,-1,-2,\dots\}$, $a_0 \ne 0$.
5. Using $t^{c-1} = c^{-1}(d/dt)t^c$ and integrating by parts, show that

(7.78)  $A_0(c,z) = z\,A_0(c+1,z) + \frac{e^{-z/2}}{2^c\,\Gamma(c+1)},$

for $\mathrm{Re}\, c > 0$, all $z \in \mathbb{C}$. Show that this provides an entire analytic continuation of
$A_0(c, z)$ and that (7.74)-(7.75) hold, for $\psi(t) = 1$. Using

$A_j(c,z) = \frac{\Gamma(c+j)}{\Gamma(c)}\,A_0(c+j, z)$

and (7.77), verify (7.75) for all $\psi \in C^\infty([0, 1/2])$. (Also again verify (7.74).) Hence,
verify the asymptotic expansion (7.36).

The approach given above to (7.36) is one the author learned from conversations
with A. N. Varchenko. In Exercises 6-15 below, we introduce another solution to the
confluent hypergeometric equation and follow a path to the expansion (7.36) similar
to one described in [Leb] and in [Olv].

6. Show that a solution to the ODE (7.30) is also given by

$z^{1-b}\,{}_1F_1(1+a-b;\,2-b;\,z),$

in addition to ${}_1F_1(a; b; z)$, defined by (7.35). Assume $b \ne 0, -1, -2, \dots$. Set

(7.79)  $\Psi(a;b;z) = \frac{\Gamma(1-b)}{\Gamma(1+a-b)}\,{}_1F_1(a;b;z) + \frac{\Gamma(b-1)}{\Gamma(a)}\,z^{1-b}\,{}_1F_1(1+a-b;\,2-b;\,z).$

Show that the Wronskian is given by

$W\big({}_1F_1(a;b;z),\,\Psi(a;b;z)\big) = -\frac{\Gamma(b)}{\Gamma(a)}\,z^{-b}e^{z}.$
7. Show that

(7.80)  ${}_1F_1(a;b;z) = e^z\,{}_1F_1(b-a;\,b;\,-z), \qquad b \notin \{0,-1,-2,\dots\}.$

(Hint: Use the integral in Exercise 1, and set $s = 1-t$, for the case $\mathrm{Re}\, b > \mathrm{Re}\, a > 0$.)
8. Show that

(7.81)  $\Psi(a;b;z) = \frac{1}{\Gamma(a)}\int_0^\infty e^{-zt}\,t^{a-1}(1+t)^{b-a-1}\,dt, \qquad \mathrm{Re}\, a > 0,\ \mathrm{Re}\, z > 0.$

(Hint: First show that the right side solves (7.30). Then check the behavior as $z \to 0$.)
9. Show that

(7.82)  $\Psi(a;b;z) = z\,\Psi(a+1;\,b+1;\,z) + (1+a-b)\,\Psi(a+1;\,b;\,z).$

(Hint: To get this when $\mathrm{Re}\, a > 0$, use the integral expression (7.81) for
$\Psi(a+1;b+1;z)$, write $ze^{-zt} = -(d/dt)e^{-zt}$, and integrate by parts.)
10. Show that

(7.83)  ${}_1F_1(a;b;z) = \frac{\Gamma(b)}{\Gamma(b-a)}\,e^{\pm\pi i a}\,\Psi(a;b;z) + \frac{\Gamma(b)}{\Gamma(a)}\,e^{\pm\pi i(a-b)}\,e^z\,\Psi(b-a;\,b;\,-z),$

where $-z = e^{\mp\pi i}z$, $b \ne 0, -1, -2, \dots$. (Hint: Make use of (7.80) as well as (7.79).)
11. Using the integral representation (7.81), show that under the hypotheses $\delta > 0$, $b \notin \{0,-1,-2,\dots\}$, and $\mathrm{Re}\, a > 0$, we have

(7.84)  $\Psi(a;b;z) \sim z^{-a}, \qquad |z| \to \infty,$

in the sector

(7.85)  $|\mathrm{Arg}\, z| \le \frac{\pi}{2} - \delta.$


12. Extend (7.84) to the sector $|\mathrm{Arg}\, z| \le \pi - \delta$. (Hint: Replace (7.81) by an integral
along the ray $\gamma = \{e^{i\alpha}s : 0 \le s < \infty\}$, given $|\alpha| < \pi/2$.)

13. Further extend (7.84) to the case where no restriction is placed on Re a. 
(Hint: Use (7.82).) 

14. Extend (7.84) still further, to be valid for

(7.86)  $|\mathrm{Arg}\, z| \le \frac{3\pi}{2} - \delta.$


(Hint: See Theorem 2.2 on p. 235 of [Olv], and its application to this problem on 
p. 256 of [Olv].) 
15. Use (7.83)-(7.86) to prove (7.36), that is,

(7.87)  ${}_1F_1(a;b;z) \sim \frac{\Gamma(b)}{\Gamma(a)}\,e^z z^{-(b-a)}, \qquad z \to +\infty,$

provided $a, b \notin \{0,-1,-2,\dots\}$.

Remarks: For the analysis of $\Psi(b-a;\,b;\,-z)$ as $z \to +\infty$, the result of Exercise
14 suffices, but the result of Exercise 13 does not. This point appears to have been
neglected in the discussion of (7.87) on p. 271 of [Leb].


8. Potential well—quantum model of a deuteron 


In this section, we consider a quantum model for the deuteron, and compute its 
ground state. We see that, in this model, the nucleons have a greater probability of 
lying outside the potential well than in it, as noted in nuclear physics texts, such 
as [BM], [F], and [S]. 




We take a simple potential well. In more detail, given $a, V_0 \in (0,\infty)$, $x \in \mathbb{R}^3$,
we set

(8.1)  $V(x) = -V_0$ for $|x| \le a$, $\qquad V(x) = 0$ for $|x| > a$.

We consider whether $-\Delta + V$ has negative eigenvalues, and if so, how its ground
state behaves.

If $-\Delta + V$ has negative eigenvalues, denote by $-E$ the one with largest
absolute value. We must have $E \in (0,V_0)$, and the ground state will be given
by a function $\psi \in C^1(\mathbb{R}^3)$, rapidly decreasing at infinity, positive and radially
symmetric, satisfying

(8.2)  $\Delta\psi = [V(x)+E]\,\psi$ on $\mathbb{R}^3$.

In particular, with $r = |x|$,

(8.3)  $\psi(x) = \frac{u(r)}{r},$

where $u \in C^1((0, \infty))$ satisfies

(8.4)  $u''(r) = [V(r) + E]\,u(r).$


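The properties of $E$ and $\psi$ detailed above demand that, for some $A, B \in (0,\infty)$,

(8.5)  $u(r) = A\sin kr$ for $r \le a$, $\qquad u(r) = Be^{-\gamma r}$ for $r \ge a$,

with

(8.6)  $k = \sqrt{V_0 - E}, \qquad \gamma = \sqrt{E}.$

The fact that $u \in C^1((0,\infty))$ yields the relations

(8.7)  $A\sin ka = Be^{-\gamma a}, \qquad kA\cos ka = -B\gamma e^{-\gamma a},$

hence

(8.8)  $B = Ae^{\gamma a}\sin ka, \qquad k\cot ka = -\gamma.$

Also

(8.9)  $\psi > 0 \ \Longrightarrow\ 0 < ka < \pi \ \Longrightarrow\ (V_0 - E)a^2 < \pi^2.$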

Note that $A$ is a positive multiple of $B$; hence the second part of (8.7) yields

(8.10)  $\cos ka < 0$, so $ka > \frac{\pi}{2}.$

Comparison with (8.9) gives

(8.11)  $\frac{\pi}{2} < ka < \pi$, hence $\frac{\pi^2}{4} < (V_0 - E)a^2 < \pi^2.$


In particular: 


Proposition 8.1. If $-\Delta + V$ has a negative eigenvalue, then

(8.12)  $V_0 a^2 > \frac{\pi^2}{4}.$


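Given that there is a negative eigenvalue $-E$ with largest absolute value, we
next strive for a formula: $E = E(V_0, a)$. To get this, it is convenient to set

(8.13)  $E = \delta^2 V_0,\ \ 0 < \delta < 1; \qquad ka = \frac{\pi}{2} + \varepsilon,\ \ 0 < \varepsilon < \frac{\pi}{2},$

and get formulas relating these quantities. Note that

(8.14)  $\cot ka = \cot\Big(\frac{\pi}{2}+\varepsilon\Big) = -\tan\varepsilon,$

and bringing in (8.8) we have

(8.15)  $\tan\varepsilon = \frac{\gamma}{k} = \sqrt{\frac{E}{V_0 - E}} = \frac{\delta}{\sqrt{1-\delta^2}}, \qquad \delta \in (0,1).$

Equivalently,

(8.16)  $\delta = \sin\varepsilon, \qquad 0 < \varepsilon < \frac{\pi}{2}.$

To continue, we have

(8.17)  $\frac{(\pi/2+\varepsilon)^2}{a^2} = k^2 = V_0 - E, \qquad \cos^2\varepsilon = 1 - \delta^2 = \frac{1}{V_0a^2}\Big(\frac{\pi}{2}+\varepsilon\Big)^2,$

respectively by (8.6), (8.13), and (8.16). This gives the sought-after relation: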


Proposition 8.2. Given $V_0a^2 > \pi^2/4$, we have

(8.18)  $V_0a^2 = \frac{1}{\cos^2\varepsilon}\Big(\frac{\pi}{2}+\varepsilon\Big)^2,\ \ 0 < \varepsilon < \frac{\pi}{2} \ \Longrightarrow\ E = V_0\sin^2\varepsilon,$

where $-E$ is the negative eigenvalue of $-\Delta + V$ of largest absolute value.

REMARK. Given $\varepsilon \ll 1$, we have

(8.19)  $V_0a^2 \approx \frac{\pi^2}{4} + \pi\varepsilon, \qquad E \approx V_0\varepsilon^2.$
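
As a numerical illustration of Proposition 8.2, the relation (8.18) can be inverted for $\varepsilon$ by bisection, since $(\pi/2+\varepsilon)/\cos\varepsilon$ increases from $\pi/2$ to $+\infty$ on $(0,\pi/2)$. The following is a minimal Python sketch and not part of the text's argument; the function name `ground_state` and the sample values $V_0 = 4$, $a = 1$ are illustrative assumptions. It also checks the matching condition $k\cot ka = -\gamma$ from (8.8).

```python
import math

def ground_state(V0, a):
    """Return (eps, E) for the well (8.1), using the relation (8.18).

    Hypothetical helper: assumes V0 * a**2 > pi**2 / 4, as in Proposition 8.1.
    """
    target = math.sqrt(V0) * a              # must equal (pi/2 + eps) / cos(eps)
    if target <= math.pi / 2:
        raise ValueError("no negative eigenvalue: V0 * a^2 <= pi^2 / 4")
    g = lambda e: (math.pi / 2 + e) / math.cos(e) - target
    lo, hi = 0.0, math.pi / 2 - 1e-12       # g(lo) < 0 < g(hi); g is increasing
    for _ in range(200):                    # plain bisection
        mid = 0.5 * (lo + hi)
        if g(mid) < 0:
            lo = mid
        else:
            hi = mid
    eps = 0.5 * (lo + hi)
    return eps, V0 * math.sin(eps) ** 2     # E = V0 sin^2(eps)

V0, a = 4.0, 1.0
eps, E = ground_state(V0, a)
k, gamma = math.sqrt(V0 - E), math.sqrt(E)  # as in (8.6)
residual = k / math.tan(k * a) + gamma      # k cot(ka) + gamma, should be ~0
print(eps, E, residual)
```

Bisection is used here only because the left side of (8.18) is monotone in $\varepsilon$; any bracketing root finder would serve equally well.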


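Having these calculations, we desire to compute the integral of $|\psi(x)|^2$ over
the respective regions $\{|x| < a\}$ and $\{|x| > a\}$. We have

(8.20)
$$\begin{aligned}
\int_{|x|<a} |\psi(x)|^2\,dx &= 4\pi\int_0^a |u(r)|^2\,dr \\
&= 4\pi A^2\int_0^a \sin^2 kr\,dr \\
&= \frac{4\pi A^2}{k}\int_0^{\pi/2+\varepsilon} \sin^2 s\,ds \\
&= \frac{4\pi A^2}{\sqrt{V_0 - E}}\Big(\frac{\pi}{4} + \int_0^{\varepsilon}\cos^2 t\,dt\Big) \\
&= \frac{4\pi A^2}{\sqrt{V_0}}\,\frac{1}{\cos\varepsilon}\Big(\frac{\pi}{4} + \int_0^{\varepsilon}\cos^2 t\,dt\Big)
\end{aligned}$$

and

(8.21)
$$\begin{aligned}
\int_{|x|>a} |\psi(x)|^2\,dx &= 4\pi\int_a^\infty |u(r)|^2\,dr \\
&= \frac{4\pi B^2}{2\gamma}\,e^{-2\gamma a} \\
&= \frac{4\pi A^2}{2\gamma}\,\sin^2 ka \\
&= \frac{4\pi A^2}{2\sqrt{E}}\,\sin^2\Big(\frac{\pi}{2}+\varepsilon\Big) \\
&= \frac{\cos^2\varepsilon}{2\sin\varepsilon}\,\frac{4\pi A^2}{\sqrt{V_0}}.
\end{aligned}$$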

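The quantity $A$ (which we need not compute) is the normalizing constant, making
the two integrals above sum to 1. We see that, as $\varepsilon \searrow 0$,

(8.22)  $\int_{|x|<a} |\psi(x)|^2\,dx \approx \frac{\pi^2A^2}{\sqrt{V_0}}, \qquad \int_{|x|>a} |\psi(x)|^2\,dx \approx \frac{2\pi A^2}{\varepsilon\sqrt{V_0}},$

so the integral over $\{|x| > a\}$ is much larger than the integral over $\{|x| < a\}$, for
$\varepsilon$ small enough.

As for how small $\varepsilon$ is, we note that (8.18) plus the identity $E = \gamma^2$ yield

(8.23)  $\Big(\frac{\pi}{2}+\varepsilon\Big)\tan\varepsilon = \gamma a.$

Information on $a$ and on $E$ (hence on $\gamma$) would allow one to solve for $\varepsilon$, and then
for $V_0 = E/\sin^2\varepsilon$.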

We next see how this plays out for the deuteron, for which (8.2) arises as a
model for the ground state. Actually, (8.2) is the nondimensionalized form. The
physical form is

(8.24)  $\Delta\psi = \frac{2m}{\hbar^2}\big[\widetilde V(x) + \widetilde E\big]\psi,$

where, with $m = m_pm_n/(m_p+m_n) \approx m_p/2$ and $c \approx 3\times 10^8$ m/sec,

(8.25)  $2m \approx$ mass of a proton $\approx 938\ \mathrm{MeV}/c^2$, $\qquad \hbar =$ Planck's constant $\approx 6.6\times 10^{-22}$ MeV-sec,

and $\widetilde V(x)$ and $\widetilde E$ are measured in MeV. This leads to (8.2) with

(8.26)  $V(x) = \frac{2m}{\hbar^2}\,\widetilde V(x), \qquad E = \frac{2m}{\hbar^2}\,\widetilde E,$

where $\widetilde V(x) = -\widetilde V_0$ on $|x| \le a$, and $V_0 = (2m/\hbar^2)\widetilde V_0$. Experiments shooting
gamma rays at deuterium show that

(8.27)  $\widetilde E \approx 2.225$ MeV.

This corresponds via $\gamma = \sqrt{E} = \sqrt{2m\widetilde E}/\hbar$ to

(8.28)  $\gamma^{-1} \approx 4.32$ fm,

where 1 fm $= 10^{-15}$ m. The meson model of nuclear forces suggests

(8.29)  $a \approx 2.8$ fm.

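Cf. [S], p. 449. This gives

(8.30)  $\gamma a \approx 0.648,$

and solving (8.23) then gives

(8.31)  $\varepsilon \approx 0.329.$

Hence

(8.32)  $\delta \approx 0.323,$

so

(8.33)  $\widetilde V_0 = \delta^{-2}\widetilde E \approx 21.34$ MeV.

Referring to (8.20)-(8.21), we see that in this case

(8.34)  $\int_{|x|<a} |\psi(x)|^2\,dx \approx (1.165)\,\frac{4\pi A^2}{\sqrt{V_0}}$

and

(8.35)  $\int_{|x|>a} |\psi(x)|^2\,dx \approx (1.387)\,\frac{4\pi A^2}{\sqrt{V_0}}.$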


The figure (8.31) for $\varepsilon$ is not terribly consistent with the hypothesis that
$\varepsilon \ll 1$, though the figure (8.33) for $\widetilde V_0$ is consistent with $\widetilde V_0 \gg \widetilde E$. The figures
for $\gamma$ and $\widetilde V_0$ given in (8.28) and (8.33) agree with those given in [S] (p. 449).
We note however that the integral (8.35) is only a little larger than (8.34). This
contrasts with the statement in [S] that it is “about twice as large.”

In more detail, the ratio of (8.35) to (8.34) is

(8.36)  $R \approx 1.191,$

which is not close to 2. On the other hand, if we take $\varepsilon$ as in (8.31) and plug it
into the “small $\varepsilon$ approximation” (8.22), we get the “approximation”

(8.37)  $R \approx \frac{2}{\pi\varepsilon} \approx 1.935,$

in close agreement with the calculation in [S]. However, (8.36) seems to be a
more accurate calculation for the model at hand.
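
The numbers in (8.30)-(8.37) can be re-derived with a few lines of code. The sketch below is only a sanity check under stated assumptions: it takes $\gamma a = 0.648$ and $\widetilde E = 2.225$ MeV as inputs, solves (8.23) by bisection, and evaluates the prefactors of $4\pi A^2/\sqrt{V_0}$ in (8.20) and (8.21), using $\int_0^\varepsilon \cos^2 t\,dt = \varepsilon/2 + \sin(2\varepsilon)/4$. The names `solve_eps`, `inner`, `outer` are illustrative, not from the text.

```python
import math

def solve_eps(gamma_a):
    """Solve (pi/2 + eps) * tan(eps) = gamma_a on (0, pi/2), cf. (8.23)."""
    f = lambda e: (math.pi / 2 + e) * math.tan(e) - gamma_a
    lo, hi = 0.0, math.pi / 2 - 1e-9        # f(lo) < 0 < f(hi); f is increasing
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if f(mid) < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

E_MeV = 2.225                               # binding energy, (8.27)
gamma_a = 0.648                             # (8.30)

eps = solve_eps(gamma_a)                    # compare (8.31)
delta = math.sin(eps)                       # compare (8.32), via (8.16)
V0_MeV = E_MeV / delta ** 2                 # compare (8.33)

# Prefactors of 4*pi*A^2/sqrt(V0) in (8.20) and (8.21).
inner = (math.pi / 4 + eps / 2 + math.sin(2 * eps) / 4) / math.cos(eps)
outer = math.cos(eps) ** 2 / (2 * math.sin(eps))

ratio = outer / inner                       # compare (8.36)
small_eps_ratio = 2 / (math.pi * eps)       # compare (8.37)

print(f"eps={eps:.3f} delta={delta:.3f} V0={V0_MeV:.2f} MeV")
print(f"inner={inner:.3f} outer={outer:.3f} ratio={ratio:.3f} "
      f"small-eps ratio={small_eps_ratio:.3f}")
```

Up to rounding of the inputs, this should reproduce the figures quoted above to about three significant digits, including the gap between the exact ratio and the small-$\varepsilon$ approximation.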


Exercises 


1. Formulate conditions on $V_0$ and $a$ in (8.1) that imply that $L = -\Delta + V$ has
exactly one negative eigenvalue.

2. Give conditions on $V_0$ and $a$ under which $L$ must have more than one negative
eigenvalue.


9. The Laplace operator on cones 


Generally, if $N$ is any compact Riemannian manifold of dimension $m$, possibly
with boundary, the cone over $N$, $C(N)$, is the space $\mathbb{R}^+ \times N$ together with the
Riemannian metric

(9.1)  $dr^2 + r^2 g,$

where $g$ is the metric tensor on $N$. In particular, a cone with vertex at the origin
in $\mathbb{R}^{m+1}$ can be described as the cone over a subdomain $\Omega$ of the unit sphere
$S^m$ in $\mathbb{R}^{m+1}$. Our purpose is to understand the behavior of the Laplace operator
$\Delta$, a negative, self-adjoint operator, on $C(N)$. If $\partial N \ne \emptyset$, we impose Dirichlet
boundary conditions on $\partial C(N)$, though many other boundary conditions could
be equally easily treated. The analysis here follows [CT].

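The initial step is to use the method of separation of variables, writing $\Delta$ on
$C(N)$ in the form

(9.2)  $\Delta = \frac{\partial^2}{\partial r^2} + \frac{m}{r}\frac{\partial}{\partial r} + \frac{1}{r^2}\Delta_N,$

where $\Delta_N$ is the Laplace operator on the base $N$. Let $\mu_j$, $\varphi_j(x)$ denote the
eigenvalues and eigenfunctions of $-\Delta_N$ (with Dirichlet boundary condition on
$\partial N$ if $\partial N \ne \emptyset$), and set

(9.3)  $\nu_j = (\mu_j + \alpha^2)^{1/2}, \qquad \alpha = -\frac{m-1}{2}.$

If

$g(r,x) = \sum_j g_j(r)\,\varphi_j(x),$

with $g_j(r)$ well behaved, and if we define the second-order operator $L_\mu$ by

(9.4)  $L_\mu g(r) = \Big(\frac{\partial^2}{\partial r^2} + \frac{m}{r}\frac{\partial}{\partial r} - \frac{\mu}{r^2}\Big)g(r),$

then we have

(9.5)  $\Delta g(r,x) = \sum_j L_{\mu_j}g_j(r)\,\varphi_j(x).$

In particular,

(9.6)  $\Delta(g_j\varphi_j) = -\lambda^2 g_j\varphi_j$

provided

(9.7)  $g_j(r) = r^{-(m-1)/2}J_{\nu_j}(\lambda r).$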


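Here $J_\nu(z)$ is the Bessel function, introduced in §6 of Chap. 3; there in (6.6) it is
defined to be

(9.8)  $J_\nu(z) = \frac{(z/2)^\nu}{\Gamma(\frac12)\Gamma(\nu+\frac12)}\int_{-1}^{1}(1-t^2)^{\nu-1/2}e^{izt}\,dt,$

for $\mathrm{Re}\,\nu > -1/2$; in (6.11) we establish Bessel's equation

(9.9)  $\Big[\frac{d^2}{dz^2} + \frac{1}{z}\frac{d}{dz} + \Big(1 - \frac{\nu^2}{z^2}\Big)\Big]J_\nu(z) = 0,$

which justifies (9.6); and in (6.19) we produced the formula

(9.10)  $J_\nu(z) = \Big(\frac{z}{2}\Big)^{\nu}\sum_{k=0}^{\infty}\frac{(-1)^k}{k!\,\Gamma(k+\nu+1)}\Big(\frac{z}{2}\Big)^{2k}.$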


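We also recall, from (6.56) of Chap. 3, the asymptotic behavior

(9.11)  $J_\nu(r) \sim \Big(\frac{2}{\pi r}\Big)^{1/2}\cos\Big(r - \frac{\pi\nu}{2} - \frac{\pi}{4}\Big) + O(r^{-3/2}), \qquad r \to +\infty.$

This suggests making use of the Hankel transform, defined for $\nu \in \mathbb{R}^+$ by

(9.12)  $H_\nu(g)(\lambda) = \int_0^\infty g(r)\,J_\nu(\lambda r)\,r\,dr.$

Clearly, $H_\nu : C_0^\infty((0,\infty)) \to L^\infty(\mathbb{R}^+)$. We will establish the following:

Proposition 9.1. For $\nu \ge 0$, $H_\nu$ extends uniquely from $C_0^\infty((0,\infty))$ to

(9.13)  $H_\nu : L^2(\mathbb{R}^+, r\,dr) \longrightarrow L^2(\mathbb{R}^+, \lambda\,d\lambda)$, unitary.

Furthermore, for each $g \in L^2(\mathbb{R}^+, r\,dr)$,

(9.14)  $H_\nu \circ H_\nu\, g = g.$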


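To prove this, it is convenient to consider first

(9.15)  $\widetilde H_\nu f(\lambda) = \int_0^\infty f(r)\,\frac{J_\nu(\lambda r)}{(\lambda r)^\nu}\,r^{2\nu+1}\,dr,$

since, by (9.10), $(\lambda r)^{-\nu}J_\nu(\lambda r)$ is a smooth function of $\lambda r$. Set

(9.16)  $\mathcal{S}(\mathbb{R}^+) = \{f|_{\mathbb{R}^+} : f \in \mathcal{S}(\mathbb{R})$ is even$\}.$

Lemma 9.2. If $\nu \ge -1/2$, then

(9.17)  $\widetilde H_\nu : \mathcal{S}(\mathbb{R}^+) \longrightarrow \mathcal{S}(\mathbb{R}^+).$

Proof. By (9.10), $J_\nu(\lambda r)/(\lambda r)^\nu$ is a smooth function of $\lambda r$. The formula (9.8)
yields

(9.18)  $\Big|\frac{J_\nu(\lambda r)}{(\lambda r)^\nu}\Big| \le C_\nu < \infty,$

for $\lambda r \in [0,\infty)$, $\nu > -1/2$, a result that, by the identity

(9.19)  $J_{-1/2}(z) = \Big(\frac{2}{\pi z}\Big)^{1/2}\cos z,$

established in (6.35) of Chap. 3, also holds for $\nu = -1/2$. This readily yields

(9.20)  $\widetilde H_\nu : \mathcal{S}(\mathbb{R}^+) \longrightarrow L^\infty(\mathbb{R}^+),$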


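whenever $\nu \ge -1/2$. Now consider the differential operator $\widetilde L_\nu$ given by

(9.21)  $\widetilde L_\nu f(r) = -r^{-2\nu-1}\frac{\partial}{\partial r}\Big(r^{2\nu+1}\frac{\partial f}{\partial r}\Big) = -\frac{\partial^2 f}{\partial r^2} - \frac{2\nu+1}{r}\frac{\partial f}{\partial r}.$

Using Bessel's equation (9.9), we have

(9.22)  $\widetilde L_\nu\Big(\frac{J_\nu(\lambda r)}{(\lambda r)^\nu}\Big) = \lambda^2\,\frac{J_\nu(\lambda r)}{(\lambda r)^\nu}$

and, for $f \in \mathcal{S}(\mathbb{R}^+)$,

(9.23)  $\widetilde H_\nu(\widetilde L_\nu f)(\lambda) = \lambda^2\,\widetilde H_\nu f(\lambda), \qquad \widetilde H_\nu(r^2 f)(\lambda) = \widetilde L_\nu \widetilde H_\nu f(\lambda).$

Since $f \in L^\infty(\mathbb{R}^+)$ belongs to $\mathcal{S}(\mathbb{R}^+)$ if and only if arbitrary iterated applications
of $\widetilde L_\nu$ and multiplication by $r^2$ to $f$ yield elements of $L^\infty(\mathbb{R}^+)$, the result (9.17)
follows. We also have that this map is continuous with respect to the natural
Fréchet space structure on $\mathcal{S}(\mathbb{R}^+)$.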


Lemma 9.3. Consider the elements $E_b \in \mathcal{S}(\mathbb{R}^+)$, given for $b > 0$ by

(9.24)  $E_b(r) = e^{-br^2}.$

We have

(9.25)  $\widetilde H_\nu E_{1/2}(\lambda) = E_{1/2}(\lambda),$

and more generally

(9.26)  $\widetilde H_\nu E_b(\lambda) = (2b)^{-\nu-1}E_{1/4b}(\lambda).$

Proof. To establish (9.25), plug the power series (9.10) for $J_\nu(z)$ into (9.15) and
integrate term by term, to get

(9.27)  $\widetilde H_\nu E_{1/2}(\lambda) = \sum_{k=0}^{\infty}\frac{(-1)^k}{k!\,\Gamma(k+\nu+1)}\,2^{-2k-\nu}\lambda^{2k}\int_0^\infty r^{2k+2\nu+1}e^{-r^2/2}\,dr.$

This last integral is seen to equal $2^{k+\nu}\Gamma(k+\nu+1)$, so we have


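(9.28)  $\widetilde H_\nu E_{1/2}(\lambda) = \sum_{k=0}^{\infty}\frac{1}{k!}\Big(-\frac{\lambda^2}{2}\Big)^k = e^{-\lambda^2/2} = E_{1/2}(\lambda).$

Having (9.25), we get (9.26) by an easy change of variable argument.
In more detail, set $r^2/2 = bs^2$, or $s = r/\sqrt{2b}$. Then set $\mu = \sqrt{2b}\,\lambda$, so
$\lambda r = \mu s$. Then (9.28), which we can write as

(9.29)  $\int_0^\infty e^{-r^2/2}J_\nu(\lambda r)\,r^{\nu+1}\,dr = \lambda^\nu e^{-\lambda^2/2},$

translates to

(9.30)  $\int_0^\infty e^{-bs^2}J_\nu(\mu s)\,(2b)^{(\nu+1)/2}s^{\nu+1}(2b)^{1/2}\,ds = (2b)^{-\nu/2}\mu^\nu e^{-\mu^2/4b},$

or, changing notation back,

(9.31)  $\int_0^\infty e^{-bs^2}J_\nu(\lambda s)\,s^{\nu+1}\,ds = (2b)^{-\nu-1}\lambda^\nu e^{-\lambda^2/4b},$

which gives (9.26).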
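
The identity (9.31) lends itself to a quick numerical spot check. The sketch below is an illustration only, under stated assumptions: it evaluates $J_\nu$ from the power series (9.10), truncated at a fixed number of terms (adequate for the moderate arguments used here), integrates with a composite Simpson rule on a truncated interval, and compares with the closed form. The helper names `bessel_j` and `simpson`, and the sample values of $\nu$, $b$, $\lambda$, are hypothetical choices.

```python
import math

def bessel_j(nu, z, terms=60):
    """J_nu(z) from the power series (9.10); intended for moderate z >= 0."""
    return sum((-1) ** k / (math.factorial(k) * math.gamma(k + nu + 1))
               * (z / 2) ** (2 * k + nu) for k in range(terms))

def simpson(f, a, b, n=2000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    total += 4 * sum(f(a + (2 * i - 1) * h) for i in range(1, n // 2 + 1))
    total += 2 * sum(f(a + 2 * i * h) for i in range(1, n // 2))
    return total * h / 3

nu, b, lam = 1.0, 1.0, 1.5                  # illustrative sample values

# Left side of (9.31); the Gaussian makes the tail beyond s = 8 negligible.
lhs = simpson(lambda s: math.exp(-b * s * s) * bessel_j(nu, lam * s) * s ** (nu + 1),
              0.0, 8.0)
# Right side of (9.31).
rhs = (2 * b) ** (-nu - 1) * lam ** nu * math.exp(-lam * lam / (4 * b))

print(lhs, rhs)
```

The two printed values should agree to many digits; a disagreement would point to a transcription error in the exponents of $2b$.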
From (9.26) we have, for each $b > 0$,

(9.32)  $\widetilde H_\nu\widetilde H_\nu E_b = (2b)^{-\nu-1}\,\widetilde H_\nu E_{1/4b} = E_b,$

which verifies our stated Hankel inversion formula for $f = E_b$, $b > 0$. To get the
inversion formula for general $f \in \mathcal{S}(\mathbb{R}^+)$, it suffices to establish the following.

Lemma 9.4. The space

(9.33)  $\mathcal{V} = \mathrm{Span}\,\{E_b : b > 0\}$

is dense in $\mathcal{S}(\mathbb{R}^+)$.


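Proof. Let $\overline{\mathcal{V}}$ denote the closure of $\mathcal{V}$ in $\mathcal{S}(\mathbb{R}^+)$. From

(9.34)  $\frac{1}{\varepsilon}\Big(e^{-br^2} - e^{-(b+\varepsilon)r^2}\Big) \longrightarrow r^2e^{-br^2},$

we deduce that $r^2e^{-br^2} \in \overline{\mathcal{V}}$, and inductively, we get

(9.35)  $r^{2j}e^{-br^2} \in \overline{\mathcal{V}}, \qquad \forall\, j \in \mathbb{Z}^+.$

From here, one has

(9.36)  $(\cos\xi r)\,e^{-r^2} \in \overline{\mathcal{V}}, \qquad \forall\, \xi \in \mathbb{R}.$

Now each even $\omega \in \mathcal{S}'(\mathbb{R})$ annihilating (9.36) for all $\xi \in \mathbb{R}$ has the property that
$e^{-r^2}\omega$ has Fourier transform zero, which implies $\omega = 0$. The assertion (9.33)
then follows by the Hahn-Banach theorem.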


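Putting the results of Lemmas 9.2-9.4 together, we have

Proposition 9.5. Given $\nu \ge -1/2$, we have

(9.37)  $\widetilde H_\nu\widetilde H_\nu f = f,$

for all $f \in \mathcal{S}(\mathbb{R}^+)$.

We promote this to

Proposition 9.6. If $\nu \ge -1/2$, we have a unique extension of $\widetilde H_\nu$ from $\mathcal{S}(\mathbb{R}^+)$ to

(9.38)  $\widetilde H_\nu : L^2(\mathbb{R}^+, r^{2\nu+1}\,dr) \longrightarrow L^2(\mathbb{R}^+, \lambda^{2\nu+1}\,d\lambda),$

as a unitary operator, and (9.37) holds for all $f \in L^2(\mathbb{R}^+, r^{2\nu+1}\,dr)$.

Proof. Take $f, g \in \mathcal{S}(\mathbb{R}^+)$, and use the inner product

(9.39)  $(f, g) = \int_0^\infty f(r)\,\overline{g(r)}\,r^{2\nu+1}\,dr.$

Using Fubini's theorem and the fact that $J_\nu(\lambda r)/(\lambda r)^\nu$ is real valued and symmetric in $(\lambda, r)$, we get the first identity in

(9.40)  $(\widetilde H_\nu f, \widetilde H_\nu g) = (\widetilde H_\nu\widetilde H_\nu f, g) = (f, g),$

the second identity following by Proposition 9.5. From here, given that the linear
space $\mathcal{S}(\mathbb{R}^+) \subset L^2(\mathbb{R}^+, r^{2\nu+1}\,dr)$ is dense, the assertions of Proposition 9.6 are
apparent.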


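We return to the Hankel transform (9.12). Note that

(9.41)  $H_\nu(r^\nu f)(\lambda) = \lambda^\nu\,\widetilde H_\nu f(\lambda),$

and that $M_\nu f(r) = r^\nu f(r)$ has the property that

(9.42)  $M_\nu : L^2(\mathbb{R}^+, r^{2\nu+1}\,dr) \longrightarrow L^2(\mathbb{R}^+, r\,dr)$ is unitary.

Thus Proposition 9.6 yields Proposition 9.1.

Another proof is sketched in the exercises. An elaboration of Hankel's original
proof is given on pp. 456-464 of [Wat].

In view of (9.23) and (9.41), we have

(9.43)
$$\begin{aligned}
H_\nu(r^{-\alpha}L_\mu g) &= \int_0^\infty L_\mu\big(r^{\alpha}J_\nu(\lambda r)\big)\,g\,r^m\,dr \\
&= -\lambda^2\int_0^\infty g\,r^{\alpha}J_\nu(\lambda r)\,r^m\,dr \\
&= -\lambda^2\,H_\nu(r^{-\alpha}g).
\end{aligned}$$

Now from (9.5)-(9.13), it follows that the map $\mathcal{H}$ given by

(9.44)  $\mathcal{H}g = \big(H_{\nu_0}(r^{-\alpha}g_0),\ H_{\nu_1}(r^{-\alpha}g_1),\ \dots\big)$

provides an isometry of $L^2(C(N))$ onto $L^2(\mathbb{R}^+, \lambda\,d\lambda, \ell^2)$, such that $\Delta$ is carried
into multiplication by $-\lambda^2$. Thus (9.44) provides a spectral representation of $\Delta$.
Consequently, for well-behaved functions $f$, we have

(9.45)  $f(-\Delta)g(r,x) = \sum_j \int_0^\infty f(\lambda^2)\,r^{\alpha}J_{\nu_j}(\lambda r)\int_0^\infty s^{1-\alpha}J_{\nu_j}(\lambda s)\,g_j(s)\,ds\ \lambda\,d\lambda\ \varphi_j(x).$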
Now we can interpret (9.45) in the following fashion. Define the operator $\nu$
on $N$ by

(9.46)  $\nu = (-\Delta_N + \alpha^2)^{1/2}.$

Thus $\nu\varphi_j = \nu_j\varphi_j$. Identifying operators with their distributional kernels, we
can describe the kernel of $f(-\Delta)$ as a function on $\mathbb{R}^+ \times \mathbb{R}^+$ taking values in
operators on $N$, by the formula

(9.47)  $f(-\Delta) = (r_1r_2)^{\alpha}\int_0^\infty f(\lambda^2)\,J_\nu(\lambda r_1)J_\nu(\lambda r_2)\,\lambda\,d\lambda = K(r_1, r_2, \nu),$

since the volume element on $C(N)$ is $r^m\,dr\,dS(x)$ if the $m$-dimensional area
element of $N$ is $dS(x)$.

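At this point it is convenient to have in hand some calculations of Hankel
transforms, including some examples of the form (9.47). We establish some
here; many more can be found in [Wat]. Generalizing (9.31), we can compute
$\int_0^\infty e^{-br^2}J_\nu(\lambda r)\,r^{\mu+1}\,dr$ in a similar fashion, replacing the integral in (9.27) by

(9.48)  $\int_0^\infty r^{2k+\mu+\nu+1}e^{-br^2}\,dr = \frac12\,b^{-k-\mu/2-\nu/2-1}\,\Gamma\Big(\frac{\mu}{2}+\frac{\nu}{2}+k+1\Big).$

We get

(9.49)  $\int_0^\infty e^{-br^2}J_\nu(\lambda r)\,r^{\mu+1}\,dr = \lambda^\nu 2^{-\nu-1}b^{-\mu/2-\nu/2-1}\sum_{k=0}^{\infty}\frac{\Gamma(\frac{\mu}{2}+\frac{\nu}{2}+k+1)}{k!\,\Gamma(k+\nu+1)}\Big(-\frac{\lambda^2}{4b}\Big)^k.$

We can express the infinite series in terms of the confluent hypergeometric function, introduced in §7. A formula equivalent to (7.35) is

(9.50)  ${}_1F_1(a;b;z) = \frac{\Gamma(b)}{\Gamma(a)}\sum_{k=0}^{\infty}\frac{\Gamma(a+k)}{\Gamma(b+k)}\,\frac{z^k}{k!},$

since $(a)_k = a(a+1)\cdots(a+k-1) = \Gamma(a+k)/\Gamma(a)$. We obtain, for $\mathrm{Re}\, b > 0$,
$\mathrm{Re}(\mu+\nu) > -2$,

(9.51)  $\int_0^\infty e^{-br^2}J_\nu(\lambda r)\,r^{\mu+1}\,dr = \lambda^\nu 2^{-\nu-1}b^{-\mu/2-\nu/2-1}\,\frac{\Gamma(\frac{\mu}{2}+\frac{\nu}{2}+1)}{\Gamma(\nu+1)}\ {}_1F_1\Big(\frac{\mu}{2}+\frac{\nu}{2}+1;\ \nu+1;\ -\frac{\lambda^2}{4b}\Big).$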
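We can apply a similar attack when $e^{-br^2}$ is replaced by $e^{-br}$, obtaining

(9.52)  $\int_0^\infty e^{-br}J_\nu(\lambda r)\,r^{\mu-1}\,dr = \Big(\frac{\lambda}{2}\Big)^{\nu}b^{-\mu-\nu}\sum_{k=0}^{\infty}\frac{\Gamma(\mu+\nu+2k)}{k!\,\Gamma(\nu+k+1)}\Big(-\frac{\lambda^2}{4b^2}\Big)^k,$

at least provided $\mathrm{Re}\, b > |\lambda|$, $\nu \ge 0$, and $\mu + \nu > 0$; here we use

(9.53)  $\int_0^\infty e^{-br}\,r^{2k+\mu+\nu-1}\,dr = b^{-2k-\mu-\nu}\,\Gamma(\mu+\nu+2k).$

The duplication formula for the gamma function (see (A.22) of Chap. 3) implies

(9.54)  $\Gamma(2k+\mu+\nu) = \pi^{-1/2}\,2^{2k+\mu+\nu-1}\,\Gamma\Big(\frac{\mu}{2}+\frac{\nu}{2}+k\Big)\Gamma\Big(\frac{\mu}{2}+\frac{\nu}{2}+k+\frac12\Big),$

so the right side of (9.52) can be rewritten as

(9.55)  $\pi^{-1/2}\lambda^\nu 2^{\mu-1}b^{-\mu-\nu}\sum_{k=0}^{\infty}\frac{\Gamma(\frac{\mu}{2}+\frac{\nu}{2}+k)\,\Gamma(\frac{\mu}{2}+\frac{\nu}{2}+\frac12+k)}{k!\,\Gamma(\nu+1+k)}\Big(-\frac{\lambda^2}{b^2}\Big)^k.$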

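This infinite series can be expressed in terms of the hypergeometric function,
defined by

(9.56)  ${}_2F_1(a_1,a_2;b;z) = \sum_{k=0}^{\infty}\frac{(a_1)_k(a_2)_k}{(b)_k}\,\frac{z^k}{k!} = \frac{\Gamma(b)}{\Gamma(a_1)\Gamma(a_2)}\sum_{k=0}^{\infty}\frac{\Gamma(a_1+k)\Gamma(a_2+k)}{\Gamma(b+k)}\,\frac{z^k}{k!},$

for $a_1, a_2 \notin \{0,-1,-2,\dots\}$, $|z| < 1$. If we put the sum in (9.55) into this form,
and use the duplication formula, to write

$\Gamma(a_1)\Gamma(a_2) = \Gamma\Big(\frac{\mu}{2}+\frac{\nu}{2}\Big)\Gamma\Big(\frac{\mu}{2}+\frac{\nu}{2}+\frac12\Big) = \pi^{1/2}\,2^{-\mu-\nu+1}\,\Gamma(\mu+\nu),$

we obtain

(9.57)  $\int_0^\infty e^{-br}J_\nu(\lambda r)\,r^{\mu-1}\,dr = \Big(\frac{\lambda}{2}\Big)^{\nu}b^{-\mu-\nu}\,\frac{\Gamma(\mu+\nu)}{\Gamma(\nu+1)}\ {}_2F_1\Big(\frac{\mu}{2}+\frac{\nu}{2},\ \frac{\mu}{2}+\frac{\nu}{2}+\frac12;\ \nu+1;\ -\frac{\lambda^2}{b^2}\Big).$

This identity, established so far for $|\lambda| < \mathrm{Re}\, b$ (and $\nu \ge 0$, $\mu+\nu > 0$), continues
analytically to $\lambda$ in a complex neighborhood of $(0, \infty)$.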

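To evaluate the integral (9.47) with $f(\lambda^2) = e^{-t\lambda^2}$, we can use the power
series (9.10) for $J_\nu(\lambda r_1)$ and for $J_\nu(\lambda r_2)$ and integrate the resulting double series
term by term using (9.48). We get

(9.58)  $\int_0^\infty e^{-t\lambda^2}J_\nu(r_1\lambda)J_\nu(r_2\lambda)\,\lambda\,d\lambda = \frac{1}{2t}\Big(\frac{r_1r_2}{4t}\Big)^{\nu}\sum_{j,k\ge 0}\frac{\Gamma(\nu+j+k+1)}{\Gamma(\nu+j+1)\Gamma(\nu+k+1)}\,\frac{1}{j!\,k!}\Big(-\frac{r_1^2}{4t}\Big)^j\Big(-\frac{r_2^2}{4t}\Big)^k,$

for any $t, r_1, r_2 > 0$, $\nu \ge 0$. This can be written in terms of the modified Bessel
function $I_\nu(z)$, given by

(9.59)  $I_\nu(z) = \Big(\frac{z}{2}\Big)^{\nu}\sum_{k=0}^{\infty}\frac{1}{k!\,\Gamma(\nu+k+1)}\Big(\frac{z}{2}\Big)^{2k}.$

One obtains the following, known as the Weber identity.

Proposition 9.7. For $t, r_1, r_2 > 0$,

(9.60)  $\int_0^\infty e^{-t\lambda^2}J_\nu(r_1\lambda)J_\nu(r_2\lambda)\,\lambda\,d\lambda = \frac{1}{2t}\,e^{-(r_1^2+r_2^2)/4t}\,I_\nu\Big(\frac{r_1r_2}{2t}\Big).$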
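
Before turning to the proof, the Weber identity (9.60) can be spot-checked numerically. The following sketch is only an illustration under stated assumptions: both $J_\nu$ and $I_\nu$ are evaluated from their power series (the series for $I_\nu$ is the standard one, with all signs positive), the integral is truncated where the Gaussian factor is negligible, and a composite Simpson rule is used. The names `bessel_series` and `simpson` and the sample values of $t$, $r_1$, $r_2$, $\nu$ are hypothetical.

```python
import math

def bessel_series(nu, z, sign, terms=60):
    """Power series for J_nu(z) (sign = -1) or I_nu(z) (sign = +1), moderate z >= 0."""
    return sum(sign ** k / (math.factorial(k) * math.gamma(k + nu + 1))
               * (z / 2) ** (2 * k + nu) for k in range(terms))

def simpson(f, a, b, n=2000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    total += 4 * sum(f(a + (2 * i - 1) * h) for i in range(1, n // 2 + 1))
    total += 2 * sum(f(a + 2 * i * h) for i in range(1, n // 2))
    return total * h / 3

t, r1, r2, nu = 1.0, 1.0, 2.0, 1.0          # illustrative sample values

# Left side of (9.60), truncated at lambda = 8 where exp(-t*lambda^2) is tiny.
weber_lhs = simpson(
    lambda lam: math.exp(-t * lam * lam)
    * bessel_series(nu, r1 * lam, -1) * bessel_series(nu, r2 * lam, -1) * lam,
    0.0, 8.0)

# Right side of (9.60).
weber_rhs = (math.exp(-(r1 ** 2 + r2 ** 2) / (4 * t))
             * bessel_series(nu, r1 * r2 / (2 * t), +1) / (2 * t))

print(weber_lhs, weber_rhs)
```

A check of this kind does not replace the proof that follows, but it guards against sign and normalization slips in the statement.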


Proof. The left side of (9.60) is given by (9.58). Meanwhile, by (9.59), the right
side of (9.60) is equal to $(1/2t)(r_1r_2/4t)^{\nu}$ times

(9.61)  $\sum_{\ell,m\ge 0}\frac{1}{\ell!\,m!}\Big(-\frac{r_1^2}{4t}\Big)^{\ell}\Big(-\frac{r_2^2}{4t}\Big)^{m}\ \sum_{n=0}^{\infty}\frac{1}{n!\,\Gamma(\nu+n+1)}\Big(\frac{r_1^2r_2^2}{16t^2}\Big)^{n}.$

If we set $y_i = -r_i^2/4t$, we see that the asserted identity (9.60) is equivalent to the
identity

(9.62)  $\sum_{j,k\ge 0}\frac{\Gamma(\nu+j+k+1)}{\Gamma(\nu+j+1)\Gamma(\nu+k+1)}\,\frac{1}{j!\,k!}\,y_1^j y_2^k = \sum_{\ell,m,n\ge 0}\frac{1}{\ell!\,m!\,n!\,\Gamma(\nu+n+1)}\,y_1^{\ell+n}y_2^{m+n}.$

We compare coefficients of $y_1^j y_2^k$ in (9.62). Since both sides of (9.62) are symmetric in $(y_1, y_2)$, it suffices to treat the case

(9.63)  $j \le k,$

which we assume henceforth. Then we take $\ell + n = j$, $m + n = k$ and sum over
$n \in \{0,\dots,j\}$, to see that (9.62) is equivalent to the validity of

(9.64)  $\sum_{n=0}^{j}\frac{1}{(j-n)!\,(k-n)!\,n!\,\Gamma(\nu+n+1)} = \frac{\Gamma(\nu+j+k+1)}{\Gamma(\nu+j+1)\Gamma(\nu+k+1)}\,\frac{1}{j!\,k!},$

whenever $0 \le j \le k$. Using the identity

$\Gamma(\nu+j+1) = (\nu+j)\cdots(\nu+n+1)\,\Gamma(\nu+n+1)$

and its analogues for the other $\Gamma$-factors in (9.64), we see that (9.64) is equivalent
to the validity of

(9.65)  $\sum_{n=0}^{j}\frac{j!\,k!}{(j-n)!\,(k-n)!\,n!}\,(\nu+j)\cdots(\nu+n+1) = (\nu+j+k)\cdots(\nu+k+1),$

for $0 \le j \le k$. Note that the right side of (9.65) is a polynomial of degree $j$ in $\nu$,
and the general term on the left side of (9.65) is a polynomial of degree $j - n$ in $\nu$.
In order to establish (9.65), it is convenient to set

(9.66)  $\mu = \nu + j$

and consider the associated polynomial identity in $\mu$. With


(9.67)  $p_0(\mu) = 1,\quad p_1(\mu) = \mu,\quad p_2(\mu) = \mu(\mu-1),\ \dots,\quad p_j(\mu) = \mu(\mu-1)\cdots(\mu-j+1),$

we see that $\{p_0, p_1, \dots, p_j\}$ is a basis of the space $\mathcal{P}_j$ of polynomials of degree $\le j$
in $\mu$, and our task is to write

(9.68)  $p_j(\mu+k) = (\mu+k)(\mu+k-1)\cdots(\mu+k-j+1)$

as a linear combination of $p_0, \dots, p_j$. To this end, define

(9.69)  $T : \mathcal{P}_j \longrightarrow \mathcal{P}_j, \qquad Tp(\mu) = p(\mu+1).$

By explicit calculation,

(9.70)  $p_1(\mu+1) = p_1(\mu) + p_0(\mu), \qquad p_2(\mu+1) = (\mu+1)\mu = \mu(\mu-1) + 2\mu = p_2(\mu) + 2p_1(\mu),$

and an inductive argument gives

(9.71)  $Tp_i = p_i + i\,p_{i-1}.$

By convention we set $p_i = 0$ for $i < 0$. Our goal is to compute $T^kp_j$. Note that

(9.72)  $T = I + N, \qquad Np_i = i\,p_{i-1},$

and

(9.73)  $T^k = \sum_{n=0}^{j}\binom{k}{n}N^n,$

if $j \le k$. By (9.72),

(9.74)  $N^np_i = i(i-1)\cdots(i-n+1)\,p_{i-n},$

so we have

(9.75)  $T^kp_j = \sum_{n=0}^{j}\binom{k}{n}\,j(j-1)\cdots(j-n+1)\,p_{j-n} = \sum_{n=0}^{j}\frac{k!}{(k-n)!\,n!}\,\frac{j!}{(j-n)!}\,p_{j-n}.$

This verifies (9.65) and completes the proof of (9.60).
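
Since (9.65) is a polynomial identity in $\nu$ with integer coefficients, it can be tested exactly in rational arithmetic. The sketch below does this for a few sample values; the function names `lhs_965` and `rhs_965` are illustrative, and the test range is an arbitrary choice. It uses the rewriting $j!\,k!/((j-n)!\,(k-n)!\,n!) = \binom{k}{n}\,j!/(j-n)!$, the same coefficient that appears in (9.75).

```python
from fractions import Fraction
from math import comb, factorial, prod

def lhs_965(nu, j, k):
    """Left side of (9.65): sum over n of C(k, n) * j!/(j-n)! * (nu+j)...(nu+n+1)."""
    total = 0
    for n in range(j + 1):
        coeff = comb(k, n) * (factorial(j) // factorial(j - n))
        total += coeff * prod(nu + i for i in range(n + 1, j + 1))
    return total

def rhs_965(nu, j, k):
    """Right side of (9.65): (nu+j+k)(nu+j+k-1)...(nu+k+1)."""
    return prod(nu + i for i in range(k + 1, j + k + 1))

samples = [0, 1, 5, Fraction(1, 2), Fraction(7, 3)]
identity_holds = all(lhs_965(nu, j, k) == rhs_965(nu, j, k)
                     for nu in samples
                     for k in range(7)
                     for j in range(k + 1))
print(identity_holds)
```

Exact arithmetic is used so that equality can be tested with `==`; a polynomial identity of degree $j$ in $\nu$ is of course only confirmed, not proved, by finitely many sample points unless more than $j$ of them are used for each pair $(j, k)$.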


Similarly we can evaluate (9.47) with $f(\lambda^2) = e^{-t\lambda}/\lambda$, as an infinite series,
using (9.53) to integrate each term of the double series. We get

(9.76)  $\int_0^\infty e^{-t\lambda}J_\nu(r_1\lambda)J_\nu(r_2\lambda)\,d\lambda = \frac{1}{t}\Big(\frac{r_1r_2}{4t^2}\Big)^{\nu}\sum_{j,k\ge 0}\frac{\Gamma(2\nu+2j+2k+1)}{\Gamma(\nu+j+1)\Gamma(\nu+k+1)}\,\frac{1}{j!\,k!}\Big(-\frac{r_1^2}{4t^2}\Big)^j\Big(-\frac{r_2^2}{4t^2}\Big)^k,$

provided $t > r_j > 0$. It is possible to express this integral in terms of the Legendre
function $Q_{\nu-1/2}(z)$.


Proposition 9.8. One has, for all $y, r_1, r_2 > 0$, $\nu \ge 0$,

(9.77)  $\int_0^\infty e^{-y\lambda}J_\nu(r_1\lambda)J_\nu(r_2\lambda)\,d\lambda = \frac{1}{\pi}\,(r_1r_2)^{-1/2}\,Q_{\nu-1/2}\Big(\frac{r_1^2+r_2^2+y^2}{2r_1r_2}\Big).$

The Legendre functions $P_{\nu-1/2}(z)$ and $Q_{\nu-1/2}(z)$ are solutions to

(9.78)  $\frac{d}{dz}\Big[(1-z^2)\frac{d}{dz}u(z)\Big] + \Big(\nu^2 - \frac14\Big)u(z) = 0.$

Compare with (4.52). Extending (4.41), we can set

(9.79)  $P_{\nu-1/2}(\cos\theta) = \frac{2}{\pi}\int_0^{\theta}(2\cos s - 2\cos\theta)^{-1/2}\cos\nu s\,ds,$

and $Q_{\nu-1/2}(z)$ can be defined by the integral formula

(9.80)  $Q_{\nu-1/2}(\cosh\eta) = \int_{\eta}^{\infty}(2\cosh s - 2\cosh\eta)^{-1/2}e^{-s\nu}\,ds.$

The identity (9.77) is known as the Lipschitz-Hankel integral formula.


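Proof of Proposition 9.8. We derive (9.77) from the Weber identity (9.60).
Recall

(9.81)  $I_\nu(y) = e^{-\pi i\nu/2}J_\nu(iy), \qquad y > 0.$

To work with (9.60), we use the subordination identity

(9.82)  $e^{-y\lambda} = \frac{\lambda}{\sqrt{\pi}}\int_0^\infty e^{-y^2/4t}\,e^{-t\lambda^2}\,t^{-1/2}\,dt;$

cf. Chap. 3, (5.31) for a proof. Plugging this into the left side of (9.77), and using
(9.60), we see that the left side of (9.77) is equal to

(9.83)  $\frac{1}{2\sqrt{\pi}}\int_0^\infty e^{-(r_1^2+r_2^2+y^2)/4t}\,I_\nu\Big(\frac{r_1r_2}{2t}\Big)\,t^{-3/2}\,dt.$

The change of variable $s = r_1r_2/2t$ gives

(9.84)  $\sqrt{\frac{1}{2\pi}}\,(r_1r_2)^{-1/2}\int_0^\infty e^{-s(r_1^2+r_2^2+y^2)/2r_1r_2}\,I_\nu(s)\,s^{-1/2}\,ds.$

Thus the asserted identity (9.77) follows from the identity

(9.85)  $\int_0^\infty e^{-sz}\,I_\nu(s)\,s^{-1/2}\,ds = \sqrt{\frac{2}{\pi}}\,Q_{\nu-1/2}(z), \qquad z > 1.$

As for the validity of (9.85), we mention two identities. Recall from (9.57) that

(9.86)  $\int_0^\infty e^{-zs}J_\nu(\lambda s)\,s^{\mu-1}\,ds = \Big(\frac{\lambda}{2}\Big)^{\nu}z^{-\mu-\nu}\,\frac{\Gamma(\mu+\nu)}{\Gamma(\nu+1)}\ {}_2F_1\Big(\frac{\mu}{2}+\frac{\nu}{2},\ \frac{\mu}{2}+\frac{\nu}{2}+\frac12;\ \nu+1;\ -\frac{\lambda^2}{z^2}\Big).$

Next, there is the classical representation of the Legendre function $Q_{\nu-1/2}(z)$ as
a hypergeometric function:

(9.87)  $Q_{\nu-1/2}(z) = \frac{\Gamma(\frac12)\,\Gamma(\nu+\frac12)}{\Gamma(\nu+1)}\,(2z)^{-\nu-1/2}\ {}_2F_1\Big(\frac{\nu}{2}+\frac14,\ \frac{\nu}{2}+\frac34;\ \nu+1;\ \frac{1}{z^2}\Big);$

cf. [Leb], (7.3.7). If we apply (9.86) with $\lambda = i$, $\mu = 1/2$ (keeping (9.81) in
mind), then (9.85) follows.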


Remark: Formulas (9.77) and (9.60) are proven in the opposite order in [W]. 

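By analytic continuation, we can treat $f(\lambda^2) = e^{-\varepsilon\lambda}\lambda^{-1}\sin\lambda t$ for any $\varepsilon > 0$.
We apply this to (9.47). Letting $\varepsilon \searrow 0$, we get for the fundamental solution to the
wave equation:

(9.88)
$$\begin{aligned}
(-\Delta)^{-1/2}\sin t(-\Delta)^{1/2} &= -\lim_{\varepsilon\searrow 0}\,(r_1r_2)^{\alpha}\ \mathrm{Im}\int_0^\infty e^{-(\varepsilon+it)\lambda}J_\nu(\lambda r_1)J_\nu(\lambda r_2)\,d\lambda \\
&= -\frac{1}{\pi}\,(r_1r_2)^{\alpha-1/2}\lim_{\varepsilon\searrow 0}\ \mathrm{Im}\ Q_{\nu-1/2}\Big(\frac{r_1^2+r_2^2+(\varepsilon+it)^2}{2r_1r_2}\Big).
\end{aligned}$$

Using the integral formula (9.80), where the path of integration is a suitable path
from $\eta$ to $+\infty$ in the complex plane, one obtains the following alternative integral
representation of $(-\Delta)^{-1/2}\sin t(-\Delta)^{1/2}$. The Schwartz kernel is equal to

(9.89)  $0$, if $t < |r_1 - r_2|$,

(9.90)  $\frac{1}{\pi}\,(r_1r_2)^{\alpha}\int_0^{\beta_1}\big[t^2 - (r_1^2+r_2^2-2r_1r_2\cos s)\big]^{-1/2}\cos\nu s\,ds,$

if $|r_1 - r_2| < t < r_1 + r_2$, and

(9.91)  $\frac{1}{\pi}\,(r_1r_2)^{\alpha}\cos\pi\nu\int_{\beta_2}^{\infty}\big[r_1^2+r_2^2+2r_1r_2\cosh s - t^2\big]^{-1/2}e^{-s\nu}\,ds,$

if $t > r_1 + r_2$, where

(9.92)  $\beta_1 = \cos^{-1}\Big(\frac{r_1^2+r_2^2-t^2}{2r_1r_2}\Big), \qquad \beta_2 = \cosh^{-1}\Big(\frac{t^2-r_1^2-r_2^2}{2r_1r_2}\Big).$

Recall that $\alpha = -(m-1)/2$, where $m = \dim N$.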

We next show how formulas (9.89)-(9.91) lead to an analysis of the classical
problem of diffraction of waves by a slit along the positive $x$-axis in the plane $\mathbb{R}^2$.
In fact, if waves propagate in $\mathbb{R}^2$ with this ray removed, on which Dirichlet boundary conditions are placed, we can regard the space as the cone over an interval of
length $2\pi$, with Dirichlet boundary conditions at the endpoints. By the method of
images, it suffices to analyze the case of the cone over a circle of circumference
$4\pi$ (twice the circumference of the standard unit circle). Thus $C(N)$ is a double
cover of $\mathbb{R}^2 \setminus 0$ in this case. We divide up the spacetime into regions I, II, and
III, respectively, as described by (9.89), (9.90), and (9.91). Region I contains only
points on $C(N)$ too far away from the source point to be influenced by time $t$;
that the fundamental solution is 0 here is consistent with finite propagation speed.


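Since the circle has dimension $m = 1$, we see that

(9.93)  $\nu = (-\Delta_N)^{1/2} = \Big(-\frac{d^2}{d\theta^2}\Big)^{1/2}$

in this case if $\theta \in \mathbb{R}/(4\pi\mathbb{Z})$ is the parameter on the circle of circumference $4\pi$.
On the line, we have

(9.94)  $\cos s\nu\ \delta_{\theta_1}(\theta_2) = \frac12\big[\delta(\theta_1-\theta_2+s) + \delta(\theta_1-\theta_2-s)\big].$

To get $\cos s\nu$ on $\mathbb{R}/(4\pi\mathbb{Z})$, we simply make (9.94) periodic by the method of
images. Consequently, from (9.90), the wave kernel $(-\Delta)^{-1/2}\sin t(-\Delta)^{1/2}$ is
equal to

(9.95)  $(2\pi)^{-1}\big[t^2 - r_1^2 - r_2^2 + 2r_1r_2\cos(\theta_1-\theta_2)\big]^{-1/2}$ if $|\theta_1-\theta_2| \le \pi$, $\qquad 0$ if $|\theta_1-\theta_2| > \pi$,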


in region II. Of course, for $|\theta_1 - \theta_2| < \pi$ this coincides with the free space
fundamental solution, so (9.95) also follows by finite propagation speed.

We turn now to an analysis of region III. In order to make this analysis, it is
convenient to make simultaneous use both of (9.91) and of another formula for
the wave kernel in this region, obtained by choosing another path from $\eta$ to $\infty$
in the integral representation (9.80). The formula (9.91) is obtained by taking a
horizontal line segment; see Fig. 9.1.

If instead we take the path indicated in Fig. 9.2, we obtain the following
formula for $(-\Delta)^{-1/2}\sin t(-\Delta)^{1/2}$ in region III:

(9.96)  $\frac{1}{\pi}\,(r_1r_2)^{\alpha}\Big\{\int_0^{\pi}(t^2 - r_1^2 - r_2^2 + 2r_1r_2\cos s)^{-1/2}\cos s\nu\,ds - \sin\pi\nu\int_0^{\beta_2}(t^2 - r_1^2 - r_2^2 - 2r_1r_2\cosh s)^{-1/2}e^{-s\nu}\,ds\Big\}.$


The operator $\nu$ on $\mathbb{R}/(4\pi\mathbb{Z})$ given by (9.93) has spectrum consisting of

(9.97)  $\mathrm{Spec}\ \nu = \{0, \tfrac12, 1, \tfrac32, 2, \dots\},$

all the eigenvalues except for 0 occurring with multiplicity 2. The formula (9.91)
shows the contribution coming from the half-integers in Spec $\nu$ vanishes, since
$\cos\frac12\pi n = 0$ if $n$ is an odd integer. Thus we can use formula (9.96) and compose
with the projection onto the sum of the eigenspaces of $\nu$ with integer spectrum.
This projection is given by

(9.98)  $P = \cos^2\pi\nu$

on $\mathbb{R}/(4\pi\mathbb{Z})$. Since $\sin\pi n = 0$, in the case $N = \mathbb{R}/(4\pi\mathbb{Z})$ we can rewrite
(9.96) as

(9.99)  $\pi^{-1}\int_0^{\pi}(t^2 - r_1^2 - r_2^2 + 2r_1r_2\cos s)^{-1/2}\,P\cos s\nu\,ds.$

FIGURE 9.1 Integration Contour

FIGURE 9.2 Alternative Contour


In view of the formulas (9.94) and (9.96), we have 


(9.100)  $P\cos s\nu\ \delta_{\theta_1}(\theta_2) = \frac14\big[\delta(\theta_1-\theta_2+s) + \delta(\theta_1-\theta_2-s) + \delta(\theta_1-\theta_2+2\pi+s) + \delta(\theta_1-\theta_2+2\pi-s)\big] \mod 4\pi.$

Thus, in region III, we have for the wave kernel $(-\Delta)^{-1/2}\sin t(-\Delta)^{1/2}$ the
formula

(9.101)  $(4\pi)^{-1}\big(t^2 - r_1^2 - r_2^2 + 2r_1r_2\cos(\theta_1-\theta_2)\big)^{-1/2}.$

Thus, in region III, the value of the wave kernel at points $(r_1,\theta_1)$, $(r_2,\theta_2)$ of
the double cover of $\mathbb{R}^2 \setminus 0$ is given by half the value of the wave kernel on $\mathbb{R}^2$
at the image points. The jump in behavior from (9.95) to (9.101) gives rise to a
diffracted wave.

We depict the singularities of the fundamental solution to the wave equation 
for R? minus a slit in Figs. 9.3 and 9.4. In Fig. 9.3 we have the situation |t| < 1, 
where no diffraction has occurred, and region III is empty. In Fig. 9.4 we have a 
typical situation for |t| > 1, with the diffracted wave labeled by a “D.” 

This diffraction problem was first treated by Sommerfeld [Som] and was the 
first diffraction problem to be rigorously analyzed. For other approaches to the 
diffraction problem presented above, see [BSU] and [Stk].

Generally, the solution (9.89)-(9.91) contains a diffracted wave on the boundary between regions II and III. In Fig. 9.5 we illustrate the diffraction of a
wave by a wedge; here $N$ is an interval of length $\ell < 2\pi$. We now want
to provide, for general $N$, a description of the behavior of the distribution
$v = (-\Delta)^{-1/2}\sin t(-\Delta)^{1/2}\,\delta_{(r_2,x_2)}$ near this diffracted wave, that is, a study of
the limiting behavior as $r_1 \searrow t - r_2$ and as $r_1 \nearrow t - r_2$.

We begin with region II. From (9.90), we have v equal to 


(9.102)  $\frac12\,(r_1r_2)^{\alpha-1/2}\,P_{\nu-1/2}(\cos\beta_1)\,\delta_{x_2}$, in region II,

where $P_{\nu-1/2}$ is the Legendre function defined by (9.79) and $\beta_1$ is given by
(9.92). Note that as $r_1 \searrow t - r_2$, $\beta_1 \nearrow \pi$.

FIGURE 9.3 Reflected Wave Front

FIGURE 9.4 Reflected and Diffracted Wave Fronts

FIGURE 9.5 Diffraction by a Wedge

To analyze (9.102), replace $s$ by $\pi - s$ in (9.79), and, with $\delta_1 = \pi - \beta_1$, write

(9.103)  $\frac{\pi}{2}\,P_{\nu-1/2}(\cos\beta_1) = \cos\pi\nu\int_{\delta_1}^{\pi}(2\cos\delta_1 - 2\cos s)^{-1/2}\cos s\nu\,ds + \sin\pi\nu\int_{\delta_1}^{\pi}(2\cos\delta_1 - 2\cos s)^{-1/2}\sin s\nu\,ds.$

As $\delta_1 \searrow 0$, the second term on the right tends in the limit to

(9.104)  $\sin\pi\nu\int_0^{\pi}\frac{\sin s\nu}{2\sin\frac12 s}\,ds.$

Write the first term on the right side of (9.103) as

(9.105)  $\cos\pi\nu\int_{\delta_1}^{\pi}(2\cos\delta_1 - 2\cos s)^{-1/2}(\cos s\nu - 1)\,ds + \cos\pi\nu\int_{\delta_1}^{\pi}(2\cos\delta_1 - 2\cos s)^{-1/2}\,ds.$

As $\delta_1 \searrow 0$, the first term here tends in the limit to

(9.106)  $\cos\pi\nu\int_0^{\pi}\frac{\cos s\nu - 1}{2\sin\frac12 s}\,ds.$

The second integral in (9.105) is a scalar, independent of v, and it is easily seen 
to have a logarithmic singularity. More precisely, 


i (2cos 0; — 2cos s)7? ds 
61 


(9.107) Ae 
he (log = S~ Ajo) + SBS, Ay = 1 
1° j=0 j=l 


Consequently, one derives the following. 
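Proposition 9.9. Fix $(r_2, x_2)$ and t. Then, as $r_1 \searrow t - r_2$,

(9.108)    $(-\Delta)^{-1/2} \sin t(-\Delta)^{1/2}\, \delta_{(r_2,x_2)} = \frac{1}{\pi}(r_1 r_2)^{\alpha-1/2} \Bigl\{ \log\frac{2}{\delta_1}\, \cos\pi\nu\, \delta_{x_2} + \int_0^{\pi} \frac{\cos s\nu - \cos\pi\nu}{2\cos\frac{1}{2}s}\, ds\; \delta_{x_2} + R_1 \delta_{x_2} \Bigr\},$

where, for s > (m + 1)/2,

(9.109)    $\|R_1 \delta_{x_2}\|_{\mathcal{D}^{-s-1}} \le C \delta_1 \log\frac{1}{\delta_1}$ as $\delta_1 \searrow 0$.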

The following result analyzes the second term on the right in (9.108). 


Proposition 9.10. We have 


7 1 il 
| (2 cos 5°) (cos sy — cos Tv) ds 
0 


(9.110) K 
; T 
= cos TY ae ae + 9 sinty + Sx(v), 


where $S_K(\nu) : \mathcal{D}^s \to \mathcal{D}^{s+2K}$, for all s.


The spaces $\mathcal{D}^s$ are spaces of generalized functions on N, introduced in Chap. 5, 
Appendix A. 
We turn to the analysis of v in region III. Using (9.91), we can write v as

(9.111)    $\frac{1}{\pi}(r_1 r_2)^{\alpha-1/2} \cos\pi\nu\; Q_{\nu-1/2}(\cosh\beta_2)\,\delta_{x_2}$, in region III,

where $Q_{\nu-1/2}$ is the Legendre function given by (9.80) and $\beta_2$ is given by (9.92). It is more convenient to use (9.96) instead; this yields for v the formula

(9.112)    $\frac{1}{\pi}(r_1 r_2)^{\alpha-1/2} \Bigl\{ \int_0^{\pi} (2\cosh\beta_2 + 2\cos s)^{-1/2} \cos s\nu\, ds - \sin\pi\nu \int_0^{\beta_2} (2\cosh\beta_2 - 2\cosh s)^{-1/2} e^{-s\nu}\, ds \Bigr\}\, \delta_{x_2}.$

Note that as $r_1 \nearrow t - r_2$, $\beta_2 \searrow 0$.
The first integral in (9.112) has an analysis similar to that arising in (9.103); first replace s by $\pi - s$ to rewrite the integral as

(9.113)    $\cos\pi\nu \int_0^{\pi} (2\cosh\beta_2 - 2\cos s)^{-1/2} \cos s\nu\, ds + \sin\pi\nu \int_0^{\pi} (2\cosh\beta_2 - 2\cos s)^{-1/2} \sin s\nu\, ds.$


As $\beta_2 \searrow 0$, the second term in (9.113) tends to the limit (9.104), and the first term also has an analysis similar to (9.105)-(9.107), with (9.107) replaced by

(9.114)    $\int_0^{\pi} (2\cosh\beta_2 - 2\cos s)^{-1/2}\, ds \sim \Bigl(\log\frac{2}{\beta_2}\Bigr) \sum_{j\ge 0} A_j' \beta_2^j + \sum_{j\ge 1} B_j' \beta_2^j, \quad A_0' = 1.$

It is the second term in (9.112) that leads to the jump across $r_1 = t - r_2$, hence to the diffracted wave. We have

(9.115)    $\int_0^{\beta_2} (2\cosh\beta_2 - 2\cosh s)^{-1/2} e^{-s\nu}\, ds \sim \int_0^{\beta_2} \frac{ds}{\sqrt{\beta_2^2 - s^2}} = \frac{\pi}{2}.$


Thus we have the following: 


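Proposition 9.11. For $r_1 \nearrow t - r_2$,

(9.116)    $(-\Delta)^{-1/2} \sin t(-\Delta)^{1/2}\, \delta_{(r_2,x_2)} = \frac{1}{\pi}(r_1 r_2)^{\alpha-1/2} \Bigl\{ \log\frac{2}{\beta_2}\, \cos\pi\nu\, \delta_{x_2} + \int_0^{\pi} \frac{\cos s\nu - \cos\pi\nu}{2\cos\frac{1}{2}s}\, ds\; \delta_{x_2} - \frac{\pi}{2} \sin\pi\nu\, \delta_{x_2} + R_2 \delta_{x_2} \Bigr\},$

where, for s > (m + 1)/2,

(9.117)    $\|R_2 \delta_{x_2}\|_{\mathcal{D}^{-s-1}} \le C \beta_2 \log\frac{1}{\beta_2}$ as $\beta_2 \searrow 0$.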
2 
Note that (9.116) differs from (9.108) by the term $\pi^{-1}(r_1 r_2)^{\alpha-1/2}$ times

(9.118)    $\frac{\pi}{2} \sin\pi\nu\, \delta_{x_2}.$

This contribution represents a jump in the fundamental solution across the diffracted wave D. There is also the logarithmic singularity, $(r_1 r_2)^{\alpha-1/2}$ times

(9.119)    $\frac{1}{\pi} \log\frac{2}{\delta}\, \cos\pi\nu\, \delta_{x_2},$

where $\delta = \delta_1$ in (9.108) and $\delta = \beta_2$ in (9.116). In the special case where N is an interval [0, L], so dim C(N) = 2, $\cos\pi\nu\, \delta_{x_2}$ is a sum of two delta functions. Thus its manifestation in such a case is subtle.

We also remark that if N is a subdomain of the unit sphere $S^{2k}$ (of even dimension), then $\cos\pi\nu\, \delta_{x_2}$ vanishes on the set $N \setminus N_0$, where

(9.120)    $N_0 = \{x_1 \in N : \text{for some } y \in \partial N,\ \operatorname{dist}(x_2, y) + \operatorname{dist}(y, x_1) \le \pi\}.$

Thus the log blow-up disappears on $N \setminus N_0$. This follows from the fact that $\cos\pi\nu_0 = 0$, where $\nu_0$ is the operator (9.46) on $S^{2k}$, together with a finite propagation speed argument.

While Propositions 9.9-9.11 contain substantial information about the nature 
of the diffracted wave, this information can be sharpened in a number of respects. 
A much more detailed analysis is given in [CT]. 




There has been work on the propagation of singularities on manifolds with 
conic singularities, pursued in [MW], involving metric tensors of the form $dr^2 + r^2 g$,
where g has a more general form, for example possibly depending smoothly 
on r. The base N is assumed to be a compact manifold without boundary, but 
within that category the metrics considered are more general than those treated in 
this section. The analytical tools are quite different from those used here. Further 
work, on propagation of singularities and diffraction effects, involving manifolds 
with “edge type” and “corner type” singularities, is given in [MVW,MVW2]. 


Exercises 


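1. Using (7.36) and (7.80), work out the asymptotic behavior of ${}_1F_1(a; b; -z)$ as $z \to +\infty$, given $b,\ b - a \notin \{0, -1, -2, \dots\}$. Deduce from (9.51) that whenever $\nu > 0$, $s \in \mathbb{R}$,

(9.121)    $\lim_{b \searrow 0} \int_0^{\infty} e^{-br^2} J_\nu(r)\, r^{-is}\, dr = 2^{-is}\, \frac{\Gamma(\frac{1}{2}(\nu + 1 - is))}{\Gamma(\frac{1}{2}(\nu + 1 + is))}.$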
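2. Define operators

(9.122)    $M_r f(r) = r f(r), \quad J f(r) = f(r^{-1}).$

Show that

(9.123)    $M_r : L^2(\mathbb{R}^+, r\, dr) \to L^2(\mathbb{R}^+, r^{-1}\, dr), \quad J : L^2(\mathbb{R}^+, r^{-1}\, dr) \to L^2(\mathbb{R}^+, r^{-1}\, dr)$

are unitary. Show that

(9.124)    $H_\nu^\# = J M_r H_\nu M_r^{-1}$

is given by

(9.125)    $H_\nu^\# f(\lambda) = (f \star \ell_\nu)(\lambda),$

where $\star$ denotes the natural convolution on $\mathbb{R}^+$, with Haar measure $r^{-1}\, dr$:

(9.126)    $(f \star g)(\lambda) = \int_0^{\infty} f(r)\, g(r^{-1}\lambda)\, r^{-1}\, dr,$

and

(9.127)    $\ell_\nu(r) = r^{-1} J_\nu(r^{-1}).$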


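3. Consider the Mellin transform:

(9.128)    $M^\# f(s) = \int_0^{\infty} f(r)\, r^{is-1}\, dr.$

As shown in (A.17)-(A.20) of Chap. 3, we have

(9.129)    $(2\pi)^{-1/2} M^\# : L^2(\mathbb{R}^+, r^{-1}\, dr) \to L^2(\mathbb{R}, ds)$, unitary.

Show that

(9.130)    $M^\#(f \star g)(s) = M^\# f(s) \cdot M^\# g(s),$

and deduce that

(9.131)    $M^\# H_\nu^\# f(s) = \Psi(s)\, M^\# f(s),$

where

(9.132)    $\Psi(s) = \int_0^{\infty} \ell_\nu(r)\, r^{is-1}\, dr = \int_0^{\infty} J_\nu(r)\, r^{-is}\, dr = 2^{-is}\, \frac{\Gamma(\frac{1}{2}(\nu + 1 - is))}{\Gamma(\frac{1}{2}(\nu + 1 + is))}.$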


4. From (9.126)-(9.132), give another proof of the unitarity (9.13) of $H_\nu$. Using symmetry, deduce that spec $H_\nu = \{-1, 1\}$, and hence deduce again the inversion formula (9.14). 

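5. Verify the asymptotic expansion (9.107). (Hint: Write $2\cos\delta - 2\cos s = (s^2 - \delta^2) F(s, \delta)$ with F smooth and positive, F(0,0) = 1. Then, with $G(s, \delta) = F(s, \delta)^{-1/2}$,

(9.133)    $\int_\delta^{\pi} (2\cos\delta - 2\cos s)^{-1/2}\, ds = \int_\delta^{\pi} G(s, \delta)\, \frac{ds}{\sqrt{s^2 - \delta^2}}.$

Write $G(s, \delta) = g(s) + \delta H(s, \delta)$, g(0) = 1, and verify that (9.133) is equal to $A_1 + A_2$, where

$A_1 = \int_\delta^{\pi} G(s, \delta)\, \frac{ds}{s} = g(0) \log\frac{1}{\delta} + O\bigl(\delta \log\frac{1}{\delta}\bigr),$

$A_2 = \int_\delta^{\pi} G(s, \delta) \Bigl[\frac{1}{\sqrt{s^2 - \delta^2}} - \frac{1}{s}\Bigr] ds = B_2 + O(\delta).$

Show that

$B_2 = g(0) \int_1^{\infty} \Bigl[\frac{1}{\sqrt{t^2 - 1}} - \frac{1}{t}\Bigr] dt + O(\delta) = C_2 + O(\delta),$

with

$C_2 = \int_1^{\infty} \Bigl[\frac{1}{\sqrt{t^2 - 1}} - \frac{1}{t}\Bigr] dt.$

Use the substitution t = cosh u to do this integral and get $C_2 = \log 2$.)
Next, verify the expansion (9.114).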


Exercises on the hypergeometric function 
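1. Show that ${}_2F_1(a_1, a_2; b; z)$, defined by (9.56), satisfies

(9.134)    ${}_2F_1(a_1, a_2; b; z) = \frac{\Gamma(b)}{\Gamma(a_2)\Gamma(b - a_2)} \int_0^1 t^{a_2 - 1} (1 - t)^{b - a_2 - 1} (1 - tz)^{-a_1}\, dt,$

for Re b > Re $a_2$ > 0, |z| < 1. (Hint: Use the beta function identity, (A.23)-(A.24) of Chap. 3, to write

$\frac{(a_2)_k}{(b)_k} = \frac{\Gamma(b)}{\Gamma(a_2)\Gamma(b - a_2)} \int_0^1 t^{a_2 - 1 + k} (1 - t)^{b - a_2 - 1}\, dt, \quad k = 0, 1, 2, \dots,$

and substitute this into (9.39). Then use

$\sum_{k=0}^{\infty} \frac{(a_1)_k}{k!} (zt)^k = (1 - tz)^{-a_1}, \quad 0 \le t \le 1,\ |z| < 1.$)

2. Show that, given Re b > Re $a_2$ > 0, (9.134) analytically continues in z to $z \in \mathbb{C} \setminus [1, \infty)$.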
3. Show that the function (9.134) satisfies the ODE

$z(1 - z) \frac{d^2 u}{dz^2} + \{b - (a_1 + a_2 + 1) z\} \frac{du}{dz} - a_1 a_2 u = 0.$

Note that u(0) = 1, $u'(0) = a_1 a_2 / b$, but zero is a singular point for this ODE. Show that another solution is

$u(z) = z^{1-b}\, {}_2F_1(a_1 - b + 1, a_2 - b + 1; 2 - b; z).$
4. Show that

${}_2F_1(a_1, a_2; b; z) = (1 - z)^{-a_1}\, {}_2F_1(a_1, b - a_2; b; (z - 1)^{-1} z).$

(Hint: Make a change of variable s = 1 - t in (9.134).)
For many other important transformation formulas, see [Leb] or [WW].
5. Show that

${}_1F_1(a; b; z) = \lim_{c \to \infty} {}_2F_1(a, c; b; c^{-1} z).$

We mention the generalized hypergeometric function, defined by

${}_pF_q(a; b; z) = \sum_{k=0}^{\infty} \frac{(a)_k}{(b)_k} \frac{z^k}{k!},$

where $p \le q + 1$, $a = (a_1, \dots, a_p)$, $b = (b_1, \dots, b_q)$, $b_j \in \mathbb{C} \setminus \{0, -1, -2, \dots\}$, |z| < 1, and

$(a)_k = (a_1)_k \cdots (a_p)_k, \quad (b)_k = (b_1)_k \cdots (b_q)_k,$

and where, as before, for $c \in \mathbb{C}$, $(c)_k = c(c + 1) \cdots (c + k - 1)$. For more on this class of functions, see [Bai].
6. The Legendre function $Q_{\nu-1/2}(z)$ satisfies the identity (9.87), for $\nu > 0$, |z| > 1, and $|\operatorname{Arg} z| < \pi$; cf. (7.3.7) of [Leb]. Take $z = (r_1^2 + r_2^2 + t^2)/2 r_1 r_2$, and compare the resulting power series for the right side of (9.77) with the power series in (9.76).


10. Quantum adiabatic limit and parallel transport 


Let H(t), $t \in I = [0, 1]$, be a smooth family of self-adjoint operators on a Hilbert space $\mathcal{H}$, with a smoothly varying family of eigenspaces E(t), of constant dimension k, with eigenvalues $\lambda(t)$. Assume the spectrum of H(t) on $E(t)^\perp$ is bounded away from $\lambda(t)$. Making a trivial adjustment, we will assume $\lambda(t) = 0$. For simplicity, assume all the operators H(t) have the same domain.

Consider the solution operator S(t, s) to the "Schrödinger equation"

(10.1)    $\frac{\partial u}{\partial t} = i H(t) u,$

taking u(s) to u(t). Now slow down the rate of change of H, and consider the solution operators $S_n(t, s)$ to

(10.2)    $\frac{\partial u}{\partial t} = i H\bigl(\frac{t}{n}\bigr) u.$

The claim is that, if $u(0) = u_0 \in E(0)$, then $S_n(nt, 0) u_0 \to w(t) \in E(t)$ as $n \to \infty$, and there is a simple geometrical description of w(t). This limit is called the quantum adiabatic limit of the Schrödinger equation (10.1).

This result was established by T. Kato [Kat], and rediscovered by M. Berry 
[Berl], the geometrical content brought out by B. Simon [Si4]. Berry worked 
with the case dim E(t) = 1, but that restriction is not necessary. The more gen- 
eral case was already dealt with by Kato; that the argument can be so extended 
was rediscovered by F. Wilczek and A. Zee [WZ]. A collection of subsequent 
literature can be found in [SW]. 

The geometrical structure is the following. The family E(t) gives a vector bundle $E \to I$, a subbundle of the product bundle $I \times \mathcal{H}$. If P(t) denotes the orthogonal projection of $\mathcal{H}$ on E(t), then, as seen in §1 of Appendix C, Connections and Curvature, we have a covariant derivative on sections of E defined by

(10.3)    $\nabla_T w(t) = P(t) D_T w(t),$

where $D_T$ is the standard componentwise derivative of $\mathcal{H}$-valued functions. Parallel transport is defined by $\nabla_T w = 0$. One can see, using

(10.4)    $P'P + PP' = P',$

that parallel transport is also characterized by

(10.5)    $\frac{dw}{dt} = P'(t) w$, if $w(0) \in E(0)$.

See Exercise 3 in §1 of Appendix C, or Exercise 1 below. The claim is that the adiabatic limit w(t) mentioned above exists and is equal to the solution to (10.5), with $w(0) = u_0 \in E(0)$.

To prove this, we will rescale the t-variable in the equation (10.2). Thus we compare the solutions u and w to

(10.6)    $\frac{\partial u}{\partial t} = i n H(t) u, \qquad \frac{\partial w}{\partial t} = P'(t) w,$

given $u(0) = w(0) = u_0 \in E(0)$. Then we know $w(t) \in E(t)$ for each t, so H(t)w(t) = 0. Also, by (10.4), $P'(t) w = (I - P) P' w = (I - P) w'$. Let v(t) = u(t) - w(t). Then v(0) = 0 and

(10.7)    $\frac{\partial v}{\partial t} - i n H(t) v = -P^\perp(t) w' = f(t).$

Here we have used (10.4) and set $P^\perp = I - P$. Thus $f(t) \perp E(t)$ for each t. Now let $S_n(t, s)$ denote the solution operator to $\partial u / \partial t = i n H(t) u$. Then the solution v(t) to (10.7) is given by

(10.8)    $v(t) = \int_0^t S_n(t, s) f(s)\, ds.$

We know that

(10.9)    $\frac{\partial}{\partial s} S_n(t, s) = -i n\, S_n(t, s) H(s).$

Now the spectral hypothesis on H(t) implies we can set

(10.10)    $f(t) = H(t) g(t),$

and then (10.9) and an integration by parts give

(10.11)    $v(t) = -\frac{1}{in} \int_0^t \Bigl[\frac{\partial}{\partial s} S_n(t, s)\Bigr] g(s)\, ds = -\frac{1}{in} \bigl[g(t) - S_n(t, 0) g(0)\bigr] + \frac{1}{in} \int_0^t S_n(t, s) g'(s)\, ds.$

Hence

(10.12)    $\|v(t)\|_{\mathcal{H}} \le \frac{K}{n}.$

This proves the following quantum adiabatic theorem. 


Proposition 10.1. Take H(t) and E(t), $t \in [0, 1]$, as described in the first paragraph of this section, and let $S_n(t, s)$ denote the solution operator to the first equation in (10.6). Then

(10.13)    $S_n(t, 0) u_0 \to w(t)$ as $n \to \infty$ if $u_0 \in E(0)$,

where w(t) is obtained from $u_0$ by parallel translation, as in the second equation in (10.6).
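The estimate (10.12) can be observed numerically in a toy case. The following is a minimal sketch under assumptions that are not in the text: $\mathcal{H} = \mathbb{C}^2$, $H(t) = I - P(t)$ with P(t) the projection onto the real unit vector e(t) = (cos(πt/2), sin(πt/2)), and a frozen-coefficient midpoint step as the integrator. Because the family is real, P(t)e'(t) = 0, so e(t) itself is the parallel transport of e(0). The function names are hypothetical.

```python
import cmath
import math

def eigvec(t):
    """Unit vector spanning E(t); the angle pi*t/2 is an arbitrary illustrative choice."""
    theta = 0.5 * math.pi * t
    return (math.cos(theta), math.sin(theta))

def max_adiabatic_error(n, steps_per_unit=100):
    """Integrate du/dt = i n H(t) u with H(t) = I - e(t) e(t)^T and u(0) = e(0).

    Returns the maximum over the time grid of |u(t) - e(t)|, where e(t) is the
    parallel transport of e(0) for this real family.
    """
    steps = max(2000, steps_per_unit * n)
    h = 1.0 / steps
    # H is a projection, so exp(i n h H) = I + (exp(i n h) - 1) * (I - P).
    phase = cmath.exp(1j * n * h) - 1.0
    u = [complex(x) for x in eigvec(0.0)]
    worst = 0.0
    for k in range(steps):
        e = eigvec((k + 0.5) * h)              # freeze H at the midpoint of the step
        c = e[0] * u[0] + e[1] * u[1]          # component of u along e
        perp = (u[0] - c * e[0], u[1] - c * e[1])
        u = [u[0] + phase * perp[0], u[1] + phase * perp[1]]
        w = eigvec((k + 1) * h)
        worst = max(worst, math.hypot(abs(u[0] - w[0]), abs(u[1] - w[1])))
    return worst
```

Comparing `max_adiabatic_error(20)` with `max_adiabatic_error(200)` should show the error shrinking roughly like 1/n, in line with (10.12); the constant K depends on the chosen family.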


Note that if H(1) = H(0), so E(1) = E(0), then w(1) will typically differ from $u_0$ by the application of a unitary operator on E(0), since the connection (10.3) on E is typically not flat. In the case Berry considered, where $\dim_{\mathbb{C}} E(0) = 1$, this could only be multiplication by $e^{i\theta}$, $\theta$ being called Berry's phase.
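A phase of this kind can be computed in a standard toy model, which is an assumption made here for illustration and does not appear in the text: E(φ) is the line in $\mathbb{C}^2$ spanned by (cos(θ/2), e^{iφ} sin(θ/2)), with φ running once around [0, 2π]. The sketch below discretizes the condition P w' = 0 from (10.3) by repeatedly projecting onto the next fiber; the function names are hypothetical. With this orientation and inner-product convention the phase comes out as −π(1 − cos θ), that is, minus half the solid angle enclosed by the loop.

```python
import cmath
import math

def state(phi, theta):
    """Unit vector spanning E(phi) for a loop at polar angle theta (illustrative model)."""
    return (complex(math.cos(theta / 2)), cmath.exp(1j * phi) * math.sin(theta / 2))

def holonomy_phase(theta, steps=20000):
    """Transport w around phi: 0 -> 2*pi by projecting onto E(phi_k) at each step,
    a discretization of P(t) w'(t) = 0, and return arg <w(0), w(end)>."""
    w0 = state(0.0, theta)
    w = w0
    for k in range(1, steps + 1):
        e = state(2 * math.pi * k / steps, theta)
        c = e[0].conjugate() * w[0] + e[1].conjugate() * w[1]   # <e, w>
        c /= abs(c)                                             # renormalize so |w| = 1
        w = (c * e[0], c * e[1])
    overlap = w0[0].conjugate() * w[0] + w0[1].conjugate() * w[1]
    return cmath.phase(overlap)
```

For θ = π/3 the loop encloses solid angle π, so the computed phase should be close to −π/2.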

If we assume that H(t) has purely discrete spectrum $\lambda_1(t) < \lambda_2(t) < \cdots$, of constant multiplicity, and no crossings, then we can analyze the behavior of solutions to $\partial u / \partial t = i n H(t) u$ via superpositions. Let $P_j(t)$ denote the orthogonal projection of $\mathcal{H}$ onto the $\lambda_j(t)$-eigenspace of H(t). Write $u(0) = \sum_j u_j(0)$, with $u_j(0) \in R(P_j(0)) = E_j(0)$. Let $\tau_j(t)$ denote parallel translation in the vector bundle $R(P_j(t)) = E_j(t)$, i.e., the solution operator to

(10.14)    $\frac{dw_j}{dt} = P_j'(t) w_j, \quad w_j(0) \in E_j(0).$

Then we can compare $w_j(t) = \tau_j(t) u_j(0)$ to the solution $u_j(t)$ to

(10.15)    $\frac{\partial u_j}{\partial t} = i n H(t) u_j, \quad u_j(0) = P_j(0) u(0),$

i.e., to $S_n(t, 0) P_j(0) u(0)$. By (10.11), we have

(10.16)    $u_j(t) = e^{in\Lambda_j(t)} \Bigl\{ w_j(t) - \frac{1}{in} \bigl[g_j(t) - S_{jn}(t, 0) g_j(0)\bigr] + \frac{1}{in} \int_0^t S_{jn}(t, s) g_j'(s)\, ds \Bigr\},$

where $S_{jn}(t, s)$ denotes the solution operator for

$\frac{\partial u}{\partial t} = i n \bigl(H(t) - \lambda_j(t)\bigr) u,$

and $g_j(t)$ is obtained in a fashion similar to g(t) in (10.11). Also, we have set

$\Lambda_j(t) = \int_0^t \lambda_j(s)\, ds.$

If all the spectral gaps are bounded below:

(10.17)    $\lambda_{j+1}(t) - \lambda_j(t) \ge C > 0,$

then we can decompose any $u(0) \in \mathcal{H}$ and sum over j, obtaining

(10.18)    $S_n(t, 0) u(0) = \sum_j e^{in\Lambda_j(t)} \tau_j(t) u_j(0) - \frac{1}{in} \sum_j e^{in\Lambda_j(t)} \Bigl[ g_j(t) - S_{jn}(t, 0) g_j(0) - \int_0^t S_{jn}(t, s) g_j'(s)\, ds \Bigr].$

Similar approximations, for $S_{jn}(t, 0)$ and $S_{jn}(t, s)$, can be made on the right, and this process iterated, to obtain higher order asymptotic expansions.
Of course, the hypothesis (10.17) is rather restrictive. If one weakens it to 


(10.19) |Aj4r(t) — Ag(4)| > CGH) > 0, 


then one can iterate (10.18), at least a finite number of times, provided u(0) 
belongs to the domain of some power of H(0). 


Exercise 
1. Let w(t) solve (10.5) (so P(0)w(0) = w(0)). Verify that P(t)w(t) = w(t) for all $t \in [0, 1]$. Then verify that P(t)w'(t) = 0.
Hint. Set $\xi(t) = (I - P(t)) w(t)$, and use (10.4) to show that

$\xi'(t) = -P'(t) \xi(t), \quad \xi(0) = 0,$

hence $\xi(t) \equiv 0$.


11. A quantum ergodic theorem 
Let M be a compact Riemannian manifold, with Laplace operator $\Delta$, and set $\Lambda = \sqrt{-\Delta}$. Let $\{u_k : k \in \mathbb{N}\}$ be an orthonormal basis of $L^2(M)$ consisting of eigenfunctions,

(11.1)    $\Lambda u_k = \lambda_k u_k, \quad \lambda_k \nearrow +\infty.$

Let $X = S^*M \subset T^*M$ denote the unit cosphere bundle. The symplectic form on $T^*M$ induces a volume form dS on X, which we normalize to have unit volume. The Hamiltonian vector field associated to the principal symbol $\sigma(\Lambda)$ generates a smooth flow $G_t$ on X, preserving the volume form dS. Let P be the orthogonal projection of $L^2(X, dS)$ onto

(11.2)    $V = \{b \in L^2(X, dS) : b \circ G_t = b\}.$

Our first goal in this section is to prove the following quantum ergodic theorem.


Theorem 11.1. There is a set $\mathcal{N} \subset \mathbb{N}$, of density zero, with the following property. Let $A \in OPS^0(M)$ and assume $a = \sigma(A)|_X$ satisfies

(11.3)    $Pa = \bar{a} = \int_X a\, dS.$

Then

(11.4)    $\lim_{k \to \infty,\ k \notin \mathcal{N}} (A u_k, u_k) = \bar{a}.$

If the flow $\{G_t\}$ is ergodic on X, then V consists of constants, and (11.3) holds for all a. The ergodic case has been studied in [Shn], [CdV], [HMR], [Zel], [Don], and other works. Theorem 11.1 applies to cases where such ergodicity does not hold. Such a formulation was mentioned in [ST], and pursued in [T4] and [T5]. Our proof of Theorem 11.1 is adapted from [CdV]. We start with the following Weyl law.


Proposition 11.2. Let $A \in OPS^0(M)$, with principal symbol $\sigma(A) \in C^\infty(T^*M \setminus 0)$, homogeneous of degree 0. Then

(11.5)    $\lim_{N \to \infty} \frac{1}{N} \sum_{k=1}^{N} (A u_k, u_k) = \int_X \sigma(A)\, dS.$

Proof. We have

(11.6)    $\sum_k e^{-t\lambda_k} (A u_k, u_k) = \sum_k (A u_k, e^{-t\Lambda} u_k) = \operatorname{Tr} A e^{-t\Lambda}.$

A parametrix construction for $e^{-t\Lambda}$ yields

(11.7)

The result (11.5) follows from Karamata's Tauberian theorem.


To proceed, we recall from Chapter 7 the notion of a "quantization," which is a continuous linear map

(11.8)    $\mathrm{op} : C^\infty(X) \to \mathcal{L}(L^2(M)),$

with the property that, given $a \in C^\infty(X)$, A = op(a) is a pseudodifferential operator with principal symbol a. We also impose the condition op(1) = I, the identity map. The existence of quantizations follows via local coordinate charts and partitions of unity from pseudodifferential operator calculus on Euclidean space. Each one gives rise to a sequence of elements $\mu_k \in \mathcal{D}'(X)$, defined by

(11.9)    $\langle a, \mu_k \rangle = (\mathrm{op}(a) u_k, u_k).$

Since $(K u_k, u_k) \to 0$ whenever K is compact on $L^2(M)$, it follows that if $\widetilde{\mathrm{op}}$ is another quantization, yielding $\tilde{\mu}_k \in \mathcal{D}'(X)$, then for each $a \in C^\infty(X)$, $\langle a, \mu_k \rangle - \langle a, \tilde{\mu}_k \rangle \to 0$ as $k \to \infty$. Basic examples are "Kohn-Nirenberg" quantizations:

(11.10)    $\mathrm{op}_{KN} : C^\infty(X) \to OPS^0(M) \subset OPS^0_{1,0}(M).$


For our analysis, it is useful to bring in the existence of a "Friedrichs quantization,"

(11.11)    $\mathrm{op}_F : C^\infty(X) \to \mathcal{L}(L^2(M)),$

having the property

(11.12)    $a \ge 0 \Longrightarrow \mathrm{op}_F(a) \ge 0, \quad \mathrm{op}_F(1) = I.$

This is constructed on the Euclidean space level from $\mathrm{op}_{KN}(a)$ via "Friedrichs symmetrization," in Chapter 7. Results given there imply

(11.13)    $a \in C^\infty(X) \Longrightarrow \mathrm{op}_F(a) - \mathrm{op}_{KN}(a) \in OPS^{-\delta}_{\rho, 1-\rho}(M),$

for some $\delta > 0$, $\rho \in (1/2, 1)$. From (11.12) it follows that

(11.14)    $\|\mathrm{op}_F(a)\|_{\mathcal{L}(L^2)} \le \sup |a|,$

and hence (11.11) has a unique continuous linear extension to

(11.15)    $\mathrm{op}_F : C(X) \to \mathcal{L}(L^2(M)),$

with (11.12) holding for all $a \in C(X)$.
From here on we take a Friedrichs quantization, and set $A = \mathrm{op}_F(a)$. In such a case, the distributions $\mu_k \in \mathcal{D}'(X)$ defined by (11.9) satisfy

(11.16)    $a \ge 0 \Longrightarrow \langle a, \mu_k \rangle \ge 0.$

Also $\langle 1, \mu_k \rangle = (u_k, u_k) = 1$. Consequently, for each k,

(11.17)    $\mu_k$ is a probability measure on X.

We write

(11.18)    $(A u_k, u_k) = \int_X a\, d\mu_k.$

x 


The next step is to establish the following consequence of Egorov's theorem.

Proposition 11.3. Given $a \in C^\infty(X)$, we have

(11.19)    $\int_X (a - a \circ G_t)\, d\mu_k \to 0$, as $k \to \infty$,

locally uniformly in t.

Proof. Set $A = \mathrm{op}_F(a)$ and

(11.20)    $A_t = e^{it\Lambda} A e^{-it\Lambda}.$

By Egorov's theorem (and (11.13))

(11.21)    $A_t - \mathrm{op}_F(a \circ G_t) \in \mathcal{L}(L^2(M), H^\delta(M)),$

and this holds locally uniformly in t, so

(11.22)    $\int_X a \circ G_t\, d\mu_k - (A e^{-it\Lambda} u_k, e^{-it\Lambda} u_k) \to 0$, as $k \to \infty$,

locally uniformly in t. But

(11.23)    $(A e^{-it\Lambda} u_k, e^{-it\Lambda} u_k) = (A u_k, u_k),$

so (11.22) leads to (11.19).

Proof of Theorem 11.1.

We will actually establish a more general result, in which we take

(11.24)    $A = \mathrm{op}_F(a), \quad a \in C(X),$

and make the hypothesis (11.3), i.e., $Pa = \bar{a}$, where P is the orthogonal projection of $L^2(X, dS)$ onto the space V, given by (11.2). Note that Proposition 11.3 readily extends to $a \in C(X)$.
To proceed, given $a \in C(X)$, set

(11.25)    $a_T = \frac{1}{T} \int_0^T a \circ G_t\, dt, \quad \bar{a} = \int_X a\, dS.$

Then von Neumann's mean ergodic theorem (established in an appendix to this chapter) implies that, as $T \to \infty$,

(11.26)    $a_T \to Pa$ in $L^2$-norm.

Under the hypothesis $Pa = \bar{a}$, we then have

(11.27)    $\int_X |a_T - \bar{a}|\, dS \to 0$ as $T \to \infty$.

Thus, for $\varepsilon \in (0, 1]$, there exists $T_\varepsilon < \infty$ such that

(11.28)    $T \ge T_\varepsilon \Longrightarrow \int_X |a_T - \bar{a}|\, dS \le \varepsilon.$

Now, Proposition 11.3 gives, for all $a \in C(X)$, $T < \infty$,

(11.29)    $\int_X (a_T - a)\, d\mu_k \to 0,$

as $k \to \infty$, hence

(11.30)    $\int_X (a - \bar{a})\, d\mu_k - \int_X (a_T - \bar{a})\, d\mu_k \to 0,$

as $k \to \infty$. Furthermore, Proposition 11.2 implies


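(11.31)    $\lim_{N \to \infty} \frac{1}{N} \sum_{k=1}^{N} \int_X b\, d\mu_k = \int_X b\, dS,$

for each $b \in C^\infty(X)$, and then (11.17) gives this result for all $b \in C(X)$. Taking $b = |a_T - \bar{a}|$ gives

(11.32)    $\lim_{N \to \infty} \frac{1}{N} \sum_{k=1}^{N} \int_X |a_T - \bar{a}|\, d\mu_k = \int_X |a_T - \bar{a}|\, dS,$

for each $T < \infty$. Comparison with (11.28) gives

(11.33)    $T \ge T_\varepsilon \Longrightarrow \lim_{N \to \infty} \frac{1}{N} \sum_{k=1}^{N} \int_X |a_T - \bar{a}|\, d\mu_k \le \varepsilon,$

if a satisfies (11.3).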
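It follows from (11.33) that, if $a \in C(X)$ satisfies (11.3), there exists a set $\mathcal{N}_\varepsilon(a) \subset \mathbb{N}$, of density zero, such that

(11.34)    $T \ge T_\varepsilon \Longrightarrow \limsup_{k \notin \mathcal{N}_\varepsilon(a),\ k \to \infty} \int_X |a_T - \bar{a}|\, d\mu_k \le 2\varepsilon.$

Hence, by (11.30), for all $\varepsilon > 0$,

(11.35)    $\limsup_{k \notin \mathcal{N}_\varepsilon(a),\ k \to \infty} \Bigl| \bar{a} - \int_X a\, d\mu_k \Bigr| \le 2\varepsilon.$

Now we can produce

(11.36)    $\mathcal{N}(a) \subset \mathbb{N}$, of density zero,

such that, for all $\ell \in \mathbb{N}$,

(11.37)    $\mathcal{N}_{2^{-\ell}}(a) \setminus \mathcal{N}(a)$ is finite.

Then (11.35) gives, for all $\varepsilon \in (0, 1]$,

(11.38)    $\limsup_{k \notin \mathcal{N}(a),\ k \to \infty} \Bigl| \bar{a} - \int_X a\, d\mu_k \Bigr| \le 2\varepsilon,$

so, if $a \in C(X)$ satisfies (11.3),

(11.39)    $\lim_{k \notin \mathcal{N}(a),\ k \to \infty} \Bigl| \bar{a} - \int_X a\, d\mu_k \Bigr| = 0.$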
xX 


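To proceed, let

(11.40)    $\mathcal{I} = \{a \in C(X) : Pa = \bar{a}\},$

which is a closed, linear subspace of C(X) (equal to C(X) if $\{G_t\}$ is ergodic). We can take a countable set $\{a_\nu\}$, dense in $\mathcal{I}$, and produce

(11.41)    $\mathcal{N} \subset \mathbb{N}$, of density zero,

such that, for all $\nu$,

(11.42)    $\mathcal{N}(a_\nu) \setminus \mathcal{N}$ is finite.

Then

(11.43)    $\lim_{k \notin \mathcal{N},\ k \to \infty} \Bigl| \bar{a} - \int_X a\, d\mu_k \Bigr| = 0,$

whenever $a = a_\nu$, and hence, by a limiting argument, using (11.17), for all $a \in \mathcal{I}$. This proves Theorem 11.1. In fact, it proves the following more general result.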

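Proposition 11.4. There is a set $\mathcal{N} \subset \mathbb{N}$, of density zero, with the following property. Take $a \in C(X)$ satisfying $Pa = \bar{a}$, and set $A = \mathrm{op}_F(a)$. Then

(11.44)    $\lim_{k \to \infty,\ k \notin \mathcal{N}} (A u_k, u_k) = \bar{a}.$

Here is a further generalization. Replace (11.3) by

(11.45)    $a,\ Pa \in C(X).$

Then

(11.46)    $b = a - Pa \Longrightarrow b \in C(X), \quad Pb = 0,$

and Proposition 11.4 yields

(11.47)    $\lim_{k \to \infty,\ k \notin \mathcal{N}} (\mathrm{op}_F(b) u_k, u_k) = 0.$

If we set

(11.48)    $A = \mathrm{op}_F(a), \quad A_P = \mathrm{op}_F(Pa),$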

then $\mathrm{op}_F(b) = A - A_P$, and we have the following quantum ergodic theorem.

Proposition 11.5. Assume $a,\ Pa \in C(X)$, and define A, $A_P$ as in (11.48). Then

(11.49)    $\lim_{k \to \infty,\ k \notin \mathcal{N}} \bigl\{ (A u_k, u_k) - (A_P u_k, u_k) \bigr\} = 0.$

As mentioned, work on quantum ergodic theorems started with the case that $\{G_t\}$ acts ergodically on X, or equivalently that $\mathcal{I} = C(X)$, where $\mathcal{I}$ is defined by (11.40). The conclusion that (11.4) then holds for all $A \in OPS^0(M)$ can be called a microlocal quantum ergodic theorem. The version stated in [Shn] was the more restricted local version, that (11.4) holds for all multiplication operators,

(11.50)    $A u(x) = a(x) u(x),$

with $a \in C(M)$. A consequence of Proposition 11.4 is that for this result to hold for all such A, we need not require that $\{G_t\}$ act on X ergodically. It is enough to require that

(11.51)    $\mathcal{C} \subset \mathcal{I}, \quad \mathcal{C} = \{a \in C(X) : a(x, \xi) = a(x)\}.$


An example for which (11.51) holds is $M = \mathbb{T}^n = \mathbb{R}^n / \mathbb{Z}^n$, a flat torus. In such a case, $\{G_t\}$ is clearly not ergodic on $X = \mathbb{T}^n \times S^{n-1}$. See [T4] for further examples illustrating Theorem 11.1 and Proposition 11.5, including cases when $a \in C^\infty(X)$ and $Pa \in C(X)$, but $Pa \notin C^1(X)$.

It is of interest to record the following local quantum ergodic theorem. 


Proposition 11.6. Assume $\{G_t\}$ acts ergodically on X, or more generally that (11.51) holds. Then if

(11.52)    $a \in \mathcal{R}(M),$

i.e., a is bounded and Riemann integrable on M, we have

(11.53)    $\lim_{k \to \infty,\ k \notin \mathcal{N}} \int_M a(x) |u_k(x)|^2\, dV(x) = \bar{a}.$

In particular, given $O \subset M$ open,

(11.54)    $\lim_{k \to \infty,\ k \notin \mathcal{N}} \int_O |u_k(x)|^2\, dV(x) = \frac{V(O)}{V(M)},$

provided $\chi_O \in \mathcal{R}(M)$, i.e., $m^*(\partial O) = 0$.

Proof. Given (11.52), we can find

(11.55)    $f_\nu,\ g_\nu \in C(M), \quad f_\nu \le a \le g_\nu, \quad \int_M (g_\nu - f_\nu)\, dV = \delta_\nu \to 0,$

and apply Proposition 11.4 to multiplication by $f_\nu$ and $g_\nu$. Taking $\nu \to \infty$ yields (11.53).


It is also of interest to see if (11.53) extends to more general $a \in L^\infty(M)$. 
Some results on this are discussed in [T5]. 
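The local statement (11.54) can be seen concretely on the flat torus M = R/Z, where V(M) = 1. The sketch below makes a specific choice that the theorem does not require: the orthonormal eigenfunctions $u_k(x) = \sqrt{2}\cos(2\pi k x)$. It approximates the mass of $|u_k|^2$ on an interval O = (a, b) by a midpoint rule. The function name and the sample count are illustrative assumptions.

```python
import math

def mass_in_interval(k, a, b, samples=20000):
    """Midpoint-rule approximation of the integral over (a, b) of |u_k(x)|^2 dx,
    with u_k(x) = sqrt(2) cos(2 pi k x) on R/Z (an illustrative eigenbasis choice)."""
    h = (b - a) / samples
    total = 0.0
    for j in range(samples):
        x = a + (j + 0.5) * h
        total += 2.0 * math.cos(2 * math.pi * k * x) ** 2
    return total * h
```

For this basis the mass differs from b - a by at most 1/(2πk), so no exceptional set of indices is needed here; for low k the deviation is clearly visible, and it fades as k grows.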


A. Von Neumann’s mean ergodic theorem 


Let $\{U(t) : t \in \mathbb{R}\}$ be a strongly continuous unitary group on a Hilbert space $\mathcal{H}$. A fundamental object of ergodic theory is the study of the "time average"

(A.1)    $A_T f = \frac{1}{T} \int_0^T U(s) f\, ds,$

with emphasis on the limit as $T \to \infty$. To get this, we can set $U(t) = e^{itA}$, where iA generates U(t), with A self-adjoint (possibly unbounded) and use the spectral representation in Theorem 1.7 (or Proposition 1.3 plus a cyclic space decomposition of $\mathcal{H}$) to write

(A.2)    $\frac{1}{T} \int_0^T U(s) f\, ds = G(TA) f,$

where

(A.3)    $G(\tau) = \int_0^1 e^{is\tau}\, ds = \frac{e^{i\tau} - 1}{i\tau}.$

In other words, using a unitary map $W : L^2(\Omega, \mu) \to \mathcal{H}$ as in (1.29),

(A.4)    $W^{-1} A_T W g(x) = G(T a(x)) g(x), \quad g \in L^2(\Omega, \mu).$

Now the dominated convergence theorem implies that, for each $g \in L^2(\Omega, \mu)$,

(A.5)    $G(Ta) g \to \chi g$, in $L^2$-norm,

as $T \to \infty$, where

(A.6)    $\chi(x) = 1$ if a(x) = 0, $\quad \chi(x) = 0$ if $a(x) \ne 0$.

We therefore have the following abstract mean ergodic theorem.

Theorem A.1. If $U(t) = e^{itA}$ is a strongly continuous unitary group on $\mathcal{H}$, and $A_T$ is given by (A.1), then, for all $f \in \mathcal{H}$,

(A.7)    $A_T f \to P f$ in $\mathcal{H}$-norm,

as $T \to \infty$, where P is the orthogonal projection of $\mathcal{H}$ onto

(A.8)    $V = \operatorname{Ker} A = \{f \in \mathcal{H} : U(t) f = f,\ \forall\, t \in \mathbb{R}\}.$

This theorem also has a discrete analogue, where we take a unitary operator U on $\mathcal{H}$ and consider

(A.9)    $A_N f = \frac{1}{N} \sum_{k=0}^{N-1} U^k f.$

We leave the formulation of such a result to the reader.
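The discrete averages (A.9) are easy to watch converge in a finite-dimensional case. In the sketch below, U is taken to be a rotation of $\mathbb{R}^3$ about a unit axis, an illustrative assumption: it is an orthogonal operator on a real space, standing in for a unitary one, and its fixed space is the line through the axis. The averages $A_N f$ should then approach the orthogonal projection of f onto that axis. The function names are hypothetical.

```python
import math

def rotate(f, axis, alpha):
    """Rodrigues formula: rotate the vector f about the unit vector `axis` by angle alpha."""
    dot = sum(a * b for a, b in zip(axis, f))
    cross = (axis[1] * f[2] - axis[2] * f[1],
             axis[2] * f[0] - axis[0] * f[2],
             axis[0] * f[1] - axis[1] * f[0])
    c, s = math.cos(alpha), math.sin(alpha)
    return tuple(f[i] * c + cross[i] * s + axis[i] * dot * (1 - c) for i in range(3))

def ergodic_average(f, axis, alpha, N):
    """Compute A_N f = (1/N) * sum_{k=0}^{N-1} U^k f, with U the rotation above."""
    total = [0.0, 0.0, 0.0]
    g = tuple(f)
    for _ in range(N):
        total = [t + x for t, x in zip(total, g)]
        g = rotate(g, axis, alpha)
    return [t / N for t in total]
```

With axis (1, 1, 1)/√3 and f = (1, 0, 0), the projection of f onto the fixed line is (1/3, 1/3, 1/3), and the error of $A_N f$ decays like 1/N as long as the rotation angle is not a multiple of 2π.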

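In ergodic theory, such unitary groups arise as follows. Let $\mu$ be a probability measure on a $\sigma$-algebra $\mathcal{F}$ of subsets of X. One studies properties of groups of measure-preserving maps $\varphi_t : X \to X$. That is, we assume

(A.10)    $S \in \mathcal{F} \Longrightarrow \varphi_t^{-1}(S) \in \mathcal{F}$ and $\mu(\varphi_t^{-1}(S)) = \mu(S),$

for all $t \in \mathbb{R}$. We assume

(A.11)    $\varphi_0(x) = x, \quad \varphi_{s+t} = \varphi_s \circ \varphi_t, \quad \forall\, s, t \in \mathbb{R}.$

The family $\{\varphi_t\}$ defines a family of linear maps on functions:

(A.12)    $U(t) f(x) = f(\varphi_t(x)).$

From (A.10) we obtain

(A.13)    $\int_X f(\varphi_t(x))\, d\mu = \int_X f(x)\, d\mu,$

for all $f \in L^1(X, \mu)$. Hence

(A.14)    $U(t) : L^p(X, \mu) \to L^p(X, \mu)$

is an isometry for all $t \in \mathbb{R}$, $p \in [1, \infty)$, and we have

(A.15)    $U(0) = I, \quad U(s + t) = U(s) U(t), \quad \forall\, s, t \in \mathbb{R}.$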

We also impose a continuity requirement on $\{\varphi_t\}$, namely that {U(t)} be a strongly continuous group on $L^p(X, \mu)$, for each $p \in [1, \infty)$. For a basic example of when this holds, let X be a compact metric space, $\mathcal{F} = \mathcal{B}$ the $\sigma$-algebra of Borel sets, $\mu$ a probability measure on $\mathcal{B}$, and let $\{\varphi_t\}$ be a group of $\mu$-preserving homeomorphisms of X, with continuous dependence on t.

Theorem A.1 immediately applies to such situations, with $\mathcal{H} = L^2(X, \mu)$. We next establish a more general result.


Proposition A.2. In the setting described above, with U(t) given by (A.12), let P denote the orthogonal projection of $L^2(X, \mu)$ onto

(A.16)    $V = \{f \in L^2(X, \mu) : U(t) f = f,\ \forall\, t \in \mathbb{R}\}.$

Then, for $p \in [1, 2]$, P extends to a continuous projection on $L^p(X, \mu)$, and

(A.17)    $f \in L^p(X, \mu) \Longrightarrow A_T f \to P f$

in $L^p$-norm, as $T \to \infty$.

Proof. Note that the $L^p$-operator norm $\|A_T\|_{\mathcal{L}(L^p)} \le 1$ for each T, and since $\|g\|_{L^p} \le \|g\|_{L^2}$ for $p \in [1, 2]$, we have (A.17) in $L^p$-norm for each f in the dense subspace $L^2(X, \mu)$ of $L^p(X, \mu)$. Now, given $f \in L^p(X, \mu)$, $\varepsilon > 0$, pick $g \in L^2(X, \mu)$ such that $\|f - g\|_{L^p} < \varepsilon$. Then

(A.18)    $\|A_T f - A_S f\|_{L^p} \le \|A_T g - A_S g\|_{L^2} + \|A_T(f - g)\|_{L^p} + \|A_S(f - g)\|_{L^p}.$

Hence

(A.19)    $\limsup_{S, T \to \infty} \|A_T f - A_S f\|_{L^p} \le 2\varepsilon, \quad \forall\, \varepsilon > 0.$

This implies that $(A_T f)$ is Cauchy in $L^p(X, \mu)$, for each $f \in L^p(X, \mu)$. Hence it has a limit; call it Qf. Clearly Qf is linear in f, $\|Qf\|_{L^p} \le \|f\|_{L^p}$, and Qf = Pf for $f \in L^2(X, \mu)$. Hence Q is the unique continuous linear extension of P from $L^2(X, \mu)$ to $L^p(X, \mu)$ (so we change the name to P). Note that $P^2 = P$ on $L^p(X, \mu)$, since it holds on the dense linear subspace $L^2(X, \mu)$. Proposition A.2 is proved.


In the setting of Proposition A.2, note that $P = P^*$, and hence

(A.20)    $P : L^p(X, \mu) \to L^p(X, \mu), \quad \forall\, p \in [1, \infty].$

Also, as indicated above,

(A.21)    $\|A_T\|_{\mathcal{L}(L^p)} \le 1, \quad \forall\, T \in \mathbb{R},\ p \in [1, \infty].$

We aim to extend Proposition A.2 to $p \in (2, \infty)$. The following yields a key ingredient.


Lemma A.3. In the setting of Proposition A.2, given $p < \infty$,

(A.22)    $f \in L^\infty(X, \mu) \Longrightarrow A_T f \to P f$, in $L^p$-norm,

as $T \to \infty$.

Proof. If (A.22) fails, there exists $\varepsilon > 0$ and $T_n \to \infty$ such that $g_n = A_{T_n} f$ satisfies

(A.23)    $\|g_n - P f\|_{L^p} \ge \varepsilon, \quad \forall\, n.$

We know that $g_n \to P f$ in $L^2$-norm, so, passing to a subsequence, we have

(A.24)    $g_n(x) \to P f(x)$, for $\mu$-a.e. $x \in X$.

We also know $\|g_n\|_{L^\infty} \le \|f\|_{L^\infty}$, so the convergence (A.24) occurs boundedly. The dominated convergence theorem then gives

(A.25)    $\|g_n - P f\|_{L^p} \to 0,$

contradicting (A.23).


Now for our extension. 


Proposition A.4. In the setting of Proposition A.2, we have (A.17), with convergence in $L^p$-norm, for each $p \in [1, \infty)$.


Proof. This follows from Lemma A.3, the uniform operator bound (A.21), and the denseness of $L^\infty(X, \mu)$ in $L^p(X, \mu)$, by arguments similar to those arising in the proof of Proposition A.2.


REMARK. The mean ergodic theorem has an extension to non-invertible measure-preserving transformations, which yield non-unitary operators on $L^2(X, \mu)$, and require a different technique. This can be found in various texts, including Chapter 4 of [Haw] and Chapter 14 of [T3], which also treat the Birkhoff ergodic theorem.


B. Wave equations and shifted wave equations 


In Sections 4 and 5 we studied the Laplace operator on spheres $S^n$ and hyperbolic space $\mathbb{H}^n$, respectively. Key identities included neat formulas for the solution to shifted wave equations,

(B.1)    $\frac{\partial^2 u}{\partial t^2} - L u = 0,$

with

(B.2)    $L = \Delta - \alpha_n^2$ on $S^n$, $\quad L = \Delta + \alpha_n^2$ on $\mathbb{H}^n$, $\quad \alpha_n = \frac{n - 1}{2}.$

Here we show how such formulas lead to formulas for the solutions to the regular wave equation $\partial_t^2 u - \Delta u = 0$. Following [MT], we establish an abstract result:


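Proposition B.1. Let $L_0$ be a negative self-adjoint operator, and, for $a \in \mathbb{R}$, set

(B.3)    $L_1 = L_0 + a^2, \quad L_2 = L_0 - a^2.$

Then, for $t \in \mathbb{R}$,

(B.4)    $\cos t\sqrt{-L_1} = \cos t\sqrt{-L_0} + a t \int_0^t \frac{I_1(a\sqrt{t^2 - s^2})}{\sqrt{t^2 - s^2}} \cos s\sqrt{-L_0}\, ds,$

and

(B.5)    $\cos t\sqrt{-L_2} = \cos t\sqrt{-L_0} - a t \int_0^t \frac{J_1(a\sqrt{t^2 - s^2})}{\sqrt{t^2 - s^2}} \cos s\sqrt{-L_0}\, ds.$

Here $J_1$ is the Bessel function

(B.6)    $J_1(z) = \sum_{k=0}^{\infty} \frac{(-1)^k}{k!\,(k+1)!} \Bigl(\frac{z}{2}\Bigr)^{2k+1},$

and

(B.7)    $I_1(y) = -i J_1(iy) = \sum_{k=0}^{\infty} \frac{1}{k!\,(k+1)!} \Bigl(\frac{y}{2}\Bigr)^{2k+1}.$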

To prove (B.4), we borrow a trick from §5 of Chapter 3. It starts with the identity

(B.8)    $e^{tL_1} f = \frac{1}{2\pi} \int_{-\infty}^{\infty} \hat{h}_t(s) \cos s\sqrt{-L_1}\, f\, ds,$

with $h_t(x) = e^{-tx^2}$, hence

(B.9)    $\hat{h}_t(s) = \sqrt{\frac{\pi}{t}}\, e^{-s^2/4t}.$

If we set $4t = 1/\lambda$, we see that $w(t) = (\cos t\sqrt{-L_1}) f$ satisfies

(B.10)    $\int_0^{\infty} w(s) e^{-\lambda s^2}\, ds = \frac{1}{2} \sqrt{\frac{\pi}{\lambda}}\, e^{(1/4\lambda) L_1} f.$

By comparison, $w_0(t) = (\cos t\sqrt{-L_0}) f$ satisfies

(B.11)    $\int_0^{\infty} w_0(s) e^{-\lambda s^2}\, ds = \frac{1}{2} \sqrt{\frac{\pi}{\lambda}}\, e^{(1/4\lambda) L_0} f.$

Consequently,

(B.12)    $\int_0^{\infty} w(s) e^{-\lambda s^2}\, ds = e^{a^2/4\lambda} \int_0^{\infty} w_0(s) e^{-\lambda s^2}\, ds.$

Now the change of variable $\sigma = s^2$ makes (B.12) a relation between Laplace transforms of

(B.13)    $W(\sigma) = \frac{w(\sqrt{\sigma})}{2\sqrt{\sigma}}$ and $W_0(\sigma) = \frac{w_0(\sqrt{\sigma})}{2\sqrt{\sigma}}.$

Hence a representation of $e^{a^2/4\lambda}$ as a Laplace transform

(B.14)    $e^{a^2/4\lambda} = \int_0^{\infty} \varphi(\sigma) e^{-\lambda\sigma}\, d\sigma$

will give rise to a convolution formula:

(B.15)    $W(\sigma) = \int_0^{\sigma} \varphi(\tau) W_0(\sigma - \tau)\, d\tau.$

To identify the function $\varphi(\sigma)$ in (B.14), we start with the following:

(B.16)    $\int_0^{\infty} J_\nu(as)\, s^{\nu+1} e^{-\lambda s^2}\, ds = a^\nu (2\lambda)^{-\nu-1} e^{-a^2/4\lambda}.$

This is one of the most fundamental Hankel transforms and is used to prove the Hankel inversion formula in Proposition 9.1 (cf. (9.31)) of this chapter. We recall the method of proof was to replace $J_\nu(as)$ in (B.16) by its power series expansion:

(B.17)    $J_\nu(as) = \sum_{k=0}^{\infty} \frac{(-1)^k}{k!\, \Gamma(k + \nu + 1)} \Bigl(\frac{as}{2}\Bigr)^{\nu + 2k}$

and integrate term by term. This works for $\nu > -1$. We want to pass to the limit $\nu \searrow -1$. Of course, the integrand converges pointwise to

(B.18)    $J_{-1}(as) e^{-\lambda s^2} = -J_1(as) e^{-\lambda s^2}.$

The integral involving each term in (B.17) converges to the integral of the corresponding term in the power series of $J_{-1}(as)$ (times $e^{-\lambda s^2}$) except for the term k = 0. Rather, due to the fact that, with $\nu = -1 + \varepsilon$, $\beta > 0$,


1? 1 
B.19 — el dg = —__— B* — 1, 0, 
( ) ol. : 7 el(e) ae Ss 
the result we arrive at is

(B.20)    $\int_0^{\infty} J_1(as) e^{-\lambda s^2}\, ds = \frac{1 - e^{-a^2/4\lambda}}{a}.$

Next we can analytically continue in a. We have (cf. (6.55) of Chapter 3),

(B.21)    $I_\nu(r) = e^{-\pi i \nu/2} J_\nu(ir), \quad r > 0,$

and in particular $I_1(r) = -i J_1(ir)$, as in (B.7). Hence, for $a, \lambda > 0$,

(B.22)    $\int_0^{\infty} I_1(as) e^{-\lambda s^2}\, ds = \frac{e^{a^2/4\lambda} - 1}{a}.$

We have achieved (B.14), and hence (B.15), with

(B.23)    $\varphi(\sigma) = \frac{a}{2\sqrt{\sigma}} I_1(a\sqrt{\sigma}) + \delta_0.$

Thus (B.15) yields a relation between $W(\sigma)$ and $W_0(\sigma)$ that, in view of (B.13), gives the desired identity (B.4). The proof of (B.5) follows along similar lines, as the reader can verify.

Here is a specific conclusion for the wave equations on $S^n$ and $\mathbb{H}^n$.


Corollary B.2. For $M = S^n$ or $\mathbb{H}^n$, let $\Delta$ be the Laplace operator, and define L and $\alpha_n$ as in (B.2). Then, on $S^n$,

(B.24)    $\cos t\sqrt{-\Delta} = \cos t\sqrt{-L} + \alpha_n t \int_0^t \frac{I_1(\alpha_n \sqrt{t^2 - s^2})}{\sqrt{t^2 - s^2}} \cos s\sqrt{-L}\, ds,$

and, on $\mathbb{H}^n$,

(B.25)    $\cos t\sqrt{-\Delta} = \cos t\sqrt{-L} - \alpha_n t \int_0^t \frac{J_1(\alpha_n \sqrt{t^2 - s^2})}{\sqrt{t^2 - s^2}} \cos s\sqrt{-L}\, ds.$

Proof. Apply (B.4), (B.5) to $S^n$ and $\mathbb{H}^n$, respectively. In both cases, take $L_0 = L$, $a = \alpha_n$. For $S^n$, we have $L_1 = \Delta$, and for $\mathbb{H}^n$, we have $L_2 = \Delta$.

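For applications to wave equations, it is useful to complement formulas (B.4)-(B.5) with formulas for $(\sin t\sqrt{-L_j})/\sqrt{-L_j}$. For simplicity, we concentrate on the case j = 2. Computations for j = 1 are similar. To start, integrating (B.5) gives

(B.26)    $\frac{\sin t\sqrt{-L_2}}{\sqrt{-L_2}} = \frac{\sin t\sqrt{-L_0}}{\sqrt{-L_0}} + R(t),$

with

(B.27)    $R(t) = -a \int_0^t \int_0^{\tau} \tau\, \frac{J_1(a\sqrt{\tau^2 - s^2})}{\sqrt{\tau^2 - s^2}} \cos s\sqrt{-L_0}\, ds\, d\tau = -a \int_0^t \Bigl( \int_s^t \tau\, \frac{J_1(a\sqrt{\tau^2 - s^2})}{\sqrt{\tau^2 - s^2}}\, d\tau \Bigr) \cos s\sqrt{-L_0}\, ds.$

Using

(B.28)    $\frac{d}{ds} \frac{\sin s\sqrt{-L_0}}{\sqrt{-L_0}} = \cos s\sqrt{-L_0}$

and integrating by parts gives

(B.29)    $R(t) = a \int_0^t \frac{\partial}{\partial s} A(s, t)\, \frac{\sin s\sqrt{-L_0}}{\sqrt{-L_0}}\, ds,$

with

(B.30)    $A(s, t) = \int_s^t \tau\, \frac{J_1(a\sqrt{\tau^2 - s^2})}{\sqrt{\tau^2 - s^2}}\, d\tau.$

If we write

(B.31)    $\varphi(z) = \frac{J_1(a\sqrt{z})}{\sqrt{z}},$

which is an entire function of z, then

(B.32)    $A(s, t) = \int_s^t \tau\, \varphi(\tau^2 - s^2)\, d\tau.$

Using

(B.33)    $\partial_s \varphi(\tau^2 - s^2) = -2s\, \varphi'(\tau^2 - s^2), \quad \partial_\tau \varphi(\tau^2 - s^2) = 2\tau\, \varphi'(\tau^2 - s^2),$

we obtain

(B.34)    $\frac{\partial}{\partial s} A(s, t) = -s\varphi(0) + \int_s^t \tau\, \partial_s \varphi(\tau^2 - s^2)\, d\tau = -s\varphi(0) - s \int_s^t \partial_\tau \varphi(\tau^2 - s^2)\, d\tau = -s\, \varphi(t^2 - s^2).$

Plugging this into (B.29) yields the following conclusion.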


Proposition B.3. In the setting of Proposition B.1, $(\sin t\sqrt{-L_2})/\sqrt{-L_2}$ is given
by (B.26), with

(B.35) $R(t) = -a\int_0^t \frac{s\,J_1(a\sqrt{t^2-s^2})}{\sqrt{t^2-s^2}}\,\frac{\sin s\sqrt{-L_0}}{\sqrt{-L_0}}\,ds.$
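The scalar counterpart of this proposition can be tested numerically. The sketch below assumes that $\sqrt{-L_0}$ is replaced by $\lambda > 0$ and $\sqrt{-L_2}$ by $\sqrt{\lambda^2 + a^2}$ (an assumption about the relation between $L_0$ and $L_2$, since Proposition B.1 is not restated here), and it checks the identity $\sin(t\mu)/\mu = \sin(t\lambda)/\lambda - a\int_0^t s\,\varphi(t^2-s^2)\,\sin(s\lambda)/\lambda\,ds$ with $\mu = \sqrt{\lambda^2+a^2}$ and $\varphi(z) = J_1(a\sqrt{z})/\sqrt{z}$. All function names and sample values are illustrative.

```python
import math

def phi(z, a):
    """phi(z) = J_1(a sqrt z)/sqrt z, summed as a power series in z."""
    term = a / 2.0
    total = term
    q = -(a / 2.0) ** 2 * z
    for m in range(1, 60):
        term *= q / (m * (m + 1))
        total += term
    return total

def simpson(f, a, b, n=2000):
    """Composite Simpson rule with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def shifted_sinc(t, lam, a):
    """sin(t lam)/lam - a * int_0^t s phi(t^2 - s^2) sin(s lam)/lam ds."""
    correction = simpson(
        lambda s: s * phi(t * t - s * s, a) * math.sin(s * lam) / lam, 0.0, t)
    return math.sin(t * lam) / lam - a * correction

t, lam, a = 1.7, 3.0, 1.5               # sample values (assumed)
mu = math.sqrt(lam * lam + a * a)
err = abs(shifted_sinc(t, lam, a) - math.sin(t * mu) / mu)
print(err)
```

Note the factor $s$ in the integrand: it comes from $A(s,t) = -s\,\varphi(t^2 - s^2)$ and is needed for the $t^3$ terms of the two sides to agree.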
Corollary B.4. In the setting of Corollary B.2, on $\mathcal{H}^n$ we have

(B.36) $\frac{\sin t\sqrt{-\Delta}}{\sqrt{-\Delta}} = \frac{\sin t\sqrt{-L}}{\sqrt{-L}} - \alpha_n\int_0^t \frac{s\,J_1(\alpha_n\sqrt{t^2-s^2})}{\sqrt{t^2-s^2}}\,\frac{\sin s\sqrt{-L}}{\sqrt{-L}}\,ds.$

Let us specialize to $n = 3$. It follows from (5.15) of §5 that, on $\mathcal{H}^3$,

(B.37) $\frac{\sin s\sqrt{-L}}{\sqrt{-L}}\,\delta_p(x) = \frac{\delta(s - r)}{4\pi\sinh r},$

for $s > 0$, with $r = \mathrm{dist}(p, x)$, and (B.35) gives, for $t > 0$,

(B.38) $R(t)\delta_p(x) = -\frac{1}{4\pi}\,\frac{r}{\sinh r}\,\frac{J_1(\sqrt{t^2-r^2})}{\sqrt{t^2-r^2}}\,\chi_{\{r < t\}},$

as in (2.20) of [MT]. Such a formula was used in [MT] (and [MT2]) to obtain
dispersive estimates on solutions to the wave equation on $\mathbb{R}\times\mathcal{H}^3$, which then led
to global existence of solutions to certain semilinear wave equations.


References 


[Ad] B. Adams, Algebraic Approach to Simple Quantum Systems, Springer, New York, 1994
[Ag] S. Agmon, Lectures on Elliptic Boundary Problems, Van Nostrand, New York, 1964
[Bai] W. Bailey, Generalized Hypergeometric Series, Hafner, New York, 1972
[Be] P. Berard, On the wave equation on a compact Riemannian manifold without conjugate points, Math. Zeit. 155 (1977), 249-276
[Ber] P. Berard, Variétés Riemanniennes isospectrales non isométriques, Sem. Bourbaki, #705, 1988-89
[BLP] V. Berestetskii, E. Lifsitz, and L. Pitaevskii, Relativistic Quantum Theory, Pergamon, Oxford, 1971
[BGM] M. Berger, P. Gauduchon, and E. Mazet, Le Spectre d'une Variété Riemannienne, LNM #194, Springer, New York, 1971
[Berl] M. Berry, Quantal phase factors accompanying adiabatic changes, Proc. Roy. Soc. London A 392 (1984), 45-57
[BM] H. Bethe and P. Morrison, Elementary Nuclear Theory, J. Wiley, New York, 1956 (Dover, 2006)
[BSU] J. Bowman, T. Senior, and P. Uslenghi, Electromagnetic and Acoustic Scattering by Simple Shapes, North-Holland, Amsterdam, 1969
[Br] R. Brooks, Constructing isospectral manifolds, Am. Math. Monthly 95 (1988), 823-839
[BrC] J. Brossard and R. Carmona, Can one hear the dimension of a fractal? Commun. Math. Phys. 104 (1986), 103-122
[CaG] Y. Canzani and J. Galkowski, Weyl remainder estimates: an application of geodesic beams, Preprint, 2020
[Cha] I. Chavel, Eigenvalues in Riemannian Geometry, Academic, New York, 1984
[Che] J. Cheeger, Spectral geometry of spaces with cone-like singularities, Proc. NAS, USA 76 (1979), 2103-2106
[CT] J. Cheeger and M. Taylor, Diffraction of waves by conical singularities, Comm. Pure Appl. Math. 35 (1982), 275-331, 487-529
[Chn] P. Chernoff, Essential self-adjointness of powers of generators of hyperbolic equations, J. Funct. Anal. 12 (1973), 401-414
[CdV] Y. Colin de Verdière, Ergodicité et fonctions propres du Laplacien, Comm. Math. Phys. 102 (1985), 497-502
[CCTV] L. Colzani, A. Crespi, G. Travaglini, and M. Vignati, Equiconvergence theorems for Fourier-Bessel expansions with applications to the harmonic analysis of radial functions in Euclidean and non-Euclidean spaces, Trans. AMS 338 (1993), 43-55
[Cor] J. Cornwell, Group Theory in Physics, Academic, London, Vols. 1, 2, 1984; Vol. 3, 1989
[CG] W. Cottingham and D. Greenwood, An Introduction to Nuclear Physics, Cambridge Univ. Press, Cambridge, 1986
[CyS] H. Cycon, R. Froese, W. Kirsch, and B. Simon, Schrödinger Operators, Springer, New York, 1987
[DST] E. B. Davies, B. Simon, and M. Taylor, $L^2$ spectral theory of Kleinian groups, J. Funct. Anal. 78 (1988), 116-136
[Don] H. Donnelly, Quantum unique ergodicity, Proc. AMS 131 (2003), 2945-2951
[DG] J. Duistermaat and V. Guillemin, The spectrum of positive elliptic operators and periodic bicharacteristics, Invent. Math. 29 (1975), 49-79
[DS] N. Dunford and J. Schwartz, Linear Operators, I, II, Wiley-Interscience, New York, 1958, 1963
[Eng] M. Englefield, Group Theory and the Coulomb Problem, Wiley-Interscience, New York, 1972


[F] E. Fermi, Nuclear Physics, Univ. of Chicago Press, 1949
[Fo] G. Folland, Harmonic Analysis on Phase Space, Princeton University Press, Princeton, NJ, 1989
[GWW] C. Gordon, D. Webb, and S. Wolpert, Isospectral plane domains and surfaces via Riemannian orbifolds, Invent. Math. 110 (1992), 1-22
[Gr] P. Greiner, An asymptotic expansion for the heat equation, Arch. Rat. Mech. Anal. 41 (1971), 163-218
[GS1] V. Guillemin and S. Sternberg, Symplectic Techniques in Physics, Cambridge University Press, Cambridge, 1984
[GS2] V. Guillemin and S. Sternberg, Variations on a Theme by Kepler, AMS Colloq. Publ., Vol. 42, Providence, RI, 1990
[Haw] J. Hawkins, Ergodic Dynamics, Springer, New York, 2021
[Hej] D. Hejhal, The Selberg trace formula for PSL(2,R), I, LNM #548, II, LNM #1001, Springer, New York, 1976, 1983
[Helg] S. Helgason, Analysis on Lie Groups and Symmetric Spaces, CBMS Reg. Conf. Ser. in Math. #14, AMS, Providence, RI, 1972
[HMR] B. Helffer, A. Martinez, and D. Robert, Ergodicité et limite semiclassique, Commun. Math. Phys. 109 (1987), 313-326
[Ho] L. Hörmander, The Analysis of Linear Partial Differential Operators, Vols. 3 and 4, Springer, New York, 1985
[Ing] A. Ingham, The Distribution of Prime Numbers, Cambridge University Press, Cambridge, 1932
[IZ] C. Itzykson and J. Zuber, Quantum Field Theory, McGraw-Hill, New York, 1980
[Iv] V. Ivrii, Microlocal Analysis and Precise Spectral Asymptotics, Springer, New York, 1998
[Kac] M. Kac, Can one hear the shape of a drum?, Am. Math. Monthly 73 (1966), 1-23
[Kat] T. Kato, On the adiabatic theorem for quantum mechanics, Jour. Phys. Soc. Japan 5 (1950), 435-439
[Lap] M. Lapidus, Fractal drum, inverse spectral problems for elliptic operators, and a partial resolution of the Weyl-Berry conjecture, Trans. AMS 325 (1991), 465-529
[LP] P. Lax and R. Phillips, Scattering Theory for Automorphic Functions, Princeton University Press, Princeton, NJ, 1976
[Leb] N. Lebedev, Special Functions and Their Applications, Dover, New York, 1972
[MS] H. McKean and I. Singer, Curvature and the eigenvalues of the Laplacian, J. Diff. Geom. 1 (1967), 43-69
[MVW] R. Melrose, A. Vasy, and J. Wunsch, Propagation of singularities for the wave equation on edge manifolds, Duke Math. J. 144 (2008), 109-193
[MVW2] R. Melrose, A. Vasy, and J. Wunsch, Diffraction of singularities on manifolds with corners, Astérisque 351 (2013)
[MW] R. Melrose and J. Wunsch, Propagation of singularities for the wave equation on conic manifolds, Invent. Math. 156 (2004), 235-299
[Mes] A. Messiah, Quantum Mechanics, North Holland, Amsterdam, 1961
[MT] J. Metcalfe and M. Taylor, Nonlinear waves on 3D hyperbolic space, Trans. AMS 363 (2011), 3489-3529
[MT2] J. Metcalfe and M. Taylor, Dispersive wave estimates on 3D hyperbolic space, Proc. AMS 140 (2012), 3861-3866
[Mil] W. Miller, Symmetry and Separation of Variables, Addison-Wesley, Reading, MA, 1977
[Miz] S. Mizohata, The Theory of Partial Differential Equations, Cambridge University Press, Cambridge, 1973


[NRR] E. Nilsson, J. Rowlett, and F. Rydell, The isospectral problem for flat tori from three perspectives, Bull. AMS 60 (2023), 39-83
[Olv] F. Olver, Asymptotics and Special Functions, Academic, New York, 1974
[PT] M. Pinsky and M. Taylor, Pointwise Fourier inversion: a wave equation approach, J. Fourier Anal. 3 (1997), 647-703
[RS] M. Reed and B. Simon, Methods of Mathematical Physics, Academic, New York, Vols. 1, 2, 1975; Vols. 3, 4, 1978
[ST] R. Schrader and M. Taylor, Semiclassical asymptotics, gauge fields, and quantum chaos, J. Funct. Anal. 83 (1989), 258-316
[S] E. Segre, Nuclei and Particles, Benjamin, New York, 1977
[Ser] G. W. Series, Spectrum of Atomic Hydrogen, Oxford University Press, Oxford, 1957
[SW] A. Shapere and F. Wilczek, Geometric Phases in Physics, World Scientific, Singapore, 1989
[Shn] A. Shnirelman, Ergodic properties of eigenfunctions, Usp. Mat. Nauk 29 (1974), 181-182
[Shu] M. Shubin, Pseudodifferential Operators and Spectral Theory, Springer, New York, 1987
[Si1] B. Simon, Quantum Mechanics for Hamiltonians Defined as Quadratic Forms, Princeton University Press, Princeton, NJ, 1971
[Si2] B. Simon, Trace Ideals and Their Applications, Cambridge University Press, Cambridge, 1979
[Si3] B. Simon, Functional Integration and Quantum Physics, Academic, New York, 1979
[Si4] B. Simon, Holonomy, the quantum adiabatic theorem, and Berry's phase, Phys. Rev. Lett. 51 (1983), 2167-2170
[Sob] S. Sobolev, Partial Differential Equations of Mathematical Physics, Dover, New York, 1964
[Som] A. Sommerfeld, Mathematische theorie der diffraktion, Math. Ann. 47 (1896), 317-374
[Stk] I. Stakgold, Boundary Value Problems of Mathematical Physics, Macmillan, New York, 1968
[Str] R. Strichartz, Analysis of the Laplacian on a complete Riemannian manifold, J. Func. Anal. 52 (1983), 48-79
[Sun] T. Sunada, Riemannian coverings and isospectral manifolds, Ann. Math. 121 (1985), 169-186
[T1] M. Taylor, Pseudodifferential Operators, Princeton University Press, Princeton, NJ, 1981
[T2] M. Taylor, Noncommutative Harmonic Analysis, AMS, Providence, RI, 1986
[T3] M. Taylor, Measure Theory and Integration, AMS, Providence, RI, 2006
[T4] M. Taylor, Variations on quantum ergodic theorems, Potential Analysis 43 (2015), 625-651
[T5] M. Taylor, Variations on quantum ergodic theorems II, Contemporary Math. 748 (2019), 251-265
[T6] M. Taylor, Introduction to Analysis in Several Variables: Advanced Calculus, AMS, Providence, RI, 2020
[Ter] A. Terras, Harmonic Analysis on Symmetric Spaces and Applications, Springer, New York, 1985
[Wat] G. N. Watson, A Treatise on the Theory of Bessel Functions, Cambridge University Press, Cambridge, 1945


[WW] E. Whittaker and G. N. Watson, A Course of Modern Analysis, Cambridge University Press, Cambridge, 1927
[WZ] F. Wilczek and A. Zee, Appearance of gauge structure in simple dynamical systems, Phys. Rev. Lett. 52 (1984), 2111-2114
[Zel] S. Zelditch, Uniform distribution of eigenfunctions on compact hypersurfaces, Duke Math. J. 55 (1987), 919-941
[Zw] M. Zworski, Semiclassical Analysis, AMS, 2012


9




Scattering by Obstacles 


Introduction 


In this chapter we study the phenomenon of scattering by a compact obstacle in
Euclidean space $\mathbb{R}^3$. We restrict attention to the three-dimensional case, though
a similar analysis can be given for obstacles in $\mathbb{R}^n$ whenever $n$ is odd. The Huygens
principle plays an important role in part of the analysis, and for that part the
situation for $n$ even is a little more complicated, though a theory exists there also.
The basic scattering problem is to solve the boundary problem

(0.1) $(\Delta + k^2)v = 0$ in $\Omega$, $\quad v = f$ on $\partial K$,

where $\Omega = \mathbb{R}^3 \setminus K$ is the complement of a compact set $K$. (We also assume $\Omega$ is
connected.) We place on $v$ the “radiation condition”

(0.2) $r\Bigl(\frac{\partial v}{\partial r} - ikv\Bigr) \to 0, \quad \text{as } r \to \infty,$

in case $k$ is real. We establish the existence and uniqueness of solutions to (0.1)
and (0.2) in §1. Motivation for the condition (0.2) is also given there.

Special choices of the boundary value $f$ give rise to the construction of the
Green function $G(x, y, k)$ and of “eigenfunctions” $u_\pm(x, k\omega)$. In §2 we study
analogues of the Fourier transform, arising from such eigenfunctions, providing
$\Phi_\pm$, unitary operators from $L^2(\Omega)$ to $L^2(\mathbb{R}^3)$ which intertwine the Laplace
operator on $\Omega$, with the Dirichlet boundary condition, and multiplication by $-|\xi|^2$ on
$L^2(\mathbb{R}^3)$.

For any smooth $f$ on $\partial K$, the solution to (0.1) and (0.2) has the following
asymptotic behavior:

(0.3) $v(r\theta) = r^{-1}e^{ikr}\,a(f, \theta, k) + o(r^{-1}), \quad r \to \infty,$

known as the “far field expansion.” In case $f(x) = -e^{ik\omega\cdot x}$ on $\partial K$, the coefficient
is denoted by $a(\omega, \theta, k)$ and called the “scattering amplitude.” This is one of the
fundamental objects of scattering theory; in §3 it is related to the unitary operator
$S = \Phi_+\Phi_-^{-1}$ on $L^2(\mathbb{R}^3)$, the “scattering operator.”

© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
M. E. Taylor, Partial Differential Equations II, Applied Mathematical Sciences 116,
https://doi.org/10.1007/978-3-031-33700-0_9

The term “scattering” refers to the scattering of waves. Connection with the 
wave equation is made in §§4 and 5, where the scattering operator is related to the
long-time behavior of the solution operator for the wave equation, in counterpoint 
to the long-distance characterization of the scattering amplitude given in §1. In 
the study of the wave-equation approach to scattering theory, a useful tool is a 
semigroup Z(t), introduced by Lax and Phillips, which is described in §6. 

Section 7 considers the meromorphic continuation in $k$ of the solution operator
to (0.1) and (0.2). This operator has poles in the lower half-plane, called scattering 
poles. The analytical method used to effect this construction involves the classical 
use of integral equations. We also relate the scattering poles to the spectrum of the 
Lax—Phillips semigroup Z(t). In §8 we derive “trace formulas,” further relating 
the poles and Z(t). In §9 we illustrate material of earlier sections by explicit 
calculations for scattering by the unit sphere in $\mathbb{R}^3$.

In §§10 and 11, we discuss the “inverse” problem of determining an obstacle
$K$, given scattering data. Section 10 focuses on uniqueness results, asserting that
exact measurements of certain scattering data will uniquely determine $K$. In §11
we discuss some methods that have been used to determine K approximately, 
given approximate measurements of scattering data. This leads us to a discussion 
of “ill-posed” problems and how to regularize them. 

In §12 we present some material on scattering by a rough obstacle, pointing 
out similarities and differences with the smooth cases considered in the earlier 
sections. Appendix A at the end of this chapter is devoted to the proof of a trace 
identity used in §8. 

We have confined attention to the Dirichlet boundary condition. The scattering 
problem with the Neumann boundary condition, and for electromagnetic waves, 
with such boundary conditions as discussed in Chap. 5, are of equal interest. There 
are also studies of scattering for the equations of linear elasticity, with boundary 
conditions of the sort considered in Chap. 5. Many of the results in such cases
can be obtained with only minor modifications of the techniques used here, while 
other results require further work. For further material on the theory of scattering 
by obstacles, consult [LP1], [Rm], [CK], and [Wil]. 

Another important setting for scattering theory is the Schrödinger operator
$-\Delta + V$; see [RS], [New], and [Ho] for material on this. We include some
exercises on some of the simplest problems in this quantum scattering theory. These
exercises indicate that very similar techniques to those for scattering by a compact
obstacle apply to scattering by a compactly supported potential. It would not take
a much greater modification to handle potentials $V(x)$ that decay very rapidly as
$|x| \to \infty$. Such potentials, with exponential fall-off, are used in crude models of
two-body interactions involving nuclear forces. It takes more substantial
modifications to treat long-range potentials, such as those that arise from the Coulomb
force. The most interesting quantum scattering problems involve multiparticle
interactions, and the analysis of these requires a much more elaborate set-up. See
[Vas].




Further directions include the scattering behavior of waves on noneuclidean 
spaces, with special structure at infinity. For major developments in this area, see 
[Me4, Bor], and [DZ]. 


1. The scattering problem 


In this section we establish the existence and uniqueness for the following
boundary problem. Let $K \subset \mathbb{R}^3$ be a compact set with smooth boundary and
connected complement $\Omega$. Let $f \in H^s(\partial K)$ be given, and let $k > 0$. We want to
solve

(1.1) $(\Delta + k^2)v = 0$ on $\Omega$,
(1.2) $v = f$ on $\partial K$.


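In addition, we impose a “radiation condition,” of the following form:

(1.3) $|rv(x)| \le C, \quad r\Bigl(\frac{\partial v}{\partial r} - ikv\Bigr) \to 0, \quad \text{as } r \to \infty,$

where $r = |x|$. This condition will hold provided $v$ satisfies the integral identity

(1.4) $v(x) = \int_{\partial K}\Bigl[f(y)\,\frac{\partial g}{\partial\nu_y}(x, y, k) - g(x, y, k)\,\frac{\partial v}{\partial\nu}(y)\Bigr]\,dS(y),$

for $x \in \Omega$, where

(1.5) $g(x, y, k) = (4\pi|x - y|)^{-1}e^{ik|x-y|}.$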

Our existence proof will utilize the following fact. If $k > 0$ is replaced by
$k + i\varepsilon$, $\varepsilon > 0$, then $-(k + i\varepsilon)^2$ belongs to the resolvent set of the Laplace operator
$\Delta$ on $\Omega$, with Dirichlet boundary conditions on $\partial K$. Hence, for $s \ge 3/2$, (1.1)-(1.2)
(with $k$ replaced by $k + i\varepsilon$) has a unique solution $v_\varepsilon \in L^2(\Omega)$. To obtain
this, extend $f$ to $f^\# \in H^2(\Omega)$, and set $\varphi = (\Delta + (k + i\varepsilon)^2)f^\# \in L^2(\Omega)$. Then

$v_\varepsilon = f^\# - (\Delta + (k + i\varepsilon)^2)^{-1}\varphi.$

Furthermore, in this case, the integral formula (1.4) does hold, as a consequence
of Green's theorem, with $g(x, y, k)$ replaced by

(1.6) $g(x, y, k + i\varepsilon) = (4\pi|x - y|)^{-1}e^{(ik - \varepsilon)|x-y|},$

which, as we saw in Chap. 3, is (the negative of) the resolvent kernel for
$(\Delta + (k + i\varepsilon)^2)^{-1}$ on free space $\mathbb{R}^3$, a kernel that converges to (1.5) as $\varepsilon \searrow 0$. The strategy
will be to show that, as $\varepsilon \searrow 0$, $v_\varepsilon$ converges to the solution to (1.1)-(1.3).
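The role of the free-space kernel can be illustrated numerically. The sketch below, written in terms of the radial variable $r = |x - y|$, checks three properties of $g = e^{ikr}/(4\pi r)$: it solves the radial Helmholtz equation away from $r = 0$, it satisfies the outgoing radiation condition (in fact $r(\partial_r g - ikg) = -g$), and replacing $k$ by $k + i\varepsilon$ introduces the decay factor $e^{-\varepsilon r}$. The function names and the finite-difference step are illustrative assumptions.

```python
import cmath
import math

def g(r, k):
    """Free-space kernel e^{ikr}/(4 pi r) as a function of r = |x - y|."""
    return cmath.exp(1j * k * r) / (4.0 * math.pi * r)

def dg_dr(r, k):
    """Exact radial derivative of g."""
    return (1j * k - 1.0 / r) * g(r, k)

def helmholtz_residual(r, k, h=1e-3):
    """Finite-difference value of g'' + (2/r) g' + k^2 g (should be near 0)."""
    d2 = (g(r + h, k) - 2.0 * g(r, k) + g(r - h, k)) / (h * h)
    return d2 + 2.0 * dg_dr(r, k) / r + k * k * g(r, k)

def radiation_defect(r, k):
    """r (dg/dr - i k g), which equals -g and hence tends to 0 as r grows."""
    return r * (dg_dr(r, k) - 1j * k * g(r, k))

k, eps = 2.0, 0.05                      # sample values (assumed)
print(abs(helmholtz_residual(3.0, k)))
print([abs(radiation_defect(r, k)) for r in (10.0, 100.0, 1000.0)])
print(abs(g(50.0, k + 1j * eps)) / abs(g(50.0, k)), math.exp(-eps * 50.0))
```

The last line compares the damping of $g(\cdot, k + i\varepsilon)$ with $e^{-\varepsilon r}$, which is what places the kernel in $L^2$ at infinity for $\varepsilon > 0$ but not for $\varepsilon = 0$.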

Before tackling the existence proof, we first establish the uniqueness of solu- 
tions to (1.1)-(1.3), as this uniqueness result will play an important role in the 
existence proof. 


Proposition 1.1. Given k > 0, if v satisfies (1.1)-(1.3) with f = 0, then v = 0. 


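Proof. Let $S_R$ denote the sphere $\{|x| = R\}$ in $\mathbb{R}^3$; for $R$ large, $S_R \subset \Omega$, and,
with $v_r = \partial v/\partial r$, we have

(1.7) $\int_{S_R}|v_r - ikv|^2\,dS = \int_{S_R}\bigl(|v_r|^2 + k^2|v|^2\bigr)\,dS - ik\int_{S_R}(v\bar{v}_r - \bar{v}v_r)\,dS.$

Now Green's theorem applied to $v$ and $\bar{v}$ implies

(1.8) $\int_{S_R}(v\bar{v}_r - \bar{v}v_r)\,dS = \int_{\partial K}\Bigl(v\frac{\partial\bar{v}}{\partial\nu} - \bar{v}\frac{\partial v}{\partial\nu}\Bigr)\,dS = 0,$

provided $v|_{\partial K} = 0$. Since the hypothesis (1.3) implies

(1.9) $\int_{S_R}|v_r - ikv|^2\,dS \to 0, \quad \text{as } R \to \infty,$

we deduce from (1.7) that

(1.10) $\int_{S_R}|v|^2\,dS \to 0, \quad \text{as } R \to \infty.$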

The proof of Proposition 1.1 is completed by the following result. 


Lemma 1.2. If $v$ satisfies $(\Delta + k^2)v = 0$ for $|x| > R_0$ and (1.10) holds, then
$v(x) = 0$ for $|x| > R_0$.


Proof. It suffices to prove that, for $r > R_0$,

(1.11) $V(r) = \int_{S^2} v(r\omega)\,\varphi(\omega)\,dS(\omega)$

is identically zero, for each eigenfunction $\varphi$ of the Laplace operator $\Delta_S$ on the
unit sphere $S^2$:

(1.12) $(\Delta_S + \mu^2)\varphi = 0 \quad (\mu \ge 0).$

In view of the formula for $\Delta$ on $\mathbb{R}^3$ in polar coordinates,

(1.13) $\Delta = \frac{\partial^2}{\partial r^2} + \frac{2}{r}\frac{\partial}{\partial r} + \frac{1}{r^2}\Delta_S,$

it follows that $V(r)$ satisfies the ODE

(1.14) $V''(r) + \frac{2}{r}V'(r) + \Bigl(k^2 - \frac{\mu^2}{r^2}\Bigr)V(r) = 0, \quad r > R_0.$

This ODE has two linearly independent solutions of the form $r^{-1/2}H_\nu^{(j)}(kr)$,
$j = 1, 2$, where $H_\nu^{(1)}(z)$ and $H_\nu^{(2)}(z)$ are the Hankel functions discussed in
Chap. 3, and $\nu^2 = \mu^2 + 1/4$. In view of the integral formulas given there, it
follows that the asymptotic behavior of these two solutions is of the form

(1.15) $V_\pm(r) = C_\pm r^{-1}e^{\pm ikr} + o(r^{-1}), \quad r \to \infty.$

Clearly, no nontrivial linear combination of these two is $o(r^{-1})$ as $r \to \infty$. Since
the hypothesis implies that $V(r) = o(r^{-1})$, we deduce that $V = 0$.
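A concrete instance may help. Taking $\mu^2 = \ell(\ell+1)$ with $\ell = 1$ gives $\nu = 3/2$, and $r^{-1/2}H^{(1)}_{3/2}(kr)$ is then a constant multiple of the spherical Hankel function $h_1^{(1)}(kr) = -e^{ikr}(kr + i)/(kr)^2$. The sketch below (illustrative names, finite differences with an assumed step size) checks that this function solves the ODE (1.14) and has the asymptotic form (1.15) with $C_+ = -1/k$.

```python
import cmath

def h1(z):
    """Spherical Hankel function h_1^{(1)}(z) = -e^{iz} (z + i) / z^2."""
    return -cmath.exp(1j * z) * (z + 1j) / (z * z)

def ode_residual(V, r, k, mu2, h=1e-3):
    """Central-difference value of V'' + (2/r) V' + (k^2 - mu2/r^2) V."""
    d1 = (V(r + h) - V(r - h)) / (2.0 * h)
    d2 = (V(r + h) - 2.0 * V(r) + V(r - h)) / (h * h)
    return d2 + 2.0 * d1 / r + (k * k - mu2 / (r * r)) * V(r)

k, mu2 = 2.0, 2.0                       # mu^2 = l(l+1) with l = 1 (assumed)

def V(r):
    return h1(k * r)

residual = abs(ode_residual(V, 5.0, k, mu2))

# Leading coefficient in V(r) ~ C r^{-1} e^{ikr}: should approach -1/k.
r_big = 1.0e4
coefficient = r_big * V(r_big) * cmath.exp(-1j * k * r_big)
print(residual, coefficient)
```

Since the leading coefficient is nonzero, this solution is not $o(r^{-1})$, in line with the argument that only the trivial solution can satisfy (1.10).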


Applying Lemma 1.2, we see that under the hypotheses of Proposition 1.1,
$v = 0$ for $|x| > R_0$, given that $K \subset \{x : |x| < R_0\}$. Since $v$ satisfies the unique
continuation property in $\Omega$, this implies $v = 0$ in $\Omega$, so Proposition 1.1 is proved.


Remark: The uniqueness proof above really used (1.9), which is formally weaker 
than the radiation condition (1.3). Consequently, (1.9) is sometimes called the 
radiation condition. On the other hand, the existence theorem, to which we turn 
next, shows that the formally stronger condition (1.3) holds. 

The following result, which establishes the existence of solutions to (1.1)- 
(1.3), is known as the limiting absorption principle. 


Theorem 1.3. Let $s \ge 3/2$, and suppose that as $\varepsilon \searrow 0$,

(1.16) $f_\varepsilon \to f$ in $H^s(\partial K)$.

Let $v_\varepsilon$ be the unique element of $L^2(\Omega)$ satisfying

(1.17) $(\Delta + (k + i\varepsilon)^2)v_\varepsilon = 0$ in $\Omega$,
(1.18) $v_\varepsilon = f_\varepsilon$ on $\partial K$.

Then, as $\varepsilon \searrow 0$, we have a unique limit

(1.19) $v_\varepsilon \to v = B(k)f,$

satisfying (1.1)-(1.3). Convergence occurs in the norm topology of the space
$L^2(\Omega, \langle x\rangle^{-1-\delta}\,dx)$ for any $\delta > 0$, as well as in $H^{s+1/2}_{\mathrm{loc}}(\overline{\Omega})$, and the limit $v$
satisfies the identity (1.4).

It is convenient to divide the proof into two parts. Fix $R$ such that $K \subset \{|x| < R\}$
and let $O_R = \Omega \cap \{|x| < R\}$.


Lemma 1.4. Assume $v_\varepsilon|_{O_R}$ is bounded in $L^2(O_R)$ as $\varepsilon \searrow 0$. Then the
conclusions of Theorem 1.3 hold.


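Proof. Fix $S < R$ with $K \subset \{|x| < S\}$. The elliptic estimates of Chap. 5 imply
that if $\|v_\varepsilon\|_{L^2(O_R)}$ is bounded, then

(1.20) $\|v_\varepsilon\|_{H^{s+1/2}(O_S)} \le C_1 + C_2\|f_\varepsilon\|_{H^s(\partial K)}.$

Passing to a subsequence, which we continue to denote by $v_\varepsilon$, we have

(1.21) $v_\varepsilon \to v$ weakly in $H^{s+1/2}(O_S)$,

for some $v \in H^{s+1/2}(O_S)$. The trace theorem implies weak convergence

(1.22) $v_\varepsilon|_{\partial K} \to v|_{\partial K}$ in $H^s(\partial K)$,

and

(1.23) $\frac{\partial v_\varepsilon}{\partial\nu} \to \frac{\partial v}{\partial\nu}$ in $H^{s-1}(\partial K)$.

Since each $v_\varepsilon$ satisfies

(1.24) $v_\varepsilon(x) = \int_{\partial K}\Bigl[f_\varepsilon(y)\,\frac{\partial g_\varepsilon}{\partial\nu_y} - g_\varepsilon\,\frac{\partial v_\varepsilon}{\partial\nu}(y)\Bigr]\,dS(y), \quad x \in \Omega,$

with $g_\varepsilon = g(x, y, k + i\varepsilon)$ given by (1.6), we deduce from (1.22) and (1.23) that
the right side of (1.24) converges locally uniformly in $x \in \Omega$, as $\varepsilon \searrow 0$, to a limit,
call it $v$, that coincides with the limit (1.21) on $O_S$. Furthermore, in view of the
formula (1.6), we have the estimate

(1.25) $|v_\varepsilon(x)| \le C_K\langle x\rangle^{-1}, \quad x \in \Omega,$

with $C_K$ independent of $\varepsilon$. Thus the limit $v$ satisfies this estimate, and we have
$v_\varepsilon \to v$ in $L^2(\Omega, \langle x\rangle^{-1-\delta}\,dx)$ for any $\delta > 0$. Furthermore, the limit $v$ satisfies the
identity (1.4), so the radiation condition (1.3) holds.

So far we have convergence for subsequences, but in view of the uniqueness
result of Proposition 1.1, this limit $v$ is unique, so Lemma 1.4 is proved.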


The proof of Theorem 1.3 is completed by the following argument. 


Lemma 1.5. The hypotheses (1.16)-(1.18) of Theorem 1.3 imply that $\{v_\varepsilon\}$ is
bounded in $L^2(\Omega, \langle x\rangle^{-1-\delta}\,dx)$, for any $\delta > 0$.


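Proof. Fix such a $\delta$. Suppose $N_\varepsilon = \|v_\varepsilon\|_{L^2(\Omega, \langle x\rangle^{-1-\delta}dx)} \to \infty$ for a subsequence
$\varepsilon_n \searrow 0$. Set $w_\varepsilon = N_\varepsilon^{-1}v_\varepsilon$. Then Lemma 1.4 applies to $w_\varepsilon$, with
$w_\varepsilon|_{\partial K} = \tilde{f}_\varepsilon = N_\varepsilon^{-1}f_\varepsilon \to 0$ in $H^s(\partial K)$. Thus the conclusion of Lemma 1.4 gives

$w_\varepsilon \to w$ strongly in $L^2(\Omega, \langle x\rangle^{-1-\delta}\,dx)$.

The limit $w$ satisfies the scattering problem (1.1)-(1.3) with $f = 0$, so our
uniqueness result implies $w = 0$. This contradicts the fact that each $w_\varepsilon$ has norm 1 in
$L^2(\Omega, \langle x\rangle^{-1-\delta}\,dx)$, so the proof is complete.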
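The limiting absorption principle can be seen explicitly in a simple special case, which is an illustration chosen here and not an example from the text: take $K$ to be the closed unit ball and $f_\varepsilon = f = 1$. Then $v_\varepsilon(x) = e^{(ik-\varepsilon)(r-1)}/r$ and $v(x) = e^{ik(r-1)}/r$, and the weighted norm of the difference reduces to a one-dimensional integral. The sketch below evaluates it with a midpoint rule after mapping $(1, \infty)$ to $(0, 1)$; the function name, the choice $\delta = 1$, and the quadrature parameters are assumptions.

```python
import math

def weighted_norm(eps, delta=1.0, n=50000):
    """|| v_eps - v || in L^2(Omega, <x>^(-1-delta) dx), K = unit ball, f = 1.

    Uses r^2 |v_eps - v|^2 = (1 - exp(-eps (r - 1)))^2, which is independent
    of k, and the substitution r = 1 + u/(1 - u) with u in (0, 1).
    """
    h = 1.0 / n
    total = 0.0
    for j in range(n):
        u = (j + 0.5) * h                     # midpoint of the j-th cell
        r = 1.0 + u / (1.0 - u)
        jacobian = 1.0 / (1.0 - u) ** 2       # dr/du
        diff2 = (1.0 - math.exp(-eps * (r - 1.0))) ** 2
        weight = (1.0 + r * r) ** (-(1.0 + delta) / 2.0)
        total += diff2 * weight * jacobian * h
    return math.sqrt(4.0 * math.pi * total)

norms = [weighted_norm(eps) for eps in (0.1, 0.01, 0.001)]
print(norms)
```

The norms decrease as $\varepsilon$ decreases, roughly like $\sqrt{\varepsilon}$ for $\delta = 1$. Without the weight $\langle x\rangle^{-1-\delta}$ the limit $v$ is not even in $L^2(\Omega)$, which is why convergence is stated in the weighted space.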

Remark: Considering the dense subspace $H^{s+1}(\partial K)$ of $H^s(\partial K)$, we can
improve weak convergence of $v_\varepsilon \to v$ in $H^{s+1/2}_{\mathrm{loc}}(\overline{\Omega})$ to strong convergence in this
space.


We draw a couple of conclusions from Theorem 1.3. The first concerns the
limiting behavior as $\varepsilon \searrow 0$ of the Green function $G(x, y, k + i\varepsilon)$, the kernel for
the resolvent $(\Delta + (k + i\varepsilon)^2)^{-1}$ on $\Omega$, which is of the form

(1.26) $G(x, y, k + i\varepsilon) = g(x, y, k + i\varepsilon) + h(x, y, k + i\varepsilon),$

where $g(x, y, k + i\varepsilon)$ is the free-space Green kernel (1.6) and $h(x, y, k + i\varepsilon)$ is,
for each $y \in \Omega$, the element of $L^2(\Omega)$ satisfying

(1.27) $(\Delta_x + (k + i\varepsilon)^2)h = 0, \qquad h(x, y, k + i\varepsilon) = -g(x, y, k + i\varepsilon), \ \text{for } x \in \partial K.$

Clearly, as $\varepsilon \searrow 0$, $g(x, y, k + i\varepsilon) \to g(x, y, k)$, given by (1.5). On the other hand,
for any $y \in \Omega$, Theorem 1.3 applies to $f_\varepsilon(x) = -g(x, y, k + i\varepsilon)$, and we have

(1.28) $h(x, y, k + i\varepsilon) \to h(x, y, k),$

where $h(x, y, k)$ solves the scattering problem (1.1)-(1.3), with $h(x, y, k) =
-g(x, y, k)$ for $x \in \partial K$. Consequently, as $\varepsilon \searrow 0$,

(1.29) $G(x, y, k + i\varepsilon) \to G(x, y, k),$

where

(1.30) $G(x, y, k) = g(x, y, k) + h(x, y, k).$

Another important family of functions defined by a scattering problem is the
following. Note that we have

(1.31) $(\Delta + |\xi|^2)e^{-ix\cdot\xi} = 0$ on $\mathbb{R}^3$,

for any $\xi \in \mathbb{R}^3$. We define the functions $u(x, \xi)$ on $\Omega \times \mathbb{R}^3$ by

(1.32) $u(x, \xi) = e^{-ix\cdot\xi} + v(x, \xi),$

where $v(x, \xi)$ satisfies the scattering problem (1.1)-(1.3), with $k^2 = |\xi|^2$ and

(1.33) $v(x, \xi) = -e^{-ix\cdot\xi}$ on $\partial K$.

As we will see in the next section, $u(x, \xi)$ plays a role on $\Omega$ of generalized
eigenfunction of the Laplace operator on $\Omega$, with Dirichlet boundary conditions,
analogous to the role played by $u_0(x, \xi) = e^{-ix\cdot\xi}$ on $\mathbb{R}^3$.

There is an interesting relation between the Green function $G(x, y, k)$ and the
“eigenfunctions” $u(x, \xi)$, which we give here, which will play an important role in
the analysis in the next section. It involves the behavior of $G(x, y, k)$ as $|y| \to \infty$.


Proposition 1.6. For $y = r\omega$, $\omega \in S^2$, $r \to \infty$, and any fixed $k > 0$,

(1.34) $G(x, r\omega, k) = (4\pi r)^{-1}e^{ikr}\,u(x, k\omega) + O(r^{-2}).$

This is uniformly valid for $(x, \omega, k)$ in any bounded subset of $\Omega \times S^2 \times \mathbb{R}^+$.


Proof. Write $G(x, r\omega, k) = g(x, r\omega, k) + h(x, r\omega, k)$, as in (1.30). Thus $h_r(x) =
h(x, r\omega, k)$ satisfies

(1.35) $(\Delta + k^2)h_r(x) = 0, \qquad h_r|_{\partial K} = -g(x, r\omega, k),$

together with the radiation condition as $|x| \to \infty$. Now, in view of (1.5), as $r \to
\infty$, we have, for $x \in \partial K$, or indeed for $x$ in any bounded subset of $\mathbb{R}^3$,

(1.36) $g(x, r\omega, k) = (4\pi r)^{-1}e^{ikr}e^{-ikx\cdot\omega} + O(r^{-2}),$

where the remainder is $O(r^{-2})$ in $C^\ell(\partial K)$ for any $\ell$. Thus, in view of the
estimates established in the proof of Theorem 1.3, we have

(1.37) $h_r = (4\pi r)^{-1}e^{ikr}\,v(x, k\omega) + O(r^{-2}), \quad r \to \infty,$

with $v(x, \xi)$ defined above. Adding (1.36) and (1.37) and using (1.32) gives the desired result (1.34).
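The expansion of the free-space kernel used in this proof rests on $|x - r\omega| = r - x\cdot\omega + O(r^{-1})$. The following sketch compares $g(x, r\omega, k) = e^{ik|x - r\omega|}/(4\pi|x - r\omega|)$ with its leading far-field term and prints the error multiplied by $r^2$, which should stay bounded. The sample point, direction, and wave number are assumptions chosen for illustration.

```python
import cmath
import math

def g(x, y, k):
    """Free-space kernel e^{ik|x-y|} / (4 pi |x-y|)."""
    d = math.dist(x, y)
    return cmath.exp(1j * k * d) / (4.0 * math.pi * d)

def far_field(x, r, omega, k):
    """Leading term (4 pi r)^{-1} e^{ikr} e^{-ik x.omega}."""
    x_dot_omega = sum(a * b for a, b in zip(x, omega))
    return cmath.exp(1j * k * (r - x_dot_omega)) / (4.0 * math.pi * r)

x = (0.3, -0.2, 0.5)                    # fixed observation point (assumed)
omega = (0.0, 0.0, 1.0)                 # unit direction of y = r * omega
k = 2.0

scaled_errors = []
for r in (1.0e2, 1.0e3, 1.0e4):
    y = tuple(r * c for c in omega)
    err = abs(g(x, y, k) - far_field(x, r, omega, k))
    scaled_errors.append(r * r * err)   # bounded if the error is O(r^-2)
print(scaled_errors)
```

For these sample values the scaled errors settle near $0.04$, consistent with a remainder of order $r^{-2}$.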


We remark that a similar argument gives

(1.38) $\frac{\partial}{\partial r}G(x, r\omega, k) = (4\pi r)^{-1}\,ik\,e^{ikr}\,u(x, k\omega) + O(r^{-2}),$

as $r \to \infty$.
Note that, for any $f \in C^\infty(\partial K)$, by (1.4) we have an asymptotic behavior of
the form

(1.39) $v(r\theta) = r^{-1}e^{ikr}\,a(f, \theta, k) + o(r^{-1}), \quad r \to \infty,$

with $\theta \in S^2$, for the solution to the scattering problem (1.1)-(1.3), with a smooth
coefficient $a(f, \cdot, \cdot)$. Also,

(1.40) $\frac{\partial}{\partial r}v(r\theta) = ik\,\frac{e^{ikr}}{r}\,a(f, \theta, k) + o(r^{-1}).$

In particular, the function $v(x, \xi)$ given by (1.33) has the asymptotic behavior

(1.41) $v(r\theta, k\omega) \sim r^{-1}e^{ikr}\,a(-\omega, \theta, k), \quad r \to \infty,$

for fixed $\theta, \omega \in S^2$, $k \in \mathbb{R}^+$, and its $r$-derivative has an analogous behavior. The
coefficient $a(\omega, \theta, k)$ is called the scattering amplitude and is one of the
fundamental objects of scattering theory. We will relate this to the scattering operator
in §3.

The radiation condition (1.3) is more specifically called the “outgoing radiation
condition.” It has a counterpart, the “incoming radiation condition”:

(1.42) $|rv(x)| \le C, \quad r\Bigl(\frac{\partial v}{\partial r} + ikv\Bigr) \to 0, \quad \text{as } r \to \infty.$

Clearly there is a parallel treatment of the scattering problem (1.1), (1.2), (1.42).
Indeed, if $v(x)$ satisfies (1.1)-(1.3), then $\overline{v(x)}$ satisfies the incoming scattering
problem, with $f$ replaced by $\overline{f}$, and conversely. In particular, we can define

(1.43) $u_-(x, \xi) = e^{-ix\cdot\xi} + v_-(x, \xi),$

where $v_-(x, \xi)$ satisfies the scattering problem (1.1), (1.2), (1.42), with

(1.44) $v_-(x, \xi) = -e^{-ix\cdot\xi}$ on $\partial K$,

and we clearly have

(1.45) $v_-(x, \xi) = \overline{v(x, -\xi)}, \qquad u_-(x, \xi) = \overline{u(x, -\xi)}.$

In analogy with (1.41), we have the asymptotic behavior

(1.46) $v_-(r\theta, k\omega) \sim r^{-1}e^{-ikr}\,a_-(\omega, \theta, k), \quad r \to \infty,$

with

(1.47) $a_-(\omega, \theta, k) = \overline{a(\omega, \theta, k)}.$

Sometimes, to emphasize the relation between these functions, we use the notation
$u_+(x, \xi)$, $v_+(x, \xi)$ and $a_+(\omega, \theta, k)$ for the functions defined by (1.32) and (1.33)
and by (1.41).

We note that while the discussion above has dealt with $k > 0$, the case $k = 0$
can also be included. In this case, the proof of Proposition 1.1 does not apply; for
example, (1.7) no longer implies (1.10). However, the existence and uniqueness
of a solution to

(1.48) $\Delta v = 0$ on $\Omega$, $\quad v = f$ on $\partial K$,

satisfying

(1.49) $|rv(x)| \le C, \quad |r^2\,\partial_r v| \le C, \quad \text{as } r \to \infty,$

is easily established, as follows. We can assume that the origin $0 \in \mathbb{R}^3$ is in the
interior of $K$. Then the inversion $\psi(x) = x/|x|^2$ interchanges 0 and the point at
infinity, and the transformation

(1.50) $v(x) = |x|^{-1}w(|x|^{-2}x)$

preserves harmonicity. We let $w$ be the unique harmonic function on the bounded
domain $\psi(\Omega)$, with boundary value $w(x) = |x|^{-1}f(\psi(x))$ on $\partial\psi(\Omega) = \psi(\partial K)$.
It is easily verified that $v(x)$ satisfies (1.49) in this case. Conversely, if $v(x)$
satisfies (1.48) and $w$ is defined by (1.50), then $w$ is harmonic on $\psi(\Omega) \setminus 0$ and equal
to $|x|^{-1}f\circ\psi$ on $\psi(\partial K)$. If $v$ also satisfies (1.49), then $w$ is bounded near 0, and so is
$r\,\partial w/\partial r$.

Now the boundedness of $w$ near 0 implies that 0 is a removable singularity
of $w$, since $\Delta w \in \mathcal{D}'(\psi(\Omega))$ is a distribution supported at 0, hence a finite linear
combination of derivatives of $\delta(x)$, which implies that $w$ is the sum of a function
harmonic on $\psi(\Omega)$ and a finite sum of derivatives of $|x|^{-1}$, and the latter cannot be
bounded unless it is identically zero. Similarly, $r\,\partial w/\partial r$ is harmonic on $\psi(\Omega) \setminus 0$,
and if it is bounded near 0 then it extends to be harmonic on $\psi(\Omega)$, and this in turn
implies that $w$ extends to be harmonic on $\psi(\Omega)$. Therefore, either one of the two
conditions in (1.49) gives uniqueness. Of course, if $f \in C(\partial K)$ the uniqueness
of solutions to (1.48), satisfying the first condition in (1.49), follows from the
maximum principle.
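The fact that the transformation (1.50) preserves harmonicity can be checked numerically. In the sketch below, `w` is an assumed example of a harmonic polynomial on $\mathbb{R}^3$, `kelvin` forms $v(x) = |x|^{-1}w(|x|^{-2}x)$, and `laplacian` is a standard seven-point finite-difference stencil; none of these names come from the text.

```python
import math

def w(y):
    """A harmonic polynomial on R^3 (illustrative choice)."""
    return 1.0 + y[0] * y[0] - y[1] * y[1] + 2.0 * y[0] * y[2]

def kelvin(x):
    """v(x) = |x|^{-1} w(x / |x|^2), as in (1.50)."""
    r2 = x[0] ** 2 + x[1] ** 2 + x[2] ** 2
    y = (x[0] / r2, x[1] / r2, x[2] / r2)
    return w(y) / math.sqrt(r2)

def laplacian(f, x, h=1e-3):
    """Seven-point finite-difference approximation to the Laplacian of f at x."""
    total = -6.0 * f(x)
    for i in range(3):
        for step in (-h, h):
            p = list(x)
            p[i] += step
            total += f(tuple(p))
    return total / (h * h)

x0 = (0.7, -0.4, 1.1)                   # sample point away from the origin
lap = laplacian(kelvin, x0)

# Decay at infinity: r * v(r * e) should approach w(0) = 1.
r = 1.0e3
far = r * kelvin((r * 0.6, 0.0, r * 0.8))
print(lap, far)
```

The second printed value illustrates the first bound in (1.49): $|x|\,v(x)$ stays bounded, and in fact tends to $w(0)$.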

With this result established, the limiting absorption principle, Theorem 1.3,
also holds for $k = 0$. We also note that the proof of Theorem 1.3 continues to
work if instead of using $k + i\varepsilon$ ($\varepsilon \searrow 0$) in (1.17), one replaces $k + i\varepsilon$ by any
$\lambda(\varepsilon)$ approaching $k \in [0, \infty)$ from the upper half-plane. Furthermore, the limit
$v$ depends continuously on $k$. In particular, the functions $u_\pm(x, \xi)$ defined above
are continuous in $\xi \in \mathbb{R}^3$, and $a_\pm(\omega, \theta, k)$ is continuous on $S^2 \times S^2 \times [0, \infty)$.

There is a natural fashion in which $u_+(x, \xi)$ and $u_-(x, \xi)$ fit together, which
we describe. This will be useful in §4. Namely, for $k \in \mathbb{R}$, $\omega \in S^2$, set

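(1.51) $U_\pm(x, k, \omega) = e^{-ikx\cdot\omega} + V_\pm(x, k, \omega),$

where $V_+$ satisfies (1.1)-(1.3) and $V_-$ satisfies (1.1), (1.2), (1.42), with the
boundary condition $V_\pm = -e^{-ikx\cdot\omega}$ for $x \in \partial K$. In each case, $k$ is not restricted to be
positive; we take any $k \in \mathbb{R}$ (using (1.49) for $k = 0$). It is easy to see that,
for any $k > 0$, $V_\pm(x, k, \omega) = v_\pm(x, k\omega)$, while, for $k < 0$, $V_\pm(x, k, \omega) =
v_\mp(x, -|k|\omega) = v_\mp(x, k\omega)$. Consequently,

(1.52) $k > 0 \Longrightarrow U_\pm(x, k, \omega) = u_\pm(x, k\omega), \qquad k < 0 \Longrightarrow U_\pm(x, k, \omega) = u_\mp(x, k\omega).$

Similarly, we can define $A_\pm(\omega, \theta, k)$ for $k \in \mathbb{R}$. Note that as $r \to +\infty$,

(1.53) $k > 0 \Longrightarrow V_\pm(r\theta, k, \omega) \sim \frac{e^{\pm ikr}}{r}\,A_\pm(\mp\omega, \theta, k), \qquad k < 0 \Longrightarrow V_\pm(r\theta, k, \omega) \sim \frac{e^{\pm ikr}}{r}\,A_\pm(\pm\omega, \theta, k).$


Exercises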


1. Let v solve (1.1)-(1.3), with f € H'(OK), with k > 0. Show that 


satisfies 


for all R such that K C Br(0), and that 


2 
6= 3m 2 f (| + (2) as =k [ la(f,0,6)[? a 
S2 


|c|=R 


The quantity ® is called the flux of the solution v. Show that 6 = 0 implies v = 0. 


(Hint: Refer to the proof of Proposition 1.1.) 


2. Investigate solutions of (1.1)-(1.3) for $f \in H^s(\partial K)$ with $s < 3/2$. (Hint: When
extending $f$ to $f^\# \in H^{s+1/2}(\Omega)$, use a parametrix construction for the Dirichlet
problem for $\Delta + k^2$.)

3. If $(\Delta + k^2)v(x) = 0$ for $x \in O$, open in $\mathbb{R}^n$, note that $w(x, y) = v(x)e^{ky}$ is harmonic
on $O \times \mathbb{R} \subset \mathbb{R}^{n+1}$. Deduce that $v$ must be real analytic on $O$, as asserted in the unique
continuation argument used to prove Proposition 1.1.

4. With a(w, 6, k) defined for k € R so that (1.53) holds, show that 


(1.54) k > 0 = > a(w,0,—k) = a(w,6,k). 


Relate this to (1.47). 


5. If the obstacle $K_2$ is obtained from $K_1$ by translation, $K_2 = K_1 + \eta$, show that the
scattering amplitudes are related by

$a_{K_2}(\omega, \theta, k) = e^{ik(\omega - \theta)\cdot\eta}\,a_{K_1}(\omega, \theta, k).$


The following exercises deal with the operator $H = -\Delta + V$ on $\mathbb{R}^3$, assuming $V(x)$
is a real-valued function in $C_0^\infty(\mathbb{R}^3)$. We consider the following variant of (1.1)-(1.3),
given $f \in L^2_{\mathrm{comp}}(\mathbb{R}^3)$:

(1.55) $(\Delta - V + k^2)v = f$ on $\mathbb{R}^3$,

(1.56) $|rv(x)| \le C, \quad r\Bigl(\frac{\partial v}{\partial r} - ikv\Bigr) \to 0, \quad \text{as } r \to \infty.$


6. Show that if $k > 0$ and $v$ satisfies (1.55)-(1.56) and $f = 0$, then $v = 0$. (Hint: Modify
the proof of Proposition 1.1, to get $v(x) = 0$ on $\mathbb{R}^3 \setminus B_R$, given $V$ supported on $B_R$.
Then use the following unique continuation result:

Theorem UCP. If $L$ is a second-order, real, scalar, elliptic operator on a connected
region $\Omega$, $Lv = 0$ on $\Omega$, and $v = 0$ on a nonempty open set $O \subset \Omega$, then $v = 0$ on $\Omega$.
A proof of this theorem can be found in [Ho], or in Chap. 14 of [T3].)

7. Show that H has no positive eigenvalues. (Hint: Use similar reasoning, with an ap- 
propriate variant of Lemma 1.2.) Obtain an analogue of Proposition 7.3 of Chap. 8, 
regarding negative eigenvalues. 

8. Modify the proof of Theorem 1.3 to obtain a (unique) solution of (1.55)-(1.56), as a
limit of $(-H + (k + i\varepsilon)^2)^{-1} f$, as $\varepsilon \searrow 0$, given $k > 0$. Show that (parallel to (1.4))
the solution $v$ satisfies


(1.57)  $v(x) = \int \big(V(y)v(y) + f(y)\big)\, g(x,y,k)\, dy = R(k)(Vv + f).$


This is called the Lippmann-Schwinger equation.
9. Let $u(x,\xi) = e^{-ix\cdot\xi} + v(x,\xi)$, where $v$ satisfies


$(\Delta - V + k^2)v = V(x)e^{-ix\cdot\xi}, \quad k^2 = |\xi|^2,$


and (1.56). Establish an analogue of Proposition 1.6 and an analogue of (1.41), yielding
$a(-\omega,\theta,k)$. Note the following case of (1.57):


(1.58)  $u(x,\xi) = e^{-ix\cdot\xi} + \int V(y)\, u(y,\xi)\, g(x,y,k)\, dy.$


10. Note that the argument involving (1.48)—-(1.50) has no analogue for the k = 0 case of 
(1.55)-(1.56). Reconsider this fact after looking at Exercise 9 of §9. 


2. Eigenfunction expansions 


The Laplace operator on $\Omega$ with the Dirichlet boundary condition, that is, with
domain
$D(\Delta) = H_0^1(\Omega) \cap H^2(\Omega),$


is self-adjoint and negative, so by the spectral theorem there is a projection-valued
measure $dE(\lambda)$ such that


(2.1)  $\varphi(-\Delta)v = \int_0^\infty \varphi(\lambda)\, dE(\lambda)v,$


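for any bounded continuous function $\varphi$. Furthermore, this spectral measure is
given in terms of the jump of the resolvent across the real axis:


(2.2)  $\varphi(-\Delta)v = \lim_{\varepsilon \searrow 0} \frac{1}{2\pi i} \int_0^\infty \varphi(\lambda)\big[(\Delta + \lambda - i\varepsilon)^{-1} - (\Delta + \lambda + i\varepsilon)^{-1}\big]v\, d\lambda.$


Using the kernel $G(x,y,k+i\varepsilon)$ for $(\Delta + (k+i\varepsilon)^2)^{-1}$, we can write this as


(2.3)  $\varphi(-\Delta)v(x) = \lim_{\varepsilon \searrow 0} -\frac{2}{\pi} \int_0^\infty \int_\Omega \varphi(k^2)\, \mathrm{Im}\, G(x,y,k+i\varepsilon)\, v(y)\, dy\; k\, dk.$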


From the limiting behavior
$G(x,y,k+i\varepsilon) \to G(x,y,k)$


established in §1, we can draw the following conclusion.


Proposition 2.1. The operator $\Delta$ on $\Omega$ has only absolutely continuous spectrum.
For any continuous $\varphi$ with compact support, we have


(2.4)  $\varphi(\sqrt{-\Delta})v(x) = -\frac{2}{\pi} \int_0^\infty \int_\Omega \mathrm{Im}\, G(x,y,k)\, v(y)\, dy\; \varphi(k)\, k\, dk.$


The meaning of the first statement of the proposition is that the spectral measure 
is absolutely continuous with respect to Lebesgue measure. 

The primary goal of this section is to give the spectral decomposition of
the Laplace operator on $\Omega$ in terms of the “eigenfunctions” $u(x,\xi)$ defined by
(1.32)-(1.33). We use a modified version of an approach taken in [Rm]. In view
of (2.4), the following result plays a key role in achieving the spectral decomposition.


Proposition 2.2. We have the identity


(2.5)  $\mathrm{Im}\, G(x,y,k) = -\frac{k}{16\pi^2} \int_{S^2} u(x,k\omega)\, \overline{u(y,k\omega)}\, d\omega.$




Proof. We obtain this identity from the asymptotic result of Proposition 1.6, as
follows. Applying Green’s theorem to $G(x,y,k)$ and $\overline{G(x,y,k)}$, and using the
fact that they both vanish for $x \in \partial K$, we have


(2.6)  $\mathrm{Im}\, G(x,y,k) = \frac{1}{2i} \int_{S_R} \Big[ G(x,z,k)\, \frac{\partial}{\partial r} \overline{G(y,z,k)} - \overline{G(y,z,k)}\, \frac{\partial}{\partial r} G(x,z,k) \Big]\, dS(z),$

for $R$ large, where $S_R = \{z \in \mathbb{R}^3 : |z| = R\}$. Letting $R \to \infty$, and using (1.34)
and (1.38), gives (2.5) in the limit.
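
As a sanity check on the constant in (2.5), one can look at the free-space case $K = \emptyset$, where $u(x,\xi) = e^{-ix\cdot\xi}$ and $G = g$. The following Python sketch is illustrative only. It assumes the sign convention $g(x,y,k) = -e^{ik|x-y|}/(4\pi|x-y|)$ suggested by (3.21), under which the identity carries the factor $-k/16\pi^2$, and it uses a plain midpoint rule on the sphere. The function names and the sample points are hypothetical.

```python
import cmath
import math

def sphere_integral(func, n_theta=200, n_phi=400):
    """Midpoint-rule quadrature of a complex function over the unit sphere S^2."""
    d_theta = math.pi / n_theta
    d_phi = 2.0 * math.pi / n_phi
    total = 0j
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        sin_t, cos_t = math.sin(theta), math.cos(theta)
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            omega = (sin_t * math.cos(phi), sin_t * math.sin(phi), cos_t)
            total += func(omega) * sin_t  # sin(theta) is the surface-area weight
    return total * d_theta * d_phi

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def im_free_green(x, y, k):
    """Im g(x,y,k) for the ASSUMED convention g = -exp(ik|x-y|) / (4 pi |x-y|)."""
    r = math.sqrt(sum((p - q) ** 2 for p, q in zip(x, y)))
    return -math.sin(k * r) / (4.0 * math.pi * r)

def plane_wave_side(x, y, k):
    """-(k / 16 pi^2) times the S^2 integral of u(x,kw) conj(u(y,kw)), u = exp(-i x.xi)."""
    def integrand(omega):
        u_x = cmath.exp(-1j * k * dot(x, omega))
        u_y = cmath.exp(-1j * k * dot(y, omega))
        return u_x * u_y.conjugate()
    return -(k / (16.0 * math.pi ** 2)) * sphere_integral(integrand)

# Hypothetical sample points and wavenumber.
x, y, k = (0.3, -0.2, 0.5), (-0.4, 0.1, 0.9), 2.0
lhs = im_free_green(x, y, k)
rhs = plane_wave_side(x, y, k)
print(lhs, rhs.real)  # the two numbers should agree up to quadrature error
```

Agreement of the two printed values only tests the free-space normalization; the content of Proposition 2.2 is that the same identity survives when the plane waves are replaced by the distorted waves $u(x,k\omega)$.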


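In view of (2.5), we can write the identity (2.4) as

(2.7)  $\varphi(\sqrt{-\Delta})v(x) = (2\pi)^{-3} \int_{\mathbb{R}^3} \int_\Omega u(x,\xi)\, \overline{u(y,\xi)}\, v(y)\, \varphi(|\xi|)\, dy\, d\xi.$


Therefore, we are motivated to define the following analogues of the Fourier
transform:


(2.8)  $(\Phi v)(\xi) = (2\pi)^{-3/2} \int_\Omega v(y)\, \overline{u(y,\xi)}\, dy$

and

(2.9)  $(\Phi^* w)(x) = (2\pi)^{-3/2} \int_{\mathbb{R}^3} u(x,\xi)\, w(\xi)\, d\xi.$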


We aim to prove that $\Phi$ defines a unitary transformation from $L^2(\Omega)$ onto $L^2(\mathbb{R}^3)$,
with inverse $\Phi^*$. Note that §1 gives the estimate


(2.10)  $|u(x,\xi)| \le 1 + C(\xi)\langle x\rangle^{-1},$


with $C(\xi)$ locally bounded, but we have obtained no bound on $C(\xi)$ as $|\xi| \to \infty$,
so our analysis of $\Phi$ and $\Phi^*$ will require some care. The following results on $\Phi$
and $\Phi^*$ are elementary.


Lemma 2.3. We have 


(2.11)  $\Phi : C_0^\infty(\Omega) \to C(\mathbb{R}^3),$
(2.12) ®* : Lp (R°) — L?(Q)N CMO), 
and 


(2.13)  $(\Phi^* w, v) = (w, \Phi v),$ for $v \in C_0^\infty(\Omega)$, $w \in L^2_{\mathrm{comp}}(\mathbb{R}^3).$


We also note that (2.7) gives

(2.14)  $\varphi(\sqrt{-\Delta})v = \Phi^*\big(\varphi(|\xi|)\Phi v\big),$ for $v \in C_0^\infty(\Omega)$, $\varphi \in C_0^\infty(\mathbb{R}).$

Using these results, we will be able to establish the following.

Proposition 2.4. If $v \in C_0^\infty(\Omega)$, then $\Phi v \in L^2(\mathbb{R}^3)$ and

(2.15)  $\|\Phi v\|_{L^2(\mathbb{R}^3)} = \|v\|_{L^2(\Omega)}.$

Consequently, $\Phi$ has a unique extension to an isometric map

(2.16)  $\Phi : L^2(\Omega) \to L^2(\mathbb{R}^3),$

and $\Phi^*$ has a unique extension to a continuous map

(2.17)  $\Phi^* : L^2(\mathbb{R}^3) \to L^2(\Omega),$

the adjoint of (2.16).
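Proof. Given $\varphi \in C_0^\infty(\mathbb{R})$, $v \in C_0^\infty(\Omega)$, we have


(2.18)  $(\varphi(|\xi|)\Phi v, \Phi v) = (\Phi^* \varphi(|\xi|)\Phi v, v)$   (by (2.13))
        $= (\varphi(\sqrt{-\Delta})v, v).$   (by (2.14))


In other words,

(2.19)  $\int_{\mathbb{R}^3} \varphi(|\xi|)\, |\Phi v(\xi)|^2\, d\xi = (\varphi(\sqrt{-\Delta})v, v).$

Now let $\varphi \nearrow 1$. The monotone convergence theorem applies, so

(2.20)  $\int_{\mathbb{R}^3} |\Phi v(\xi)|^2\, d\xi = (v,v).$

This proves the proposition.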


In order to prove that (2.16) is surjective, and hence unitary, we will need to
know that $\Phi^*$ in (2.17) is injective. Before proving this, it will be useful to
establish the following.


Proposition 2.5. For any even $\varphi \in C_o(\mathbb{R})$ (i.e., $\varphi$ continuous and $\varphi(t) \to 0$ as
$|t| \to \infty$), and for any $w \in L^2(\mathbb{R}^3)$,


(2.21)  $\Phi^*\big(\varphi(|\xi|)w\big) = \varphi(\sqrt{-\Delta})\, \Phi^* w.$




Proof. It suffices to establish this identity for $w \in C_0^\infty(\mathbb{R}^3)$. For such $w$, we have

$(1 - \Delta)\Phi^* w(x) = (2\pi)^{-3/2} \int_{\mathbb{R}^3} u(x,\xi)\, \langle\xi\rangle^2 w(\xi)\, d\xi = \Phi^*(\langle\xi\rangle^2 w),$


the left side a priori a distribution on $\Omega$. By (2.17), we know that $\Phi^*(\langle\xi\rangle^2 w)
\in L^2(\Omega)$. The integral above clearly belongs to $C^\infty(\overline{\Omega})$ and vanishes on $\partial\Omega$.
Thus $\Phi^* w$ belongs to the domain of $\Delta_c^*$, where


$D(\Delta_c) = \{u \in C^\infty(\overline{\Omega}) : u = 0 \text{ on } \partial\Omega,\ \operatorname{supp} u \text{ bounded}\}.$


It follows from Proposition 2.6 of Chap. 8 that $\Delta$ is essentially self-adjoint on
$D(\Delta_c)$, so we conclude that $\Phi^* w$ belongs to the domain of $\Delta$, namely, to
$H_0^1(\Omega) \cap H^2(\Omega)$.

An inductive argument shows that $\Phi^* w$ belongs to the domain of each self-adjoint
operator $(1 - \Delta)^k$ and

$(1 - \Delta)^k \Phi^* w = \Phi^*(\langle\xi\rangle^{2k} w).$

Replacing $w$ by $\langle\xi\rangle^{-2k} w$, we deduce

$\Phi^*(\langle\xi\rangle^{-2k} w) = (1 - \Delta)^{-k} \Phi^* w,$

for all $w \in C_0^\infty(\mathbb{R}^3)$. From this we get


$\Phi^*(|\xi|^{2j} \langle\xi\rangle^{-2k} w) = (-\Delta)^j (1 - \Delta)^{-k} \Phi^* w.$


Consequently, the identity (2.21) is valid for any $\varphi(t) = t^{2j}\langle t\rangle^{-2k}$, $j < k$.
Now the space of finite linear combinations of such $\varphi$ is dense in the space of
even elements of $C_o(\mathbb{R})$, with the sup norm, by the Stone-Weierstrass theorem, so
(2.21) holds in general.


We also have the following dual result. 


Proposition 2.6. For $v \in L^2(\Omega)$, $\varphi \in C_o(\mathbb{R})$ even, we have

(2.22)  $\Phi\big(\varphi(\sqrt{-\Delta})v\big)(\xi) = \varphi(|\xi|)\, (\Phi v)(\xi).$


Proof. Since, by Proposition 2.4, $\Phi$ and $\Phi^*$ are $L^2$-continuous and adjoints of
each other, this follows directly from (2.21).


We now prove the asserted unitarity of $\Phi$ and $\Phi^*$.


Proposition 2.7. The map $\Phi^*$ is injective on $L^2(\mathbb{R}^3)$. Hence the maps (2.16) and
(2.17) are unitary and are inverses of each other.




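Proof. By Proposition 2.5, if $w \in \ker \Phi^*$, then $\varphi(|\xi|)w \in \ker \Phi^*$, for any $\varphi \in
C_0^\infty(\mathbb{R})$. Hence if $\ker \Phi^*$ is nonzero, it contains an element with compact support.
Let $w$ denote such an element. Then


(2.23)  $0 = \int_{\mathbb{R}^3} u(y,\xi)\, \varphi(|\xi|)\, w(\xi)\, d\xi,$ for all $y \in \Omega,$


for any continuous $\varphi$, the integral being absolutely convergent. This being the
case, we can take


(2.24)  $\varphi(|\xi|) = g(x,y,|\xi|).$


Also, we can use $\varphi(|\xi|) = \partial g(x,y,|\xi|)/\partial|y|$, and we can also replace $u(y,\xi)$ by
$\partial u(y,\xi)/\partial|y|$. Consequently, for all $r > R_0$, such that $K \subset \{x \in \mathbb{R}^3 : |x| <
R_0\}$, we have


(2.25)  $\int_{\mathbb{R}^3} \int_{|y|=r} \Big[ u(y,\xi)\, \frac{\partial g}{\partial|y|}(x,y,|\xi|) - g(x,y,|\xi|)\, \frac{\partial u}{\partial|y|}(y,\xi) \Big]\, dS(y)\; w(\xi)\, d\xi = 0,$


for all $x \in \mathbb{R}^3$. In the limit $r \to \infty$, this gives

(2.26)  $0 = \int_{\mathbb{R}^3} w(\xi)\, e^{-ix\cdot\xi}\, d\xi,$ for all $x \in \mathbb{R}^3.$


In other words, the Fourier transform of $w$ vanishes identically. This implies
$w = 0$ and completes the proof.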


If we replace $u(x,\xi) = u_+(x,\xi)$ by $u_-(x,\xi)$, given by (1.43)-(1.45), we can
define the operator $\Phi_-$ by


(2.27)  $(\Phi_- v)(\xi) = (2\pi)^{-3/2} \int_\Omega v(y)\, \overline{u_-(y,\xi)}\, dy.$


The arguments above show that $\Phi_-$ provides a unitary operator from $L^2(\Omega)$
onto $L^2(\mathbb{R}^3)$, and the intertwining property (2.22) also holds for $\Phi_-$. The relation
between $\Phi_-$ and $\Phi$ is important in scattering theory; often we denote $\Phi$ by $\Phi_+$ to
emphasize this.


Exercises 


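1. If $\varphi \in C_0^\infty(\mathbb{R})$ is even, show that the Schwartz kernel of $\varphi(\sqrt{-\Delta})$ is given by


(2.28)  $K_\varphi(x,y) = (2\pi)^{-3} \int_{\mathbb{R}^3} u(x,\xi)\, \overline{u(y,\xi)}\, \varphi(|\xi|)\, d\xi.$


In particular,


(2.29)  $K_\varphi(x,x) = (2\pi)^{-3} \int_{\mathbb{R}^3} |u(x,\xi)|^2\, \varphi(|\xi|)\, d\xi.$


2. Show that (2.29) is also valid for $\varphi(\lambda) = \varphi_t(\lambda) = e^{-t\lambda^2}$, given $t > 0$. (Hint: Let
$\varphi_j \in C_0^\infty(\mathbb{R})$, $\varphi_j \nearrow \varphi_t$.)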
3. Show that the heat kernel $H_t(x,y)$ on $\Omega \times \Omega$ of $e^{t\Delta}$, with Dirichlet boundary condition,
has the pointwise bound


$H_t(x,y) \le (4\pi t)^{-3/2}\, e^{-|x-y|^2/4t}.$


Deduce that, for each $x \in \Omega$,


$(2\pi)^{-3} \int_{\mathbb{R}^3} |u(x,\xi)|^2\, e^{-t|\xi|^2}\, d\xi \le (4\pi t)^{-3/2},$


and hence


(2.30)  $\int_{|\xi| \le R} |u(x,\xi)|^2\, d\xi \le C R^3.$


4. Deduce that (2.28) and (2.29) remain valid for even y € S(R), indeed, for continuous 
even y satisfying |y(A)| < C(A)~47£, e > 0. 

5. Verify that (2.26) follows from (2.25). (Hint: If $e^{-iy\cdot\xi}$ is substituted for $u(y,\xi)$ in
(2.25), Green’s formula applies. If $v(y,\xi)$ is substituted, use the asymptotic behavior to
show that the inner integral tends to 0 as $r \to \infty$.)

6. Produce results parallel to those of this section for $H = -\Delta + V$, given $V \in C_0^\infty(\mathbb{R}^3)$,
real, $u(x,\xi)$ as in Exercise 9 of §1. Show that $\Phi : \mathcal{H}_c \to L^2(\mathbb{R}^3)$ is unitary, where $\mathcal{H}_c$ is
the orthogonal complement of the set of eigenfunctions of $H$ (with negative eigenvalue,
if any). To what extent does $k = 0$ cause a problem?


3. The scattering operator 
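In §2 we produced the two unitary operators


(3.1)  $\Phi_\pm : L^2(\Omega) \to L^2(\mathbb{R}^3),$


defined for $f \in C_0^\infty(\Omega)$ by


(3.2)  $(\Phi_\pm f)(\xi) = (2\pi)^{-3/2} \int_\Omega \overline{u_\pm(y,\xi)}\, f(y)\, dy.$


From these one constructs the unitary operator


(3.3)  $S = \Phi_+ \Phi_-^* : L^2(\mathbb{R}^3) \to L^2(\mathbb{R}^3),$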




called the scattering operator. Recall that $\Phi_+$ and $\Phi_-$ intertwine $\varphi(\sqrt{-\Delta})$ on
$L^2(\Omega)$ with multiplication by $\varphi(|\xi|)$ on $L^2(\mathbb{R}^3)$, for $\varphi \in C_o(\mathbb{R})$. It follows that $S$
commutes with such $\varphi(|\xi|)$:


(3.4)  $S\varphi(|\xi|) = \varphi(|\xi|)S.$


From the definition (3.3) we see that $S$ is uniquely characterized by the property


(3.5)  $S\big(\varphi(|\xi|)\, \overline{u_-(y,\cdot)}\big) = \varphi(|\xi|)\, \overline{u_+(y,\cdot)},$ for all $y \in \Omega,$


for all $\varphi \in C_0^\infty(\mathbb{R})$. We will relate the operator $S$ to “wave operators” in §5.
We aim to establish the following formula for $S$, in terms of the scattering
amplitude $a(\omega,\theta,k)$ defined in §1.


Proposition 3.1. For $g \in C_0^\infty(\mathbb{R}^3)$, we can write

(3.6)  $(Sg)(\xi) = S(k)\, g(k\omega), \quad \xi = k\omega,\ \omega \in S^2,$


where, for each $k \in \mathbb{R}^+$, $S(k)$ is a unitary operator on $L^2(S^2)$ given by


(3.7)  $S(k)f(\omega) = f(\omega) + \frac{k}{2\pi i} \int_{S^2} a(\omega,\theta,k)\, f(\theta)\, d\theta.$


Proof. Let

(3.8)  $w(y,k\omega) = u_+(y,k\omega) - u_-(y,k\omega) = v_+(y,k\omega) - v_-(y,k\omega).$


The assertion above is equivalent to the integral identity


(3.9)  $w(y,k\omega) = \frac{ik}{2\pi} \int_{S^2} a_-(\omega,\theta,k)\, u_+(y,k\theta)\, d\theta.$


In order to prove this, note that, since $w(x,k\omega) = 0$ for $x \in \partial K$, Green’s theorem
gives, for $R > |y|$,


(3.10)  $w(y,k\omega) = \int_{S_R} \Big[ w(x,k\omega)\, \frac{\partial G}{\partial r}(y,x,k) - G(y,x,k)\, \frac{\partial w}{\partial r}(x,k\omega) \Big]\, dS(x),$


where $S_R = \{x : |x| = R\}$. Now let $R \to \infty$. Using the asymptotic behavior
(1.34) and (1.38) for $G(y,x,k)$ and its radial derivative (with $x$ and $y$ interchanged
and $\omega$ replaced by $\theta$) and the asymptotic behavior, for $|x| = R \to \infty$,
$x = R\theta$,


(3.11)  $w(x,k\omega) \sim \frac{e^{ikR}}{R}\, a(-\omega,\theta,k) - \frac{e^{-ikR}}{R}\, a_-(\omega,\theta,k),$
        $\frac{\partial w}{\partial r}(x,k\omega) \sim ik\, \frac{e^{ikR}}{R}\, a(-\omega,\theta,k) + ik\, \frac{e^{-ikR}}{R}\, a_-(\omega,\theta,k),$

with

(3.12)  $a_-(\omega,\theta,k) = \overline{a(\omega,\theta,k)},$


as in (1.46)-(1.47), we see that the integrand in (3.10) is asymptotic to


(3.13)  $\frac{2ik}{4\pi R^2}\, a_-(\omega,\theta,k)\, u_+(y,k\theta) + o(R^{-2}),$


(the terms involving $e^{2ikR}$ canceling out), so passing to the limit $R \to \infty$ gives
(3.9) and proves the proposition.


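We can rewrite the formula (3.7) as


(3.14)  $S(k) = I + \frac{k}{2\pi i}\, A(k),$

with

(3.15)  $A(k)f(\omega) = \int_{S^2} a(\omega,\theta,k)\, f(\theta)\, d\theta.$


Note that unitarity of $S(k)$ on $L^2(S^2)$ is equivalent to the identity


(3.16)  $\frac{1}{2i}\big[A(k)^* - A(k)\big] = \frac{k}{4\pi}\, A(k)^* A(k),$


that is, to the integral identity


(3.17)  $\frac{1}{2i}\big[\overline{a(\theta,\omega,k)} - a(\omega,\theta,k)\big] = \frac{k}{4\pi} \int_{S^2} \overline{a(\eta,\omega,k)}\, a(\eta,\theta,k)\, d\eta.$


The special case of this where $\omega = \theta$ is known as the optical theorem:


(3.18)  $\mathrm{Im}\, a(\omega,\omega,k) = -\frac{k}{4\pi} \int_{S^2} |a(\eta,\omega,k)|^2\, d\eta.$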
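
The passage from unitarity to (3.16) is a short computation. A sketch, assuming (3.14) reads $S(k) = I + cA(k)$ with $c = k/(2\pi i)$, so that $\bar{c} = -c$ and $c^2 = -k^2/4\pi^2$:

```latex
\begin{aligned}
S(k)^* S(k) &= (I - cA^*)(I + cA) = I + c\,(A - A^*) - c^2 A^* A \\
            &= I - \frac{k}{2\pi i}\,(A^* - A) + \frac{k^2}{4\pi^2}\, A^* A .
\end{aligned}
```

Setting this equal to $I$ and multiplying by $\pi/k$ gives $\frac{1}{2i}(A^* - A) = \frac{k}{4\pi} A^* A$, which is (3.16). Passing to integral kernels, with $A(k)^*$ having kernel $\overline{a(\theta,\omega,k)}$, gives (3.17).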


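It is useful to know integral identities for the scattering amplitude. We note one
that follows from the characterization

(3.19)  $v(r\theta,k\omega) \sim r^{-1} e^{ikr}\, a(-\omega,\theta,k), \quad r \to \infty,$

and the integral identity (a consequence of Green’s identity)


(3.20)  $v(x,k\omega) = \int_{\partial K} \Big[ v(y,k\omega)\, \frac{\partial g}{\partial\nu_y}(x,y,k) - g(x,y,k)\, \frac{\partial v}{\partial\nu}(y,k\omega) \Big]\, dS(y).$


We evaluate the integrand on the right as $x = r\theta$, $r \to \infty$. Using (1.36), that is,

(3.21)  $g(x,y,k) \sim -(4\pi r)^{-1} e^{ikr} e^{-ik\theta\cdot y}, \quad x = r\theta,\ r \to \infty,$


we find from (3.19) and (3.20) that


(3.22)  $a(\omega,\theta,k) = -\frac{1}{4\pi} \int_{\partial K} e^{ik(\omega-\theta)\cdot y}\, ik\,\big(\nu(y)\cdot\theta\big)\, dS(y) + \frac{1}{4\pi} \int_{\partial K} e^{-ik\theta\cdot y}\, \frac{\partial v}{\partial\nu}(y,-k\omega)\, dS(y).$


The first term on the right side of (3.22) can be written as


(3.23)  $\frac{k}{4\pi i} \int_{\partial K} \big(\nu(y)\cdot\theta\big)\, e^{ik(\omega-\theta)\cdot y}\, dS(y) = \frac{k}{4\pi i}\, \theta \cdot A_K\big(k(\omega-\theta)\big),$


where, for $\xi \in \mathbb{R}^3$,


(3.24)  $A_K(\xi) = \int_{\partial K} \nu(y)\, e^{i\xi\cdot y}\, dS(y).$


The function $A_K(\xi)$ clearly extends to an entire analytic function of $\xi \in \mathbb{C}^3$. For
$\xi \in \mathbb{R}^3$ tending to infinity, one can (typically) find the asymptotic behavior of
$A_K(\xi)$ via the stationary phase method. Note that


(3.25)  $A_K(0) = 0.$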


One way of writing the last term in (3.22) is the following. For any real $k$,
or more generally for $\mathrm{Im}\, k \ge 0$, define the Neumann operator $N(k)$ on $f \in
H^1(\partial K)$ to be the value of $\partial v/\partial\nu$ in $L^2(\partial K)$, where $v$ is the unique solution to
the scattering problem (1.1)-(1.3). Define the functions $e_\xi$ on $\partial K$ by


(3.26)  $e_\xi(y) = e^{iy\cdot\xi}, \quad y \in \partial K.$


Then the last term in (3.22) is 




1 
(3.27) a (N Hen €n0) pany: 


Consequently, the formula for the scattering amplitude can be written as 


1 
—— (Nk) enw; €x0) 12(0K): 


(3.28) a(w,0,k) = xy : Ax (k(w —6))+ if 


We will investigate the Neumann operator further in §7.
We can produce a variant of the formula (3.22) by using $G(x,y,k)$ instead of
$g(x,y,k)$ in (3.20). We then get


(3.29)  $v(x,k\omega) = -\int_{\partial K} e^{-ik\omega\cdot y}\, \frac{\partial G}{\partial\nu_y}(x,y,k)\, dS(y).$

Using the limiting behavior for $G(x,y,k)$ as $|x| \to \infty$, which follows from (1.34),
we have


(3.30)  $a(\omega,\theta,k) = \frac{1}{4\pi} \int_{\partial K} e^{ik\omega\cdot y}\, \frac{\partial u}{\partial\nu}(y,k\theta)\, dS(y).$


If we write $u(y,k\theta) = e^{-ik\theta\cdot y} + v(y,k\theta)$, this becomes a sum of two terms. The
first is identical to the first term in (3.22), while the second differs from the second
term in (3.22) precisely by the replacement of $(\omega,\theta)$ by $(-\theta,-\omega)$. From this
observation, we can derive the following identity, called the reciprocity relation:


(3.31)  $a(\omega,\theta,k) = a(-\theta,-\omega,k).$


To see this, it suffices to show that $k(\omega+\theta)\cdot A_K(k(\omega-\theta)) = 0$. Since $\omega+\theta$ and
$\omega-\theta$ are orthogonal for unit $\omega$ and $\theta$, this is equivalent to the observation that


(3.32)  $A_K(\xi)$ is parallel to $\xi$, for $\xi \in \mathbb{R}^3,$


and this follows easily from Green’s theorem.
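
A sketch of why (3.25) and (3.32) hold, assuming (3.24) defines $A_K(\xi) = \int_{\partial K} \nu(y)\, e^{i\xi\cdot y}\, dS(y)$ with $\nu$ a unit normal to $\partial K$ (the overall sign below depends on whether $\nu$ points into or out of $K$, which does not affect the conclusion):

```latex
A_K(\xi) = \pm \int_K \nabla_y\, e^{i\xi\cdot y}\, dy
         = \pm\, i\,\xi \int_K e^{i\xi\cdot y}\, dy ,
\qquad\text{so}\qquad
(\omega+\theta)\cdot A_K\big(k(\omega-\theta)\big)
  = \pm\, ik\,\big(|\omega|^2 - |\theta|^2\big) \int_K e^{ik(\omega-\theta)\cdot y}\, dy = 0 .
```

The first equality is the divergence theorem applied componentwise. It shows that $A_K(\xi)$ is a scalar multiple of $\xi$, and in particular that $A_K(0) = 0$.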


Exercises 


1. Show that (3.5) follows from


$\int u_-(x,\xi)\, \varphi(|\xi|)\, \overline{u_-(y,\xi)}\, d\xi = \int u_+(x,\xi)\, \varphi(|\xi|)\, \overline{u_+(y,\xi)}\, d\xi,$


which in turn follows from (2.28).

2. Fill in the details on the identities (3.25) and (3.32) for $A_K(\xi)$, and then on the
reciprocity relation (3.31). What is the intuitive content of (3.31)?

3. If you set $S_- = \Phi_- \Phi_+^*$, obtain an analogue of (3.7), with $a(\omega,\theta,k)$ replaced by
$a_-(\omega,\theta,k)$.




4. In case $f = -e^{-ikx\cdot\omega}$ on $\partial K$, with corresponding scattered wave $v$, show that the flux $\Phi$
studied in Exercise 1 of §1 is given by


o(w,k) = lim rk / |v(a, kw)|? dS (a) 


|z|=r 


= nk f Ja(—w, 8, b)) do. 
S2 


We call o(w,k) the scattering cross section. Using the optical theorem and the reci- 
procity relation (3.31), show that 


o(w,k) = —4n? Im a(w,w, k). 
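5. Generalizing (3.22), show that, for $f \in H^1(\partial K)$,

(3.33)  $B_K(k)f(r\theta) \sim r^{-1} e^{ikr}\, A_K(k)f(\theta) + o(r^{-1}),$

as $r \to \infty$, where


(3.34)  $A_K(k)f(\theta) = \frac{1}{4\pi} \int_{\partial K} e^{-ik\theta\cdot y}\, \big[ik\,(\nu(y)\cdot\theta)f(y) + N(k)f(y)\big]\, dS(y).$
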
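6. Make a parallel study of the scattering operator for $H = -\Delta + V$, $V \in C_0^\infty(\mathbb{R}^3)$, real-valued,
using results from the exercises in §§1 and 2. To begin, use the unitary operators
$\Phi_\pm : \mathcal{H}_c \to L^2(\mathbb{R}^3)$ to construct $S = \Phi_+ \Phi_-^*$. Show that, parallel to (3.22),


$a(-\omega,\theta,k) = -\frac{1}{4\pi} \int V(y)\, u(y,k\omega)\, e^{-ik\theta\cdot y}\, dy,$

or equivalently,


(3.35)  $a(\omega,\theta,k) = -\Big(\frac{\pi}{2}\Big)^{1/2} \hat{V}\big(k(\theta-\omega)\big) - \frac{1}{4\pi} \int V(y)\, v(y,-k\omega)\, e^{-ik\theta\cdot y}\, dy.$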


4. Connections with the wave equation 


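The initial-value problem for the wave equation on $\mathbb{R} \times \Omega$, with Dirichlet boundary
conditions on $\mathbb{R} \times \partial K$, is of the following form:


(4.1)  $\frac{\partial^2 u}{\partial t^2} - \Delta u = 0,$


(4.2)  $u(0,x) = f(x), \quad u_t(0,x) = g(x),$

for $t \in \mathbb{R}$, $x \in \Omega$, with


(4.3)  $u(t,x) = 0,$ for $x \in \partial K.$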




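As we know, given $f \in H_0^1(\Omega)$, $g \in L^2(\Omega)$, there is a unique solution $u$ belonging
to $C(\mathbb{R}, H_0^1(\Omega)) \cap C^1(\mathbb{R}, L^2(\Omega))$ to this problem, given in terms of functions
of the self-adjoint operator $\Delta$ on $L^2(\Omega)$, with domain $H_0^1(\Omega) \cap H^2(\Omega)$, as


(4.4)  $u(t,x) = (\cos t\Lambda)f(x) + (\Lambda^{-1} \sin t\Lambda)g(x),$

where

(4.5)  $\Lambda = (-\Delta)^{1/2}$


is the unique nonnegative, self-adjoint square root of $-\Delta$. Recall that the domain
of $\Lambda$ is precisely $D(\Lambda) = H_0^1(\Omega)$. Alternatively, we can write


(4.6)  $\begin{pmatrix} u \\ u_t \end{pmatrix} = U(t) \begin{pmatrix} f \\ g \end{pmatrix},$


where $U(t)$ is the one-parameter group of operators on $H_0^1(\Omega) \oplus L^2(\Omega)$ given by


(4.7)  $U(t) = \begin{pmatrix} \cos t\Lambda & \Lambda^{-1} \sin t\Lambda \\ -\Lambda \sin t\Lambda & \cos t\Lambda \end{pmatrix}.$


Using either of the unitary operators


(4.8)  $\Phi_\pm : L^2(\Omega) \to L^2(\mathbb{R}^3),$


we can write


(4.9)  $(\cos t\Lambda)f = \Phi_\pm^{-1} \cos t|\xi|\; \Phi_\pm f,$
       $(\Lambda^{-1} \sin t\Lambda)g = \Phi_\pm^{-1} |\xi|^{-1} \sin t|\xi|\; \Phi_\pm g.$


Note that $\Phi_\pm$ also provide isomorphisms


(4.10)  $\Phi_\pm : H_0^1(\Omega) \to L^2(\mathbb{R}^3, \langle\xi\rangle^2 d\xi).$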


The group $U(t)$ is not a uniformly bounded group of operators on the Hilbert
space $H_0^1(\Omega) \oplus L^2(\Omega)$. Indeed, with $f = 0$, we see from (4.4) that the best uniform
estimate on $\|u(t,\cdot)\|_{L^2(\Omega)}$ is


(4.11)  $\|u(t,\cdot)\|_{L^2(\Omega)} \le |t|\, \|g\|_{L^2(\Omega)}.$
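
The estimate (4.11) and its sharpness can be read off from the functional calculus. A sketch, writing $\Lambda = (-\Delta)^{1/2}$ as in (4.5) and using only the elementary bound $|\sin s| \le |s|$:

```latex
\|\Lambda^{-1} \sin(t\Lambda)\, g\|_{L^2(\Omega)}
  \le \sup_{\lambda > 0} \Big| \frac{\sin t\lambda}{\lambda} \Big| \; \|g\|_{L^2(\Omega)}
  = |t|\, \|g\|_{L^2(\Omega)},
\qquad
\lim_{\lambda \to 0^+} \frac{\sin t\lambda}{\lambda} = t .
```

Since the spectrum of $\Lambda$ fills $[0,\infty)$, the supremum is approached by data $g$ spectrally concentrated near $\lambda = 0$, so no bound independent of $t$ is possible.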


There is another Hilbert space on which $U(t)$ naturally acts as a group of unitary
operators, namely the space


(4.12)  $\mathcal{E} = \mathcal{H} \oplus L^2(\Omega),$


where $\mathcal{H}$ is the completion of $H_0^1(\Omega)$ with respect to the norm given by


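(4.13)  $\|f\|_{\mathcal{H}}^2 = \|\Lambda f\|_{L^2(\Omega)}^2 = \int_\Omega |\nabla f(x)|^2\, dx.$


(Recall that $\|f\|_{H^1(\Omega)}^2 = \|f\|_{L^2(\Omega)}^2 + \|\Lambda f\|_{L^2(\Omega)}^2$.) If we equip $\mathcal{H}$ with this norm,
then $\Phi_\pm$ extend to unitary operators


(4.14)  $\Phi_\pm : \mathcal{H} \to L^2(\mathbb{R}^3, |\xi|^2 d\xi).$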


Since unitary operators are special, it is natural to use the Hilbert space (4.12)
rather than $H_0^1(\Omega) \oplus L^2(\Omega)$. We will denote an element of $\mathcal{E}$ by $(f,g)$;
$f \in \mathcal{H}$, $g \in L^2(\Omega)$. When $U(t)$ is applied, this is treated as a column vector,
as in (4.6); we will also use the column vector notation for elements of $\mathcal{E}$ when
convenient.

Elements of $\mathcal{H}$ need not belong to $L^2(\Omega)$, though they do belong to $L^2_{\mathrm{loc}}(\Omega)$.
In fact, if $B$ is a bounded subset of $\Omega$, the estimate


(4.15)  $\|u\|_{L^2(B)} \le C_B \|u\|_{\mathcal{H}}$


can be established by the argument used to prove Proposition 5.2 in Chap. 4,
provided $K$ has nonempty interior. Since clearly $\int_B |\nabla u|^2\, dx \le \int_\Omega |\nabla u|^2\, dx$,
we hence have


(4.16)  $\|u\|_{H^1(B)} \le C_B \|u\|_{\mathcal{H}}.$


Further estimates are given in the exercises.
The unitarity of $U(t)$ on $\mathcal{E}$ reflects the conservation of total energy, given by


(4.17)  $E(u(t)) = \|(u,u_t)\|_{\mathcal{E}}^2 = \int_\Omega \big(|\nabla_x u(t,x)|^2 + |u_t(t,x)|^2\big)\, dx.$


There is also the notion of local energy, given as follows. For a bounded subset $B$
of $\Omega$, set


(4.18)  $E_B(u(t)) = \int_B \big(|\nabla_x u(t,x)|^2 + |u_t(t,x)|^2\big)\, dx.$

Using the absolute continuity of the spectrum of $\Delta$ on $L^2(\Omega)$ established in §2,
or more precisely, the absolute continuity of the spectrum of a related operator
specified below, we will establish the following result on local energy decay.


Proposition 4.1. Given $(f,g) \in \mathcal{E}$, $(u,u_t) = U(t)(f,g)$, we have


(4.19)  $E_B(u(t)) \to 0,$ as $|t| \to \infty,$

for any bounded $B \subset \Omega$.


Before starting the proof of this proposition, we will make some further comments
on the infinitesimal generator of the unitary group $U(t)$ on $\mathcal{E}$. This is a
skew-adjoint operator, and it has the form


(4.20)  $B = \begin{pmatrix} 0 & I \\ -A & 0 \end{pmatrix},$


where, for $f \in D(A) \subset \mathcal{H}$,

(4.21)  $Af = -\Delta f$


in the distributional sense. Then $B^2$ is a self-adjoint operator of the form


(4.22)  $B^2 = -\begin{pmatrix} A_1 & 0 \\ 0 & A_2 \end{pmatrix},$


where $A_1$ is self-adjoint on $\mathcal{H}$, $A_2$ is self-adjoint on $L^2(\Omega)$, and they both satisfy
(4.21), on their respective domains. Note that the unitary operators


$\Phi_\pm : \mathcal{H} \oplus L^2(\Omega) \to L^2(\mathbb{R}^3, |\xi|^2 d\xi) \oplus L^2(\mathbb{R}^3)$


intertwine (4.20) with multiplication (on each factor) by $|\xi|^2$ and


$D(\Phi_\pm B^2 \Phi_\pm^{-1}) = L^2(\mathbb{R}^3, |\xi|^2 \langle\xi\rangle^4 d\xi) \oplus L^2(\mathbb{R}^3, \langle\xi\rangle^4 d\xi).$


In particular, the operators $A_1$ and $A_2$ have only absolutely continuous spectrum.
Let


(4.23)  $L_j = A_j^{1/2}$

be their unique nonnegative, self-adjoint square roots. Both $L_1$ and $L_2$ are intertwined
via $\Phi_\pm$ with multiplication by $|\xi|$, so we can identify them, denoting them
by $L$, and if $(u,u_t) = U(t)(f,g)$, we have


(4.24)  $u(t) = (\cos tL)f + (L^{-1} \sin tL)g,$
        $u_t(t) = (-L \sin tL)f + (\cos tL)g.$


We now begin the proof of Proposition 4.1. Since $U(t)$ is unitary and
$E_B(u(t)) \le E(u(t)) = \|(u,u_t)\|_{\mathcal{E}}^2$, we see that it suffices to prove the proposition
for $(f,g)$ in a dense subset of $\mathcal{E}$. In particular, we will take


(4.25)  $f \in D(L_1) \subset \mathcal{H}, \quad g \in D(L_2) \subset L^2(\Omega).$




Lemma 4.2. If $f$ and $g$ satisfy (4.25), then, as $|t| \to \infty$,


(4.26)  $u(t) \to 0$ weakly in $D(L_1)$ and
        $u_t(t) \to 0$ weakly in $D(L_2)$.


Proof. Fix $w_0 \in D(L_1)$, $w_1 \in D(L_2)$. Note that


(4.27)  $\Phi_\pm f \in L^2(\mathbb{R}^3, |\xi|^2 \langle\xi\rangle^2 d\xi),$


and so on, so using the images under $\Phi_\pm$ to justify the inner-product calculations,
and noting that, by (4.27),


(4.28)  $Lf \in H^1(\Omega), \quad L^2 f \in L^2(\Omega), \quad Lg \in L^2(\Omega)$

(and similarly for $w_0$, $w_1$), we obtain


(4.29)  $(u(t), w_0)_{D(L_1)} = (Lu(t), Lw_0)_{\mathcal{H}} + (u(t), w_0)_{\mathcal{H}}$
        $= (L^2 u(t), L^2 w_0)_{L^2} + (Lu(t), Lw_0)_{L^2}.$


To examine each term, write (with $j = 1$ or 2)

(4.30)  $(L^j u(t), L^j w_0)_{L^2} = \big((L^j \cos tL)f + (L^{j-1} \sin tL)g,\ L^j w_0\big)_{L^2}$
        $= \int_0^\infty (\cos t\lambda)\, d(F_\lambda L^j f, L^j w_0) + \int_0^\infty (\sin t\lambda)\, d(F_\lambda L^{j-1} g, L^j w_0),$


where $F_\lambda$ is the spectral measure of $L_2$. In light of (4.28) and the absolute continuity
of $F_\lambda$, it follows that $d(F_\lambda L^j f, L^j w_0)$ and $d(F_\lambda L^{j-1} g, L^j w_0)$ are finite
measures on $\mathbb{R}$ that are absolutely continuous with respect to Lebesgue measure.
Hence (4.30) is the Fourier transform of an $L^1$-function on $\mathbb{R}$. Thus the Riemann-Lebesgue
lemma implies that this tends to 0 as $|t| \to \infty$. Similarly,


(4.31)  $(u_t(t), w_1)_{D(L_2)} = (Lu_t(t), Lw_1)_{L^2} + (u_t(t), w_1)_{L^2}.$

This time, to examine each term, write (with $j = 0$ or 1)


(4.32)  $(L^j u_t(t), L^j w_1)_{L^2} = \big((-L^{j+1} \sin tL)f + (L^j \cos tL)g,\ L^j w_1\big)_{L^2}$
        $= -\int_0^\infty (\sin t\lambda)\, d(F_\lambda L^{j+1} f, L^j w_1) + \int_0^\infty (\cos t\lambda)\, d(F_\lambda L^j g, L^j w_1).$




Again the Riemann—Lebesgue lemma applies, and the proof of Lemma 4.2 is 
complete. 
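
The decay mechanism in this proof is the Riemann-Lebesgue lemma applied to an absolutely continuous measure. The following Python sketch illustrates it on a toy example only: the density $e^{-\lambda}$ on $[0,\infty)$ is a hypothetical stand-in for the density of $d(F_\lambda \cdot, \cdot)$ and is not derived from any obstacle. For this density the oscillatory integral has the closed form $1/(1+t^2)$, which the quadrature reproduces.

```python
import math

def simpson(func, a, b, n):
    """Composite Simpson rule with n (even) subintervals on [a, b]."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0

def density(lam):
    # Hypothetical L^1 density standing in for the spectral density.
    return math.exp(-lam)

def oscillatory_integral(t, cutoff=40.0, n=20000):
    """Approximates the integral over [0, infinity) of cos(t*lam) * density(lam)."""
    return simpson(lambda lam: math.cos(t * lam) * density(lam), 0.0, cutoff, n)

# Compare the quadrature with the closed form 1 / (1 + t^2) at a few times t.
for t in (0.0, 1.0, 5.0, 20.0):
    print(t, oscillatory_integral(t), 1.0 / (1.0 + t * t))
```

A measure with a point mass would behave differently: the corresponding term $\cos(t\lambda_0)$ does not decay, which is why absolute continuity of the spectrum is the relevant hypothesis.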


To derive local energy decay from this, we reason as follows. For any $R < \infty$,
set


(4.33)  $\Omega_R = \{x \in \Omega : |x| < R\}.$

Then, for $f \in \mathcal{H}$, if $\tau_R f = f|_{\Omega_R}$, by (4.16) we have


(4.34)  $\|\tau_R f\|_{H^1(\Omega_R)} \le C_R \|f\|_{\mathcal{H}}.$


Similarly, for any $f \in D(L_1)$,


(4.35)  $\|\tau_R f\|_{H^2(\Omega_R)} \le C_R \|f\|_{D(L_1)}.$


Thus, restricted to $\Omega_R$, $u(t)$ is bounded in $H^2(\Omega_R)$ and $u_t(t)$ is bounded in
$H^1(\Omega_R)$, for $t \in \mathbb{R}$, given the hypothesis (4.25) on the initial data. Thus these two
families of functions on $\Omega_R$ are relatively compact in $H^1(\Omega_R)$ and $L^2(\Omega_R)$, respectively,
by Rellich’s theorem. The weak convergence to zero of (4.26) hence implies the
strong convergence to zero:


(4.36)  $u(t) \to 0$ in $H^1(\Omega_R)$, $\quad u_t(t) \to 0$ in $L^2(\Omega_R)$,


as $|t| \to \infty$, whenever $f$ and $g$ satisfy (4.25). Proposition 4.1 is hence proved on
the dense set given by (4.25), and as we remarked before, that proves it in general.

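Instead of representing $(f,g) \in \mathcal{E}$ as a pair of functions, $L^2$ with respect to
different weights, via $\Phi_\pm$, it is often convenient to use the following construction,
of Lax-Phillips. Namely, for $f, g \in C_0^\infty(\Omega)$, define $\Psi_\pm(f,g)$ on $\mathbb{R} \times S^2$ by


(4.37)  $\Psi_\pm \begin{pmatrix} f \\ g \end{pmatrix}(k,\omega) = \frac{k^2}{\sqrt{2}\,(2\pi)^{3/2}} \int_\Omega f(x)\, \overline{U_\pm(x,k,\omega)}\, dx + \frac{ik}{\sqrt{2}\,(2\pi)^{3/2}} \int_\Omega g(x)\, \overline{U_\pm(x,k,\omega)}\, dx.$


This is the same as the (formally computed) $\mathcal{E}$-inner product


(4.38)  $\big((f,g),\ (U_\pm(\cdot,k,\omega),\ ikU_\pm(\cdot,k,\omega))\big)_{\mathcal{E}},$


times $2^{-1/2}(2\pi)^{-3/2}$. Note that $e^{ikt}U_\pm(x,k,\omega)$ solves the wave equation, with
Cauchy data $(U_\pm(x,k,\omega), ikU_\pm(x,k,\omega))$. In terms of the operators $\Phi_\pm$, studied
before, we can write (4.37) as $1/\sqrt{2}$ times


(4.39)  $k^2(\Phi_\pm f)(k\omega) + ik(\Phi_\pm g)(k\omega),$ for $k > 0,$
        $k^2(\Phi_\mp f)(k\omega) + ik(\Phi_\mp g)(k\omega),$ for $k < 0.$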




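Note that $f \in \mathcal{H} \Leftrightarrow \Phi_\pm f \in L^2(\mathbb{R}^3, |\xi|^2 d\xi) \Leftrightarrow |\xi|^2 \Phi_\pm f \in L^2(\mathbb{R}^3, |\xi|^{-2} d\xi)$, or,
switching to polar coordinates,


(4.40)  $f \in \mathcal{H} \Longleftrightarrow k^2(\Phi_\pm f)(k\omega) \in L^2(\mathbb{R}^+ \times S^2, dk\, d\omega).$

Similarly,

(4.41)  $g \in L^2(\Omega) \Longleftrightarrow k(\Phi_\pm g)(k\omega) \in L^2(\mathbb{R}^+ \times S^2, dk\, d\omega).$


Therefore, for $(f,g) \in \mathcal{E}$, the quantity (4.39) belongs to


(4.42)  $L^2(\mathbb{R} \times S^2, dk\, d\omega) = L^2(\mathbb{R}, N),$

with

(4.43)  $N = L^2(S^2).$


We can now establish the following.


Proposition 4.3. For each choice of sign, $\Psi_\pm$ provides a unitary map of $\mathcal{E}$ onto
$L^2(\mathbb{R}, N)$.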


Proof. It is clear that the restrictions of $\Psi_\pm$ to $\mathcal{H} \oplus 0$ and to $0 \oplus L^2(\Omega)$ are both
isometries, by the arguments leading to (4.40) and (4.41). Also, it is easy to see
that the images of these spaces under $\Psi_\pm$ are mutually orthogonal, so $\Psi_\pm$ is an
isometry of $\mathcal{E}$ into $L^2(\mathbb{R}, N)$. To show that it is surjective, we show how to solve
for $(f,g) \in \mathcal{E}$ the pair of equations


(4.44)  $\Phi_\pm(Lf + ig) = u_0, \quad \Phi_\mp(Lf - ig) = u_1,$


for arbitrary $u_0, u_1 \in L^2(\mathbb{R}^3)$. Inverting the unitary operators $\Phi_\pm$ and $\Phi_\mp$, we
reduce this to a trivial system for $Lf + ig$ and $Lf - ig$, easily solved for $f \in \mathcal{H}$, $g \in
L^2(\Omega)$, since $L : \mathcal{H} \to L^2(\Omega)$ is an isomorphism. This proves the proposition.


The maps $\Psi_\pm$ intertwine the evolution group $U(t)$ with a simple multiplication
operator:


Proposition 4.4. We have, for $\varphi \in L^2(\mathbb{R}, N)$,


(4.45)  $\Psi_\pm U(t) \Psi_\pm^{-1} \varphi(k,\omega) = e^{-ikt} \varphi(k,\omega).$


Proof. This follows directly from the intertwining properties of $\Phi_\pm$, given (4.39)
and the following computation:




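(4.46)  $k^2(\Phi_\pm u(t))(k\omega) + ik(\Phi_\pm u_t(t))(k\omega)$
        $= k^2\big[(\cos kt)\Phi_\pm f + (k^{-1} \sin kt)\Phi_\pm g\big] + ik\big[-k(\sin kt)\Phi_\pm f + (\cos kt)\Phi_\pm g\big]$
        $= k^2 e^{-ikt}\, \Phi_\pm f + ik\, e^{-ikt}\, \Phi_\pm g,$


for $k > 0$, with a similar computation for $k < 0$.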
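
The computation above is pure trigonometry at each fixed frequency, so it can be checked numerically. In the following Python sketch the complex numbers `phi_f` and `phi_g` are hypothetical stand-ins for the values $(\Phi_\pm f)(k\omega)$ and $(\Phi_\pm g)(k\omega)$ at one point; the evolution is taken from (4.24) with $L$ acting as multiplication by $k$.

```python
import cmath
import math

def evolve(k, t, phi_f, phi_g):
    """Spectral-side values of (u(t), u_t(t)) at frequency k, following (4.24)."""
    phi_u = math.cos(k * t) * phi_f + (math.sin(k * t) / k) * phi_g
    phi_ut = -k * math.sin(k * t) * phi_f + math.cos(k * t) * phi_g
    return phi_u, phi_ut

def spectral_rep(k, a, b):
    """The combination k^2 a + i k b that appears in (4.39), without the 1/sqrt(2)."""
    return k * k * a + 1j * k * b

def defect(k, t, phi_f, phi_g):
    """|left side - right side| of the identity: evolving multiplies by exp(-ikt)."""
    phi_u, phi_ut = evolve(k, t, phi_f, phi_g)
    lhs = spectral_rep(k, phi_u, phi_ut)
    rhs = cmath.exp(-1j * k * t) * spectral_rep(k, phi_f, phi_g)
    return abs(lhs - rhs)

# Hypothetical sample values; one positive and one negative frequency.
print(defect(1.7, 0.9, 0.3 - 0.2j, -1.1 + 0.5j))
print(defect(-2.3, 4.0, 0.3 - 0.2j, -1.1 + 0.5j))
```

Both printed defects should be at the level of floating-point rounding. The same formula covers negative $k$ because $\cos kt$, $k^{-1}\sin kt$, and $k \sin kt$ are even in $k$.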


The unitary maps discussed above are called “spectral representations” for
$U(t)$. In §6 we will study related maps, called “translation representations.” Note
that in the case $K = \emptyset$, the functions $U_\pm(x,k,\omega)$ become $U_0(x,k,\omega) = e^{-ikx\cdot\omega}$,
and both spectral representations coincide. We denote this free-space spectral
representation by $\Psi_0$. It is a unitary map of $\mathcal{E}_0 = \mathcal{H}_0 \oplus L^2(\mathbb{R}^3)$ onto $L^2(\mathbb{R}, N)$, given
in terms of the Fourier transform by


(4.47)  $\Psi_0 \begin{pmatrix} f \\ g \end{pmatrix}(k,\omega) = \frac{k^2}{\sqrt{2}}\, \hat{f}(k\omega) + \frac{ik}{\sqrt{2}}\, \hat{g}(k\omega).$


Here, $\mathcal{H}_0$ is the completion of $C_0^\infty(\mathbb{R}^3)$ with respect to the norm $\|\nabla f\|_{L^2(\mathbb{R}^3)}$,
mapped unitarily by the Fourier transform onto $L^2(\mathbb{R}^3, |\xi|^2 d\xi)$.


Exercises 


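Let $\varphi \in \mathcal{S}(\mathbb{R})$ be an even function in the following exercises. Let $\Lambda = \sqrt{-\Delta}$, as in
(4.5), and let $K_\varphi(x,y)$ be the Schwartz kernel of $\varphi(\Lambda)$, as in (2.28). Let $\Delta_0$ be the
free-space Laplacian on $\mathbb{R}^3$, $\Lambda_0 = \sqrt{-\Delta_0}$, and let $K_\varphi^0(x,y)$ be the Schwartz kernel of
$\varphi(\Lambda_0)$, so, parallel to (2.28),


$K_\varphi^0(x,y) = (2\pi)^{-3} \int_{\mathbb{R}^3} e^{-i(x-y)\cdot\xi}\, \varphi(|\xi|)\, d\xi.$


Let $D_\varphi(x,y) = K_\varphi(x,y) - K_\varphi^0(x,y)$, where $K_\varphi(x,y)$ is set equal to 0 if $x \in K$ or
$y \in K$.

1. Use the formula

(4.48)  $\varphi(\Lambda) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^\infty \hat{\varphi}(t) \cos t\Lambda\, dt$


together with finite propagation speed to show that

$\operatorname{supp} \hat{\varphi}(t) \subset \{|t| \le T\} \Longrightarrow \operatorname{supp} D_\varphi(x,y) \subset \{|x|, |y| \le R + T\}$


if $K \subset B_R(0)$.

2. Use (4.48) to show that, for some $J = J(\alpha,\beta)$,


$|D_x^\alpha D_y^\beta K_\varphi(x,y)| \le C\big[\|\hat{\varphi}\|_{L^1(\mathbb{R})} + \|D^J \hat{\varphi}\|_{L^1(\mathbb{R})}\big],$


for $x, y \in \Omega$.

3. Use Exercises 1 and 2 to show that when $\varphi \in \mathcal{S}(\mathbb{R})$ is even, then $D_\varphi(x,y)$ is rapidly
decreasing and is the Schwartz kernel of a trace class operator on $L^2(\mathbb{R}^3)$.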




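4. Let $\mathcal{H}^1(\mathbb{R}^n)$ denote the completion of $C_0^\infty(\mathbb{R}^n)$ with respect to the norm in (4.13).
Show that if $n \ge 3$, there is a natural injective map


$\iota : \mathcal{H}^1(\mathbb{R}^n) \to \mathcal{S}'(\mathbb{R}^n)$

and the Fourier transform maps $\mathcal{H}^1(\mathbb{R}^n)$ isomorphically onto

$\mathcal{F}\mathcal{H}^1(\mathbb{R}^n) = \{u \in L^1_{\mathrm{loc}}(\mathbb{R}^n) : |\xi|u(\xi) \in L^2(\mathbb{R}^n)\} = L^2(\mathbb{R}^n, |\xi|^2 d\xi).$


5. Show that, for $n \ge 3$,

$L^2(\mathbb{R}^n, |\xi|^2 d\xi) \subset L^q_{\mathrm{loc}}(\mathbb{R}^n, d\xi),$

provided $1 \le q < 2n/(n+2)$. Conclude that if $n \ge 3$, any $\hat{u} \in \mathcal{F}\mathcal{H}^1(\mathbb{R}^n)$ can
be written as a sum of an element of $L^2(\mathbb{R}^n)$ and a compactly supported element of
$L^q(\mathbb{R}^n)$, given $q \in [1, 2n/(n+2))$.
Show that $L^2(\mathbb{R}^2, |\xi|^2 d\xi)$ is not contained in $L^1_{\mathrm{loc}}(\mathbb{R}^2)$.

6. Let $\psi_\sigma(\xi)$ be the Fourier transform of $\langle x\rangle^{-\sigma}$. Show that if $q \in [1,2)$, then


$g \in L^q_{\mathrm{comp}}(\mathbb{R}^n) \Longrightarrow \psi_\sigma * g \in L^2(\mathbb{R}^n),$


provided $\sigma > (2-q)n/2q$. (Hint: Interpolate between easy cases.)

7. Show that if $n \ge 3$ and $\sigma > 1$, then


(4.49)  $\mathcal{H}^1(\mathbb{R}^n) \subset L^2(\mathbb{R}^n, \langle x\rangle^{-2\sigma} dx).$


Note that this extends the estimate (4.15) in several ways.

8. Show that if $n \ge 3$,


(4.50)  $\mathcal{H}^1(\mathbb{R}^n) \subset L^{2n/(n-2)}(\mathbb{R}^n).$


Show that this result implies (4.49).
Reconsider this problem after reading §2 of Chap. 13.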


5. Wave operators 


In this section we examine the asymptotic behavior of the unitary group $U(t)$ on $\mathcal{E}$,
as $t \to \pm\infty$. More precisely, we show that, as $t \to \pm\infty$, $U(t)M_\chi U_0(-t)(f,g)$
converges to a limit, $W_\pm(f,g)$; the operators $W_\pm$ are called wave operators,
and they are easily seen to be isometries from $\mathcal{E}_0$ into $\mathcal{E}$. Here, $\mathcal{E}$ is the space
constructed in §4 for $\Omega = \mathbb{R}^3 \setminus K$, $\mathcal{E}_0$ that for the region $\Omega_0 = \mathbb{R}^3$, and
$U_0(t)$ the “free-space” evolution operator for $\mathbb{R}^3$; $M_\chi$ is multiplication by a
function $\chi \in C^\infty(\mathbb{R}^3)$, equal to zero in a neighborhood of $K$, and equal to 1
outside a bounded set. We will show that $W_\pm$ have as right inverses operators
$\Omega_\pm = \Psi_0^{-1}\Psi_\pm$, where $\Psi_\pm$ are the unitary operators constructed in §4; $\Psi_0$ is the
corresponding operator constructed for $\Omega_0 = \mathbb{R}^3$. Since $\Omega_\pm$ are unitary, it will
follow from this that the wave operators are also unitary.

We begin with the following observation, a simple consequence of Huygens’
principle. Suppose $f$ and $g$ are in $C_0^\infty(\mathbb{R}^3)$, supported in $B_R = \{x \in \mathbb{R}^3 :
|x| \le R\}$. Then, for $|t| > R$,


(5.1)  $U_0(t)(f,g) = 0,$ for $|x| < |t| - R.$


This follows directly from the formula for the fundamental solution to the wave
equation on $\mathbb{R} \times \mathbb{R}^3$, which, recall from Chap. 3, is


(5.2)  $R(t,x) = (4\pi t)^{-1}\, \delta(|x| - |t|).$


Consequently, if $K \subset B_R$ and if $f$ and $g$ are supported in $B_{R_0}$, then

(5.3)  $U(s)U_0(-s)(f,g) = U(R+R_0)U_0(-R-R_0)(f,g),$ for $s > R + R_0,$


with a similar identity for $s < -R - R_0$. We can insert an $M_\chi$ between the two
unitary factors on the left if $\chi(x) = 1$ for $|x| \ge R$, without altering anything. It
follows that


(5.4)  $W_\pm(f,g) = \lim_{t \to \pm\infty} U(-t) M_\chi U_0(t)(f,g)$


exists, for $(f,g)$ in the dense subset of $\mathcal{E}_0$ consisting of compactly supported functions.
Consequently, the limits exist on all of $\mathcal{E}_0$, and the operators $W_\pm$, called
wave operators, are isometries from $\mathcal{E}_0$ into $\mathcal{E}$. A major result, established below,
is that these operators are actually unitary, from $\mathcal{E}_0$ onto $\mathcal{E}$.

In fact, consider the following operators:


(5.5)  $\Omega_\pm = \Psi_0^{-1} \Psi_\pm : \mathcal{E} \to \mathcal{E}_0.$


By Proposition 4.3 we know $\Omega_\pm$ are unitary. We aim to establish the following
result.


Proposition 5.1. We have


(5.6)  $\Omega_+ W_+ = I$ and $\Omega_- W_- = I$ on $\mathcal{E}_0.$


In order to prepare to prove this, we introduce the following set of initial data
for the wave equation. If $R$ is sufficiently large that $K \subset B_R$, set


(5.7)  $D_0^+(R) = \{(f,g) \in C_0^\infty(\mathbb{R}^3) \oplus C_0^\infty(\mathbb{R}^3) : U_0(t)(f,g) = 0,$ for $t > 0,\ |x| < R + t\}.$


In particular, $f$ and $g$ vanish near $K$, and we can regard $(f,g)$ as an element of $\mathcal{E}_0$
or of $\mathcal{E}$, and


(5.8)  $(f,g) \in D_0^+(R) \Longrightarrow U_0(t)(f,g) = U(t)(f,g),$ for $t > 0.$


Clearly,


(5.9)  $U_0(t) D_0^+(R) \subset D_0^+(R),$ for $t > 0,$


though not for $t < 0$. Also, by the argument involving Huygens’ principle discussed
above, it is clear that


(5.10)  $\bigcup_{t<0} U_0(t) D_0^+(R)$ is dense in $\mathcal{E}_0.$


Note that (5.8) implies

(5.11)  $W_+ = I$ on $D_0^+(R).$

Our first step in establishing Proposition 5.1 is the following.
Lemma 5.2. We have

(5.12)  $\Omega_+ = I$ on $D_0^+(R).$

Proof. This is equivalent to the identity

(5.13)  $\Psi_+ = \Psi_0$ on $D_0^+(R),$

which in turn follows from the identity

(5.14)  $\big((f,g),\ (V_+(\cdot,k,\omega),\ ikV_+(\cdot,k,\omega))\big)_{\mathcal{E}} = 0,$


for $k \in \mathbb{R}$, $\omega \in S^2$, $(f,g) \in D_0^+(R)$. (Here $V_+$ is the function defined in (1.51).)
Note that the left side of (5.14) is equal to $e^{ikt}$ times


(5.15)  $\big(U_0(t)(f,g),\ (V_+, ikV_+)\big)_{\mathcal{E}},$


for any $t > 0$. We will show that, for $t$ large, this can be dominated by a small
quantity. Indeed, an examination of $(u(t), u_t(t)) = U_0(t)(f,g)$ via the formula
(5.2) for the Riemann function shows that, for $t$ large and positive, $\nabla_x u(t,x)$ is
approximately radial, and $u_t(t,x) \sim -u_r(t,x)$. Thus (5.15) is equal to


(5.16)  $\int_\Omega \Big[ u_r(t,x)\, \frac{\partial \overline{V_+}}{\partial r} + \overline{(ik)}\, u_t(t,x)\, \overline{V_+} \Big]\, dx + o(1),$


as $t \to +\infty$. In light of the radiation condition for $V_+$, the two terms in this
integral cancel out, up to a remainder that vanishes as $t \to +\infty$; this proves the
lemma.


In view of (5.11), we now know that


(5.17)  $\Omega_+ W_+ = I$ on $D_0^+(R).$




Now it follows easily from the definition that 


(5.18)  $W_\pm U_0(t) = U(t) W_\pm$, for all $t$,


and from Proposition 4.4 it follows that 


(5.19)  $\Omega_\pm U(t) = U_0(t)\,\Omega_\pm$, for all $t$.


Given that (5.17) holds when applied to $U_0(t)(f,g)$, provided this belongs to
$D_0^+(R)$, we deduce that


(5.20)  $\Omega_+ W_+(f,g) = (f,g)$, for $(f,g) \in U_0(-t) D_0^+(R)$, $t > 0$;

in other words,


(5.21)  $\Omega_+ W_+ = I$ on $\bigcup_{t>0} U_0(-t) D_0^+(R)$.


In light of (5.10), this implies that $\Omega_+ W_+ = I$ on $\mathcal{E}_0$, establishing the first identity
in (5.6). The second identity is proved in the same fashion, and Proposition 5.1
is done.
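
The deduction of (5.20) from (5.17)-(5.19) can be written out as a single chain. The following is only a sketch; $(f',g')$ is an auxiliary name (not used in the text) for an element of $D_0^+(R)$, and $(f,g) = U_0(-t)(f',g')$ with $t > 0$.

```latex
\[
\begin{aligned}
\Omega_+ W_+ (f,g)
  &= \Omega_+ W_+ U_0(-t)(f',g') \\
  &= \Omega_+ U(-t)\, W_+ (f',g')      && \text{by (5.18)} \\
  &= U_0(-t)\, \Omega_+ W_+ (f',g')    && \text{by (5.19)} \\
  &= U_0(-t)(f',g') = (f,g)            && \text{by (5.17)}.
\end{aligned}
\]
```

Since $\Omega_+ W_+$ is bounded, the density statement (5.10) then extends the identity from this union to all of $\mathcal{E}_0$.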

The unitarity of $\Omega_\pm$ then gives the following result, known as the completeness
of the wave operators.


Corollary 5.3. The wave operators $W_\pm$ are unitary from $\mathcal{E}_0$ onto $\mathcal{E}$. We have the
identities


(5.22)  $W_\pm = \Psi_\pm^{-1}\Psi_0.$


Note that (5.6) implies the surjectivity of $\Omega_\pm$, hence of $\Psi_\pm$, since the invertibility
of $\Psi_0$ is obvious (just the Fourier inversion formula). Thus the proof of
Proposition 5.1 contains an alternative proof of Proposition 4.3, and hence of
Proposition 2.7.

The operator 


(5.23)  $S_0 = W_+^{-1} W_- = \Psi_0^{-1}\big(\Psi_+\Psi_-^{-1}\big)\Psi_0,$


a unitary operator on $\mathcal{E}_0$, is often called the scattering operator. In view of the simple
nature of $\Psi_0$, it is equally convenient to call the unitary operator on $L^2(\mathbb{R}, N)$:


(5.24)  $\mathcal{S} = \Psi_+\Psi_-^{-1},$

also a scattering operator. Note that, if we make the identification


$L^2(\mathbb{R}, N) = L^2(\mathbb{R}^+ \times S^2) \oplus L^2(\mathbb{R}^- \times S^2),$




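and follow with the natural unitary map $L^2(\mathbb{R}^\pm \times S^2) \to L^2(\mathbb{R}^3)$ involving polar
coordinates, we can write


(5.25)  $\Psi_+\Psi_-^{-1} = \begin{pmatrix} \Phi_+\Phi_-^{-1} & 0 \\ 0 & \Phi_-\Phi_+^{-1} \end{pmatrix}.$


The operator $S = \Phi_+\Phi_-^{-1}$ is the scattering operator studied in §3; the other operator,
$\Phi_-\Phi_+^{-1} = S^{-1}$, appears in Exercise 2 of §3.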

Another consequence of the unitarity of the wave operators is the following 
nontrivial variant of (5.10). 


Proposition 5.4. Pick $R$ so that $K \subset B_R$. Then


(5.26)  $\bigcup_{t<0} U(t) D_0^+(R)$ is dense in $\mathcal{E}$.


Proof. Any $(f,g) \in \mathcal{E}$ can be written in the form


(5.27)  $(f,g) = W_+(f_0,g_0),$


with $(f_0,g_0) \in \mathcal{E}_0$. Approximate $(f_0,g_0)$ to within $\varepsilon$ in the $\mathcal{E}_0$-norm by $(f_1,g_1) \in
C_0^\infty(\mathbb{R}^3) \oplus C_0^\infty(\mathbb{R}^3)$. Then, for all $t > R + R_0$ sufficiently large,


(5.28)  $W_+(f_1,g_1) = U(-t)\,U_0(t)(f_1,g_1),$


by (5.3) and (5.4), and by the Huygens principle argument given there, for any
such $t = t_0$,


(5.29)  $U_0(t_0)(f_1,g_1) = (f_2,g_2) \in D_0^+(R).$


Since $(f,g) - U(-t_0)(f_2,g_2)$ has $\mathcal{E}$-norm less than $\varepsilon$, the proposition is proved.


We can also produce a formula for $W_\pm^{-1}$ of a form similar to (5.4) but not
involving an arbitrary choice, for example, of M,,. Note that there is a natural 
isometric mapping 


(5.30)  $J : \mathcal{E} \longrightarrow \mathcal{E}_0$


defined on (f, g) by extending these functions to be zero on K. We have tacitly 
used this before. We now establish the following. 


Proposition 5.5. For any $(f,g) \in \mathcal{E}$,


(5.31)  $W_\pm^{-1}(f,g) = \lim_{t\to\pm\infty} U_0(-t)\, J\, U(t)(f,g).$


Proof. For simplicity we analyze $W_+^{-1}$. By (5.28), for $(f_1,g_1) \in C_0^\infty(\mathbb{R}^3) \oplus
C_0^\infty(\mathbb{R}^3)$ supported in $B_{R_0}$, we have

(5.32)  $J\,U(t)\,W_+(f_1,g_1) = U_0(t)(f_1,g_1),$
for all $t > R + R_0$. This is equivalent to

(5.33)  $U_0(-t)\, J\, U(t)(f_3,g_3) = W_+^{-1}(f_3,g_3),$


for $(f_3,g_3) = W_+(f_1,g_1)$, $t > R + R_0$. This gives (5.31) on a dense subset of $\mathcal{E}$,
hence on all of $\mathcal{E}$ in view of the uniform boundedness of $U(t)$ and $U_0(t)$.
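
The equivalence of (5.32) and (5.33) amounts to applying $U_0(-t)$ and renaming. A short sketch, using only the invertibility of $W_+$ from Corollary 5.3:

```latex
\[
\begin{aligned}
U_0(-t)\, J\, U(t)\, W_+(f_1,g_1) &= U_0(-t)\,U_0(t)(f_1,g_1) = (f_1,g_1), \\
(f_3,g_3) = W_+(f_1,g_1) \ &\Longrightarrow\ (f_1,g_1) = W_+^{-1}(f_3,g_3), \\
\text{hence}\quad U_0(-t)\, J\, U(t)(f_3,g_3) &= W_+^{-1}(f_3,g_3), \qquad t > R + R_0 .
\end{aligned}
\]
```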


Exercises 


The following exercises deal with the existence and completeness of Schrödinger wave
operators:


(5.34)  $W_\pm f = \lim_{t\to\pm\infty} e^{itH} e^{-itH_0} f,$


where $H_0 = -\Delta$, $H = -\Delta + V$, acting on functions on $\mathbb{R}^n$.

1. Show that $W_\pm \in \mathcal{L}(L^2(\mathbb{R}^n))$ exists provided that, for each $f \in C_0^\infty(\mathbb{R}^n)$,


(5.35)  $\int_0^\infty \|V e^{\mp itH_0} f\|_{L^2}\, dt < \infty.$


(Hint: $e^{itH} e^{-itH_0} f - f = i\int_0^t e^{isH}\, V e^{-isH_0} f\, ds$.) Note that when $W_\pm$ exist, they are
isometries (i.e., $\|W_\pm f\| = \|f\|$ for all $f \in L^2(\mathbb{R}^n)$).

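2. Show that $f \in C_0^\infty(\mathbb{R}^n)$ implies $\|e^{itH_0} f\|_{L^\infty} \le C\langle t\rangle^{-n/2}$. Deduce that $W_\pm$ exists if
$V \in L^2(\mathbb{R}^n)$. (Hint: $e^{it\Delta}\delta(x) = (4\pi i t)^{-n/2} e^{-|x|^2/4it}$.)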

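3. Show that if $q = 2/(1-\theta) \in [2,\infty)$, then $f \in C_0^\infty(\mathbb{R}^n)$ implies


$\|e^{itH_0} f\|_{L^q(\mathbb{R}^n)} \le C\langle t\rangle^{-n\theta/2}.$


Deduce that $W_\pm$ exists if $V \in L^r(\mathbb{R}^n)$, with $r < n$. In particular, $W_\pm$ exists provided
$|V(x)| \le C\langle x\rangle^{-\sigma}$, $\sigma > 1$.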

4. Show that, for any $f, g \in L^2(\mathbb{R}^n)$, $(g, e^{-itH_0} f) \to 0$ as $|t| \to \infty$. Use this to show
that if $g_j$ is an eigenfunction of $H$, then $(e^{-itH} g_j, e^{-itH_0} f) \to 0$ as $|t| \to \infty$, for all
$f \in L^2(\mathbb{R}^n)$. Hence, for $W_\pm$ given by (5.34), $R(W_\pm) \subset \mathcal{H}_c$.

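5. Suppose $V \in C_0^\infty(\mathbb{R}^3)$, so we have $\Phi_\pm$ by Exercise 5 of §2. Let $\Phi_0$ be the inverse
Fourier transform. Show that $\Phi_\pm W_\pm \Phi_0^{-1}$ commutes with multiplication by
$e^{is|\xi|^2}$ for all $s \in \mathbb{R}$. Hence it commutes with $\varphi(|\xi|)$ for all $\varphi \in C_0(\mathbb{R})$. (Hint:
$W_\pm = e^{isH}\, W_\pm\, e^{-isH_0}$.)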

6. When the conditions of Exercise 1 hold, show that


Weis lim a cottteith _—itHo dt 
and hence that 
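(5.36)  $((W_\pm - I)f, g) = \lim_{\varepsilon \searrow 0}\ i \int_0^{\pm\infty} \big(e^{itH}\, V e^{-itH_0} f,\ g\big)\, e^{-\varepsilon |t|}\, dt.$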




7. Choosing the + sign, show that the integral on the right side of (5.36) is equal to 


[Lf EDGvOle we pe) WH ae a at 


(5.37) 
= | [OV e) [eto — le? + te)" 10) Jur @H ae a, 


(Hint: Use ©. to intertwine e*’” with e*”°.) 


8. If $V \in C_0^\infty(\mathbb{R}^3)$, show that the limit of (5.37) as $\varepsilon \searrow 0$ is equal to


~ik|e—y| 
5 | ff CDOV@) — Hoe ay ae a, 


provided $f \in C_0^\infty(\mathbb{R}^3)$ and $(\Phi_+ g)(\xi)$ is supported on $|\xi| \in [a,b] \subset\subset (0,\infty)$. Here
$k = |\xi|$. Using (1.58), write this as


= / ji (Bg © voy. Fly) dy dé = —(©4 f, 4g) + (Gof, O49). 


9. Using the previous exercises, show that, given $V \in C_0^\infty(\mathbb{R}^3)$ (real-valued), we have
$(W_\pm f, g) = (\Phi_0 f, \Phi_\pm g)$ for all $f, g \in L^2(\mathbb{R}^3)$, hence


$W_\pm = \Phi_\pm^* \Phi_0.$


Deduce the completeness of the wave operators: $R(W_\pm) = \mathcal{H}_c$.


Compare arguments in Chap.5 of [Si], dealing with a larger class of potentials. 
Completeness for a nearly maximal class of potentials to which Exercise 3 applies is 
treated in Chap. 13 of [RS]. Long-range potentials are treated in Chap. 3 of [Ho]. 


6. Translation representations and the Lax–Phillips
semigroup Z(t)


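From the “spectral representations” $\Psi_\pm : \mathcal{E} \to L^2(\mathbb{R}, N)$ defined in §4, which,
as shown in Proposition 4.4, intertwine $U(t)$ with multiplication by $e^{-ikt}$, we
construct “translation representations,” unitary operators


(6.1)  $\mathcal{T}_\pm : \mathcal{E} \longrightarrow L^2(\mathbb{R}, N),$


by taking the Fourier transform with respect to $k$:


(6.2)  $\mathcal{T}_\pm \begin{pmatrix} f \\ g \end{pmatrix}(s,\omega) = (2\pi)^{-1/2} \int_{-\infty}^{\infty} e^{iks}\, \Psi_\pm \begin{pmatrix} f \\ g \end{pmatrix}(k,\omega)\, dk.$


Consequently, Proposition 4.4 implies


(6.3)  $\mathcal{T}_\pm U(t)\, \mathcal{T}_\pm^{-1} f(s,\omega) = f(s - t, \omega).$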


The operators $\mathcal{T}_\pm$ are useful for exposing various features of $U(t)$, and we explore
this in the current section. We begin with a look at the free-space translation
representation $\mathcal{T}_0$, a unitary map from $\mathcal{E}_0$ onto $L^2(\mathbb{R}, N)$ given by using $\Psi_0$ in
(6.2).

We can produce an explicit formula for $\mathcal{T}_0$ using the formula (4.47) for $\Psi_0$,
which we recall is


(6.4) oN? te ¢ ) (k,w) = k? f(kw) + ikg(kw). 


The formula for $\mathcal{T}_0$ is expressed naturally in terms of the Radon transform, which
is defined (initially for $f \in \mathcal{S}(\mathbb{R}^3)$) by


(6.5)  $Rf(s,\omega) = \int_{y\cdot\omega = s} f(y)\, dS(y),$


for $s \in \mathbb{R}$, $\omega \in S^2$. Note that the Fourier transform can be expressed as


(6.6)  $\hat{f}(k\omega) = (2\pi)^{-3/2} \int_{-\infty}^{\infty} e^{-iks}\, Rf(s,\omega)\, ds.$

Thus, taking the inverse Fourier transform in $k$, we have

(6.7)  $Rf(s,\omega) = (2\pi)^{1/2} \int_{-\infty}^{\infty} e^{iks}\, \hat{f}(k\omega)\, dk.$


In light of this, we see that taking the Fourier transform with respect to $k$ of (6.4)
gives 


(6.8) To () (s,w) = LORI (s,w) + O,Rg(s,w)]. 


The unitarity of $\mathcal{T}_0$ gives rise to the inversion formula


Fle) = 5— f blew) de 
20 
(6.9) ( 
g(a) = — 5. f ako w,w) dw, 
S2 


for $(f,g)$ in terms of


(6.10)  $k(s,\omega) = \mathcal{T}_0 \begin{pmatrix} f \\ g \end{pmatrix}(s,\omega).$


This result is related to the Radon inversion formula,


(6.11)  $f(x) = -\frac{1}{8\pi^2} \int_{S^2} \partial_s^2 Rf(x\cdot\omega, \omega)\, d\omega,$


which can be deduced from (6.9), or directly from (6.6) and the Fourier inversion 
formula. 
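
As a sanity check on the normalizations, one can test (6.5), (6.6), and the Radon inversion formula on a Gaussian. The sketch below assumes the convention $\hat f(\xi) = (2\pi)^{-3/2}\int f(x)\, e^{-ix\cdot\xi}\, dx$ (which is what (6.6) requires) and the constant $-1/(8\pi^2)$ in the inversion formula; the test function is an illustrative choice, not taken from the text.

```latex
\[
\begin{aligned}
f(x) &= e^{-|x|^2/2}, \qquad
Rf(s,\omega) = e^{-s^2/2}\int_{\mathbb{R}^2} e^{-|z|^2/2}\,dz = 2\pi\, e^{-s^2/2}, \\
(2\pi)^{-3/2}\int_{-\infty}^{\infty} e^{-iks}\, Rf(s,\omega)\,ds
  &= (2\pi)^{-3/2}\cdot 2\pi \cdot (2\pi)^{1/2}\, e^{-k^2/2} = e^{-k^2/2} = \hat f(k\omega), \\
\partial_s^2 Rf(s,\omega) &= 2\pi\,(s^2-1)\,e^{-s^2/2}, \\
-\frac{1}{8\pi^2}\int_{S^2} \partial_s^2 Rf(0,\omega)\,d\omega
  &= -\frac{1}{8\pi^2}\cdot 4\pi\cdot(-2\pi) = 1 = f(0).
\end{aligned}
\]
```

The last line is the inversion formula evaluated at $x = 0$, where $x\cdot\omega = 0$ for every $\omega$.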

In view of (6.3), for $\mathcal{T}_0$, we see that the solution to the free-space wave equation
$u_{tt} - \Delta u = 0$ with initial data $(f,g)$ can be written as


(6.12) u(t, xv) = ~ k(a-w—t,w) dw, 


where k(s,w) is given by (6.10). More fully, by (6.8), 


(6.13) u(t,x) = [erste -w—t,w)+OsRo(x-w —t,w)] dw. 


S2 


82 


Note that if $f$ and $g$ are supported in $B_{R_0} = \{|x| \le R_0\}$, then, by (6.5), $Rf(s,\omega)$
and $Rg(s,\omega)$ vanish for $|s| > R_0$. Therefore, $Rf(x\cdot\omega - t, \omega)$ and $Rg(x\cdot\omega - t, \omega)$
vanish for $|x| < |t| - R_0$. Thus from (6.13) we rederive the Huygens principle,
that $u(t,x)$ vanishes for $|x| < |t| - R_0$ in this case.

Use of $\mathcal{T}_0$ and $\mathcal{T}_\pm$ will augment arguments involving the Huygens principle
made in §5. We introduce the space


(6.14)  $D^+(R) = \{(f,g) \in \mathcal{E}_0 : U_0(t)(f,g) = 0 \text{ for } t > 0,\ |x| < R + t\}.$


Note that $D_0^+(R)$, defined by (5.7), consists of the elements of $D^+(R)$ that are
smooth and compactly supported. Similarly, set


(6.15)  $D^-(R) = \{(f,g) \in \mathcal{E}_0 : U_0(t)(f,g) = 0 \text{ for } t < 0,\ |x| < R + |t|\}.$


For $R = 0$, we denote these spaces simply by $D^+$ and $D^-$, respectively. From
(6.12) it is clear that if $\mathcal{T}_0(f,g)(s,\omega)$ is supported in $s \ge R$ (resp., $s \le -R$), then
$(f,g)$ belongs to $D^+(R)$ (resp., $D^-(R)$). Furthermore, the converse result is true:


Proposition 6.1. The transformation $\mathcal{T}_0 : \mathcal{E}_0 \to L^2(\mathbb{R}, N)$ maps $D^+(R)$ (resp.,
$D^-(R)$) onto the space of functions in $L^2(\mathbb{R}, N)$ supported in $[R, \infty)$ (resp., supported
in $(-\infty, -R]$), for any $R \ge 0$. In particular, $D^+$ and $D^-$ are orthogonal
complements of each other in $\mathcal{E}_0$.


In order to prove this proposition, it suffices to demonstrate that if $(f,g) \in \mathcal{E}_0$
belongs to $D^+$, then $k(s,\omega) = \mathcal{T}_0(f,g)$ vanishes for $s < 0$. This comes down to
showing that, if $k \in L^2(\mathbb{R}, N)$ and if the integral (6.12) vanishes for $t > |x|$, then
$k(s,\omega) = 0$ for $s < 0$. Applying a mollifier, we can suppose $k \in C^\infty(\mathbb{R}, N)$.
Since $\mathcal{T}_0$ clearly commutes with rotations, it suffices to prove this for $k(s,\omega)$ of the
form $k(s,\omega) = K(s)\varphi(\omega)$, where $\varphi$ is an eigenfunction of the Laplace operator
on $S^2$. So suppose


(6.16) u(t, 2) = ~ / K(a-w—t)p(w) dw 
S2 


vanishes for $t > |x|$. Since this implies $D_x^\alpha u(t,0) = 0$ for $t > 0$, for all $\alpha$, we
have


(6.17)  $0 = \partial_s^{|\alpha|} K(-t) \int_{S^2} \omega^\alpha \varphi(\omega)\, d\omega, \quad t > 0,$


for all $\alpha$. Since, by the Stone-Weierstrass theorem, $\{\omega^\alpha\}$ has dense linear span in
$C(S^2)$, there exists $\alpha$ such that the integral in (6.17) is nonvanishing. This implies
that $\partial_s^{|\alpha|} K(-t) = 0$ for $t > 0$, so $K(t)$ coincides with a polynomial in $t$ for $t < 0$.
Since $K \in L^2(\mathbb{R})$, this implies $K(t) = 0$ for $t < 0$, and the proposition is proved.

Now we look at the maps $\mathcal{T}_\pm : \mathcal{E} \to L^2(\mathbb{R}, N)$, in the presence of an obstacle
$K$, which we suppose is contained in a ball $B_R$. Note that $D^\pm(R)$ can be regarded
as subspaces both of $\mathcal{E}_0$ and of $\mathcal{E}$. Lemma 5.2 (specifically (5.13)), which was
important in the last section, immediately implies the following.


Proposition 6.2. We have
(6.18)  $\mathcal{T}_+ = \mathcal{T}_0$ on $D_0^+(R)$ and $\mathcal{T}_- = \mathcal{T}_0$ on $D_0^-(R)$.


The potential usefulness of this is indicated by the next result.


Proposition 6.3. The properties (6.3) and (6.18) uniquely characterize $\mathcal{T}_+$ and
$\mathcal{T}_-$ as continuous linear maps.


Proof. Equations (6.3) and (6.18) specify $\mathcal{T}_+$ on $U(t) D_0^+(R)$ for all $t \in \mathbb{R}$. By
Proposition 5.4, the union of these spaces is dense in $\mathcal{E}$, so the result follows for
$\mathcal{T}_+$. The proof for $\mathcal{T}_-$ is similar.


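Note that we can set $\tilde{\mathcal{T}}_\pm = \mathcal{T}_0$ on $D^\pm(R)$ and since $U(t) = U_0(t)$, for $t \ge 0$
on $D^+(R)$ and for $t \le 0$ on $D^-(R)$, we can extend $\tilde{\mathcal{T}}_\pm$ so that (6.3) holds. The
uniqueness result above then implies $\tilde{\mathcal{T}}_\pm = \mathcal{T}_\pm$, so we have


(6.19)  $\mathcal{T}_+ = \mathcal{T}_0$ on $D^+(R)$ and $\mathcal{T}_- = \mathcal{T}_0$ on $D^-(R)$,


sharpening (6.18).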

If we use the translation representations $\mathcal{T}_\pm$ in place of the spectral representations
$\Psi_\pm$, the scattering operator defined by (5.24) is replaced by the unitary
operator on $L^2(\mathbb{R}, N)$:


(6.20)  $S = \mathcal{T}_+ \mathcal{T}_-^{-1}.$


The operator $S$ clearly commutes with translations. It also possesses the following
important property.


Proposition 6.4. We have
(6.21)  $S : L^2((-\infty, -R], N) \longrightarrow L^2((-\infty, R], N).$


Proof. $\mathcal{T}_-^{-1}$ maps $L^2((-\infty, -R], N)$ onto $D^-(R)$, which is orthogonal to
$D^+(R)$, as a consequence of Proposition 6.1. Since $\mathcal{T}_+$ maps $D^+(R)$ onto
$L^2([R, \infty), N)$ and is unitary, it must map $D^-(R)$ into the orthogonal complement
of $L^2([R, \infty), N)$; this proves (6.21).


Now the action of S on L?(R,N) is given by multiplication by a unitary 
operator-valued function S(k), similar to the action of S in terms of S(k) dis- 
cussed in §3. The action of S on L?(R,N) is then given by convolution by an 
operator-valued tempered distribution S(s), the Fourier transform of S(k). From 
(6.21) we conclude that S(s) is supported in the half-line $(-\infty, 2R]$. It follows
that S(k) extends to be a holomorphic, operator-valued function in the half-space
Im k > 0, a fact that can also be seen directly from an analysis of the scattering
amplitude $a(\omega, \theta, k)$, in view of the relation established in §3. We will study the
meromorphic continuation of these objects into the lower half-plane in §7.

We now look at a semigroup of operators, introduced by P. Lax and R. Phillips,
defined as follows. Fixing $R$ such that $K \subset B_R$, set


(6.22)  $\mathcal{K} = \big(D^+(R) \oplus D^-(R)\big)^\perp,$

the orthogonal complement in $\mathcal{E}$. For $t \ge 0$, define

(6.23)  $Z(t) = P_{\mathcal{K}}\, U(t)\, P_{\mathcal{K}},$

where $P_{\mathcal{K}}$ is the orthogonal projection of $\mathcal{E}$ onto $\mathcal{K}$.

Proposition 6.5. $Z(t)$ is a strongly continuous semigroup of operators on $\mathcal{K}$, so
(6.24)  $Z(t+s) = Z(t)Z(s)$, for $t, s \ge 0$.


Proof. If $(f_j, g_j) \in \mathcal{K}$, then $U(t)(f_1,g_1) \perp D^-(R)$ for $t \ge 0$, and furthermore
$U(-s)(f_2,g_2) \perp D^+(R)$ for $s \ge 0$. Hence, for $s, t \ge 0$,


(6.25)  $\big(U(-s)(f_2,g_2),\ P_{\mathcal{K}} U(t)(f_1,g_1)\big)_{\mathcal{E}} = \big(U(-s)(f_2,g_2),\ U(t)(f_1,g_1)\big)_{\mathcal{E}}.$


Thus $P_{\mathcal{K}} U(s) P_{\mathcal{K}} U(t) P_{\mathcal{K}} = P_{\mathcal{K}} U(s+t) P_{\mathcal{K}}$, which implies (6.24). The strong
continuity is obvious.


We note that the Lax–Phillips semigroup $Z(t)$ can also be expressed as


(6.26)  $Z(t) = P_+ U(t) P_- \quad (t \ge 0),$


where $P_\pm$ is the orthogonal projection of $\mathcal{E}$ onto $D^\pm(R)^\perp$. To see this, note that


$P_{\mathcal{K}} = P_+ P_- = P_- P_+.$


Since $U(t)$ leaves $D^+(R)$ invariant, $P_+ U(t) P_+ = P_+ U(t)$, for $t \ge 0$. Similarly,
$P_- U(t) P_- = U(t) P_-$, for $t \ge 0$, so


(6.27)  $P_{\mathcal{K}} U(t) P_{\mathcal{K}} = P_- P_+ U(t) P_+ P_- = P_- P_+ U(t) P_- = P_+ P_- U(t) P_- = P_+ U(t) P_-.$


Since $Z(t)$ is a strongly continuous semigroup on $\mathcal{K}$, it has a generator $C$,
whose resolvent is given by


(6.28)  $(\lambda - C)^{-1} = \int_0^\infty e^{-\lambda t} Z(t)\, dt, \quad \operatorname{Re} \lambda > 0.$

The following result gives important spectral information on $Z(t)$.
Proposition 6.6. For any $T > 2R$, $\lambda > 0$,
(6.29)  $(\lambda - C)^{-1} Z(T)$ is compact.


We can derive this from the following result, of independent interest. Given
$\rho \in C_0^\infty(\mathbb{R}^+)$, let


(6.30)  $Z(\rho) = \int_0^\infty \rho(t) Z(t)\, dt.$

Define $U(\rho)$ and $U_0(\rho)$ similarly.

Proposition 6.7. If $\rho \in C_0^\infty((2R, \infty))$, then


(6.31)  $Z(\rho) = P_+ \big[U(\rho) - U_0(\rho)\big] P_-.$
Proof. Since it is easy to see that
(6.32)  $P_+ U_0(t) P_- = 0$, for $t \ge 2R$,


this is clear from the formula (6.26).




Now to prove (6.29), it suffices to show that $Z(\rho)$ is compact for any $\rho \in
C_0^\infty((2R, \infty))$, since the operator (6.29) is equal to $\int_0^\infty e^{-\lambda t} Z(t+T)\, dt$, which
is a norm limit of such $Z(\rho)$. We show that, for such $\rho$, $U(\rho) - U_0(\rho)$ is compact,
from $\mathcal{E}$ to $\mathcal{E}_0$. Indeed, if $\rho$ is supported in $[2R, T]$, then, by finite propagation
speed,


(6.33)  $[U(\rho) - U_0(\rho)](f,g)$ is supported in $|x| \le 2R + T$,


for any $(f,g) \in \mathcal{E}$. Also we have, for such $\rho$, by integrating by parts, and elliptic
regularity,


(6.34)  $U(\rho) : \mathcal{E} \to C^\infty(\Omega), \quad U_0(\rho) : \mathcal{E} \to C^\infty(\mathbb{R}^3).$


The compactness of $U(\rho) - U_0(\rho)$ then follows, by Rellich's theorem. We note
that complementing (6.33), we also have, for any $(f,g) \in \mathcal{E}$,


(6.35) [U(e) — Uo(p)] (Fg) depends only on (f,9)| 5," 


For any nonzero $\alpha \in \mathbb{C}$ in the spectrum of the operator (6.29) (for a fixed
$\lambda > 0$, $T > 2R$), this compact operator has an associated finite-dimensional,
generalized $\alpha$-eigenspace $V_\alpha$. $Z(t)$ clearly preserves $V_\alpha$, for $t \ge 0$, and the spectrum
of $Z(t)|_{V_\alpha}$ consists of $e^{\mu_j t}$, where, for each such $\alpha$, $\{\mu_j\}$ is a finite set of
complex numbers, each satisfying


$(\lambda - \mu_j)^{-1} e^{\mu_j T} = \alpha.$

We call the set of all such $\mu_j$, as $\alpha$ ranges over the nonzero elements of the spectrum
of (6.29), scattering characters. It is a fact that this set coincides precisely
with the spectrum of the generator $C$ of $Z(t)$, but we will not make explicit use
of this and we do not include a proof. (See [LP1].) By the analysis above, the set
of scattering characters $\mu_j$ can be characterized as follows:
(6.36)  point spec $Z(t) = \{e^{t\mu_j} : \mu_j \text{ scattering character}\}$.
In §7 we relate the set of scattering characters to the set of scattering poles.

We end this section with some comments on the semigroup Z(t) in the trans- 
lation representation, that is, we look at 
(6.37) Ei = GAT. 
acting on $\mathcal{K}_+ \subset L^2(\mathbb{R}, N)$, where
(6.38)  $\mathcal{K}_+ = \mathcal{T}_+(\mathcal{K})$.


Note that $(f,g)$ belongs to $\mathcal{K}$ if and only if


(6.39)  $\operatorname{supp} \mathcal{T}_+(f,g) \subset (-\infty, R]$ and $\operatorname{supp} \mathcal{T}_-(f,g) \subset [-R, \infty)$,


in view of Proposition 6.1 and (6.19). Recalling the scattering operator $S$, given
by (6.20), so that $\mathcal{T}_- = S^{-1}\mathcal{T}_+$, we see that


(6.40)  $\mathcal{K}_+ = \{f \in L^2((-\infty, R], N) : S^{-1} f \in L^2([-R, \infty), N)\}.$
By (6.37) and (6.3) we have, for $f \in \mathcal{K}_+$,


(6.41)  $Z(t) f(s,\omega) = \begin{cases} f(s-t, \omega), & \text{for } s \le R, \\ 0, & \text{for } s > R. \end{cases}$

Exercises 


1. Prove the Radon inversion formula (6.11) from the definition (6.5) and the Fourier in- 
version formula. 
2. Consider a first-order, constant-coefficient PDE


$\frac{\partial u}{\partial t} = A(D_x) u, \quad u(0,x) = f(x),$

where $A(D_x)$ is an $\ell \times \ell$ matrix. Assume the principal symbol $A_1(\xi)$ has $\ell$ distinct
imaginary roots for $\xi \in \mathbb{R}^3 \setminus 0$. Express the solution in terms of the Radon transform.
When can you deduce Huygens' principle?


7. Integral equations and scattering poles 


In §1 we established results on the existence and uniqueness of solutions to the
scattering problem


(7.1)  $(\Delta + k^2) v = 0$ on $\Omega$, $\quad v = f$ on $\partial K$, $\quad r\Big(\frac{\partial v}{\partial r} - ikv\Big) \to 0$, as $r \to \infty$.


As in (1.19), let us denote the solution operator to (7.1) by
(7.2)  $v = B(k) f.$


We established the proof that $B(k)$ is uniquely defined, for $k \in \mathbb{R}$, via the limiting
absorption principle in §1; related is the elementary fact that such a solution
operator is also uniquely defined for complex $k$ such that Im $k > 0$, since $k^2$
belongs to the resolvent set for the Laplace operator on $\Omega$ (with Dirichlet boundary
condition) for Im $k > 0$. The limiting absorption principle implies that $B(k)$
is strongly continuous in $\{k \in \mathbb{C} : \operatorname{Im} k \ge 0\}$; of course, it is holomorphic on
$\{k : \operatorname{Im} k > 0\}$.

Here we will show that $v = B(k)f$ can be obtained as the solution to an
integral equation over $\partial K$. Use of such integral equations is a convenient tool for
a number of investigations in scattering theory. We use it here to show that $B(k)$
has a meromorphic continuation to an operator-valued function on $\mathbb{C}$, with some
poles in $\{k : \operatorname{Im} k < 0\}$. These poles are known as scattering poles and provide
fundamental objects for study in scattering theory.

The integral equations applying to (7.1) will be obtained from a study of the
following operators, called single- and double-layer potentials, respectively:


(7.3)  $\mathcal{S}\ell(k) f(x) = \int_{\partial K} f(y)\, g(x,y,k)\, dS(y)$

and

(7.4)  $\mathcal{D}\ell(k) f(x) = \int_{\partial K} f(y)\, \frac{\partial g}{\partial \nu_y}(x,y,k)\, dS(y),$


where, as in §1,
(7.5)  $g(x,y,k) = (4\pi|x-y|)^{-1} e^{ik|x-y|}.$


For $f \in C^0(\partial K)$, or even for $f \in L^1(\partial K)$, the functions (7.3) and (7.4) are well
defined and smooth for $x \in \mathbb{R}^3 \setminus \partial K = \Omega \cup \mathring{K}$, where $\mathring{K}$ is the interior of $K$. For
such $v$, $x \in \partial K$, we denote by $v_+(x)$ the limit from the exterior region $\Omega$, $v_-(x)$
the limit from the interior region $\mathring{K}$, and by $\partial v/\partial\nu_+$ and $\partial v/\partial\nu_-$ their normal
derivatives, in the direction pointing into $\Omega$, taken as limits from $\Omega$ and from $\mathring{K}$,
respectively. By the methods used to treat layer potentials in §11 of Chap. 7, one
derives the following results:


(7.6)  $\mathcal{S}\ell(k) f_+(x) = \mathcal{S}\ell(k) f_-(x) = G(k) f(x), \qquad \mathcal{D}\ell(k) f_\pm(x) = \pm\frac{1}{2} f(x) + \frac{1}{2} N(k) f(x),$


where, for $x \in \partial K$,


(7.7)  $G(k) f(x) = \int_{\partial K} f(y)\, g(x,y,k)\, dS(y)$


and


(7.8)  $N(k) f(x) = 2 \int_{\partial K} f(y)\, \frac{\partial g}{\partial \nu_y}(x,y,k)\, dS(y).$


Note that, for $|x - y| \le 1$, $g(x,y,k)$ has an estimate of the form
(7.9)  $|g(x,y,k)| \le C_k |x-y|^{-1}.$


We have for $\nabla_y g$ the poorer estimate $|\nabla_y g(x,y,k)| \le C_k |x-y|^{-2}$, but the
normal derivative $\partial g/\partial\nu_y$ has a weaker singularity on $\partial K \times \partial K$, of the same
kind as $g$:


(7.10)  $\Big|\frac{\partial g}{\partial \nu_y}(x,y,k)\Big| \le C|x-y|^{-1}$, for $x, y \in \partial K$.


It follows that $G(k)$ and $N(k)$ are compact operators on $L^2(\partial K)$, for each $k \in \mathbb{C}$,
with holomorphic dependence on $k$.
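
For orientation, the jump relations (7.6) and the estimate (7.10) can be checked by hand in the simplest case. The sketch below assumes that $K$ is the closed unit ball and that $k = 0$ (an illustrative choice, not from the text), so that $g(x,y,0) = (4\pi|x-y|)^{-1}$ and $\nu_y = y$ on $\partial K = S^2$.

```latex
\[
\begin{aligned}
\frac{\partial g}{\partial \nu_y}(x,y,0)
  &= -\frac{(y-x)\cdot y}{4\pi |x-y|^3}
   = -\frac{1}{8\pi |x-y|}, \qquad x,y \in S^2
   \quad \big(\text{using } (y-x)\cdot y = \tfrac12 |x-y|^2\big), \\
N(0)1(x) &= 2\int_{S^2} \frac{\partial g}{\partial \nu_y}(x,y,0)\,dS(y)
   = -\frac{1}{4\pi}\int_{S^2}\frac{dS(y)}{|x-y|} = -1, \\
\mathcal{D}\ell(0)1(x) &= \begin{cases} -1, & |x| < 1, \\ \phantom{-}0, & |x| > 1, \end{cases}
\qquad
\pm\tfrac12 + \tfrac12 N(0)1 = \begin{cases} \phantom{-}0 & (+), \\ -1 & (-). \end{cases}
\end{aligned}
\]
```

The first line exhibits the $|x-y|^{-1}$ singularity of (7.10) exactly, and the last line matches the two one-sided limits of the double-layer potential. In this example $(I + N(0))1 = 0$, so $I + N(0)$ is not injective; this reflects the fact that constants are Neumann eigenfunctions of $\Delta$ on the ball with eigenvalue $0$.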

We will first consider the possibility of obtaining the solution $v$ to the scattering
problem in the form


(7.11)  $v = B(k) f = \mathcal{D}\ell(k) g$ on $\Omega$,
where $g$ (whose dependence on $k$ we suppress) satisfies the identity
(7.12)  $(I + N(k)) g = 2f$ on $\partial K$.


We will establish the following result.


Proposition 7.1. The operator $I + N(k)$ is invertible on $L^2(\partial K)$ for all Im $k > 0$,
and for all real $k$, except for $k = \lambda_j$, where $-\lambda_j^2$ is an eigenvalue for $\Delta$ on the
interior region $\mathring{K}$, with Neumann boundary condition on $\partial K$.


Proof. Since $N(k)$ is compact, it suffices to consider whether $I + N(k)$ is injective.
Suppose therefore that


(7.13)  $(I + N(k)) g = 0,$


and consider $v = \mathcal{D}\ell(k) g$ on $\mathbb{R}^3 \setminus \partial K$. On $\Omega$, $v$ satisfies (7.1), with $f = 0$ (for
real $k$, and it is also exponentially decaying as $|x| \to \infty$ if Im $k > 0$), so the
uniqueness result implies that $v = 0$ on $\Omega$. Thus $\partial v/\partial\nu_+ = 0$. Now an analysis
of the double-layer potential (7.4), parallel to that for (11.39) of Chap. 7, shows
that, in general,


(7.14)  $\frac{\partial \mathcal{D}\ell(k) f}{\partial \nu_+} = \frac{\partial \mathcal{D}\ell(k) f}{\partial \nu_-}$ on $\partial K$.


Hence, for $v = \mathcal{D}\ell(k) g$, with (7.13) satisfied, we have


(7.15)  $\frac{\partial v}{\partial \nu_-} = 0$ on $\partial K$.


Thus $v$ satisfies the homogeneous Neumann boundary condition, together with
the PDE


(7.16)  $(\Delta + k^2) v = 0$ on $\mathring{K}$.


Since, by (7.6), the jump of $v$ across $\partial K$ is $g(x)$, and since $v_+ = 0$, we deduce
that $v_- = -g$, so $v$ is not identically zero in $\mathring{K}$ if $g \ne 0$. The spectrum of the
Laplace operator $\Delta$ on $\mathring{K}$, with Neumann boundary condition, is a discrete subset
$\{-\lambda_j^2\}$ of $\mathbb{R}^-$, so the proposition is proved.


The extension of $B(k)$ to a neighborhood of the real line in $\mathbb{C}$, including the
exceptional points $\lambda_j$ defined above, is neatly accomplished by considering the
following alternative integral equation. Namely, we look for a solution $v$ to the
scattering problem of the form


(7.17)  $v = B(k) f = \mathcal{D}\ell(k) g + i\eta\, \mathcal{S}\ell(k) g$ in $\Omega$,


where $g$ is to be determined as a function of $f$. Here $\eta$ is a real constant; we can
take $\eta = \pm 1$. In this case, we require that $g$ satisfy the identity


(7.18)  $[I + N(k) + 2i\eta G(k)]\, g = 2f.$


Proposition 7.2. For a given real $\eta \ne 0$, the operator $I + N(k) + 2i\eta G(k)$ is
invertible on $L^2(\partial K)$, for all $k$ such that


(7.19) Imk>0 and 7 Rek = 0. 


Proof. Again it suffices to check injectivity. Suppose $g \in L^2(\partial K)$ satisfies


(7.20)  $[I + N(k) + 2i\eta G(k)]\, g = 0,$
and let
(7.21)  $v = \mathcal{D}\ell(k) g + i\eta\, \mathcal{S}\ell(k) g$ in $\mathbb{R}^3 \setminus \partial K$.


Then $v$ satisfies (7.1) (for $k$ real, also with exponential decay for Im $k > 0$) on $\Omega$,
with $f = 0$, so our familiar uniqueness result implies $v = 0$ on $\Omega$, hence $v_+ = 0$
and $\partial v/\partial\nu_+ = 0$ on $\partial K$. Hence, as before, by (7.6)-(7.8),


(7.22)  $v_- = -g$ on $\partial K$.




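Similarly, $\partial v/\partial\nu_-$ is equal to the jump of $\partial v/\partial\nu$ across $\partial K$. To calculate this
jump, we use (7.14) for $\mathcal{D}\ell(k) g$, and for $i\eta\, \mathcal{S}\ell(k) g$, we use the identity


(7.23)  $\frac{\partial \mathcal{S}\ell(k) g}{\partial \nu_\pm}(x) = \frac{1}{2}\big(N^\#(k) g \mp g\big)(x),$
where

(7.24)  $N^\#(k) g(x) = 2 \int_{\partial K} g(y)\, \frac{\partial g}{\partial \nu_x}(x,y,k)\, dS(y), \quad x \in \partial K.$


Consequently, complementing (7.14), we have


(7.25)  $\frac{\partial \mathcal{S}\ell(k) g}{\partial \nu_+} - \frac{\partial \mathcal{S}\ell(k) g}{\partial \nu_-} = -g$ on $\partial K$.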


Therefore, for $v$ given by (7.21), we have


(7.26)  $\frac{\partial v}{\partial \nu_-} = i\eta g$ on $\partial K$.


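Hence, on the interior region, $v$ satisfies


(7.27)  $(\Delta + k^2) v = 0$ on $\mathring{K}$, $\quad \frac{\partial v}{\partial \nu_-} + i\eta v = 0$ on $\partial K$.

Given that $\eta \ne 0$, we claim that this implies $v = 0$ on $\mathring{K}$. Indeed, Green's identity
implies

(7.28)  $\|\nabla v\|^2_{L^2(K)} - k^2 \|v\|^2_{L^2(K)} = -i\eta\, \|v\|^2_{L^2(\partial K)}.$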

Taking the imaginary part of this identity, we have the following. If $k = \lambda + i\mu$,
(7.29) 2Nallvliz2ac) = —allvllz2(ar): 

Under the hypotheses (7.19), the coefficients on the two sides of (7.29) have opposite
signs, so $v = 0$ on $\partial K$. In view of (7.22), this implies $g = 0$, so this
proposition is proved.


Taking 7 = +1, we have I + N(k) + 2iG(k) invertible in the first (resp., sec- 
ond) closed quadrant in C, hence invertible in a neighborhood of such a quadrant. 
Thus $B(k)$ is extended to an operator-valued function holomorphic on a neighborhood
of the closed upper half-plane Im $k \ge 0$.

We next show that, in fact, $B(k)$ has a continuation to a meromorphic operator-valued
function on $\mathbb{C}$. This is an immediate consequence of the following result.


Proposition 7.3. The operator $I + N(k)$ is invertible on $L^2(\partial K)$ for all $k \in \mathbb{C}$
except for a discrete set, and $(I + N(k))^{-1}$ is a meromorphic function on $\mathbb{C}$.


This result in turn is a special case of the following elementary general result. 
For more general results in the area of analytic Fredholm theory (including the 
multiparameter setting), see Theorem 7.9 and Propositions 7.10—7.11 in Appendix 
A, Outline of functional analysis. 


Proposition 7.4. Let $O$ be a connected open set in $\mathbb{C}$. Suppose $C(z)$ is a compact,
operator-valued, holomorphic function of $z \in O$. Suppose that $I + C(z)$ is
invertible at some point $p_0 \in O$. Then $I + C(z)$ is invertible except at most on a
discrete set in $O$, and $(I + C(z))^{-1}$ is meromorphic on $O$.


Proof. The operator $I + C(z)$ fails to be invertible at a point $z \in O$ if and only
if the compact operator $C(z)$ has $-1$ in its spectrum. For a given $z_0 \in O$, let $\gamma$
be a small circle about $-1$, disjoint from the spectrum of $C(z_0)$. For $z$ in a small
neighborhood $U$ of $z_0$, we can form the projection-valued function


(7.30)  $P(z) = \frac{1}{2\pi i}\int_\gamma (\lambda - C(z))^{-1}\, d\lambda.$


For $z \in U$, this is a projection of finite rank (say $\ell$); using $P(z_0)$ we can produce
a family of isomorphisms of the range $R(P(z))$ with $R(P(z_0))$, and then
$C(z)P(z)$ can be treated as a holomorphic family of $\ell \times \ell$ matrices. This proposition
in the case of $\ell \times \ell$ matrices is easy, via determinants. By hypothesis, $-1$
is not identically an eigenvalue for this family, so


$(I + C(z))^{-1} P(z)$
is a meromorphic function on $U$. Clearly,
$(I + C(z))^{-1}(I - P(z))$
is a holomorphic function on $U$, so this establishes the proposition.
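
A toy instance of the finite-dimensional step may help. The following sketch takes $C(z) = zA$ with a fixed $2\times 2$ matrix $A$ (our choice, purely illustrative), for which the determinant argument is explicit:

```latex
\[
\begin{aligned}
A &= \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad
\det\big(I + zA\big) = 1 - z^2, \\
(I + zA)^{-1} &= \frac{1}{1 - z^2}\begin{pmatrix} 1 & -z \\ -z & 1 \end{pmatrix},
\qquad z \ne \pm 1 .
\end{aligned}
\]
```

Here $I + C(z)$ is invertible at $z = 0$, fails to be invertible exactly on the discrete set $\{\pm 1\}$ where the holomorphic function $\det(I + C(z))$ vanishes, and the inverse is meromorphic on $\mathbb{C}$ with simple poles at those two points.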


Corollary 7.5. The solution operator $B(k)$ for (7.1) has a meromorphic continuation
to $\mathbb{C}$; all its poles are in the lower half-plane Im $k < 0$.


This follows from the formula
(7.31)  $B(k) = 2\,\mathcal{D}\ell(k)\,[I + N(k)]^{-1},$
except at the real points $k = \lambda_j$, from Proposition 7.1, together with the formula


(7.32)  $B(k) = 2\,[\mathcal{D}\ell(k) + i\eta\, \mathcal{S}\ell(k)]\,[I + N(k) + 2i\eta G(k)]^{-1},$


for $\eta = \pm 1$, which defines $B(k)$ as holomorphic on a neighborhood of the
real axis.


The poles of $B(k)$ are called scattering poles. It follows immediately from
(7.31) that the set of scattering poles is contained in the set of poles of $[I +
N(k)]^{-1}$ within the lower half-plane Im $k < 0$. In fact, these two sets coincide;
this is a consequence of the following.


Lemma 7.6. If Im $k \ne 0$, then $\mathcal{D}\ell(k) : L^2(\partial K) \to L^2_{\mathrm{loc}}(\Omega)$ is injective.


Proof. The argument used in the proof of Proposition 7.1 shows that if $g \in
L^2(\partial K)$ and $\mathcal{D}\ell(k) g = 0$ on $\Omega$, then $g = -v|_{\partial K}$, where, if $g \ne 0$, $v$ is an eigenfunction for
$\Delta$ on $\mathring{K}$, with Neumann boundary condition on $\partial K$, and with eigenvalue $-k^2$.
Since the spectrum of this elliptic operator is real and nonpositive, the lemma is
proved.


Proposition 7.7. The set of scattering poles is precisely equal to the set of poles
$k_j$ for $[I + N(k)]^{-1}$ such that Im $k_j < 0$.


Proof. If $[I + N(k)]^{-1}$ has a pole of order $m$ at $k = k_j$, Im $k_j < 0$, then there is
an element $h \in L^2(\partial K)$ such that, with nonzero $h_m \in L^2(\partial K)$,


(7.33)  $[I + N(k)]^{-1} h = (k - k_j)^{-m}\,[h_m + (k - k_j) h_{m-1} + \cdots].$


Since $\mathcal{D}\ell(k_j) h_m = b_m \ne 0$ in $L^2_{\mathrm{loc}}(\Omega)$, it follows from (7.31) that, for $k$ near $k_j$,


(7.34)  $B(k) h = 2(k - k_j)^{-m} b_m + O\big((k - k_j)^{-m+1}\big), \quad k \to k_j,$

so $B(k)$ is singular at $k_j$.
We also have the following characterization of scattering poles.


Proposition 7.8. A complex number $k_j$ is a scattering pole if and only if there is
a nonzero $v \in C^\infty(\Omega)$ satisfying


(7.35)  $(\Delta + k_j^2) v = 0$ on $\Omega$, $\quad v = 0$ on $\partial K$,
of the form
(7.36)  $v = \mathcal{D}\ell(k_j) g,$


for some $g \in L^2(\partial K)$.


Proof. We know that, for Im $k_j \ge 0$, $v$ satisfying (7.35)-(7.36) must vanish on
$\Omega$. On the other hand, if Im $k_j < 0$, we know that $k_j$ is a scattering pole if and
only if $I + N(k_j)$ has nonzero kernel. We claim that, for Im $k_j < 0$,


(7.37)  $\mathcal{D}\ell(k_j) : \ker(I + N(k_j)) \longrightarrow \{v \text{ satisfying (7.35)-(7.36)}\},$


isomorphically. Indeed, surjectivity is obvious, and injectivity follows from
Lemma 7.6. This proves Proposition 7.8.




The condition (7.36) can be viewed as an extension of the radiation condition, 
which we initially defined for real k. A sharper result is given in Proposition 7.13 
below. 

It is clear that the Green function G(x, y, k), defined in §1 by (1.26)-(1.30), 
has a meromorphic extension in k, with poles confined to the set of scattering 
poles defined above. Indeed, we can write 


where g(x, y,k) is given by (1.5) for all k € C, and yy,, is the restriction of 
g(x,y,k) tox € OK. Similarly, the “eigenfunctions” u(x, kw), defined by 
(1.32)-(1.33), have such a meromorphic continuation in k, and so do the scat- 
tering amplitude a(w, 0, k) and the scattering operators S'(k) and S(k). We will 
explore these last objects further at the end of this section. First we consider an- 
other integral-equation approach to the scattering problem (7.1). 

As another alternative to (7.11), it is of interest to obtain solutions to the scattering
problem in the form


(7.39)  $v = B(k) f = \mathcal{S}\ell(k) g$ on $\Omega$,
where $g$ satisfies the integral equation
(7.40)  $G(k) g = f$ on $\partial K$.


The operator-valued function $G(k)$ is defined by (7.6). As we have noted, $G(k)$
is compact on $L^2(\partial K)$. In fact, analysis done in Chap. 7 shows that $G(k)$ is a
pseudodifferential operator of order $-1$ on $\partial K$, and examination of its symbol
shows that it is elliptic. The principal symbol of $G(k)$ is positive on $S^*(\partial K)$.
Consequently, for each $k \in \mathbb{C}$, each real $s$,


(7.41)  $G(k) : H^s(\partial K) \longrightarrow H^{s+1}(\partial K)$ is Fredholm, of index zero.
In analogy with Proposition 7.1, we have the following result: 
Proposition 7.9. The operator $G(k) : H^s(\partial K) \to H^{s+1}(\partial K)$ is invertible for
all $k$ such that Im $k > 0$, and for all real $k$, except for $k = \mu_j$ such that $-\mu_j^2$ is an
eigenvalue of $\Delta$ on the interior region $\mathring{K}$, with Dirichlet boundary condition on
$\partial K$.


Proof. In view of (7.41), it suffices to check the injectivity of $G(k)$. This goes
as in the proof of Proposition 7.1. Setting $v = \mathcal{S}\ell(k) g$ on $\mathbb{R}^3 \setminus \partial K$, uniqueness
as before yields $v = 0$ on $\Omega$ if $g \in \ker G(k)$, Im $k \ge 0$. Then $v_- = 0$ on $\partial K$,
by (7.6), while by (7.25) $\partial v/\partial\nu_- = g$ on $\partial K$, so if $g \ne 0$ then $v|_{\mathring{K}} \ne 0$ is
an eigenfunction for $\Delta$ on $\mathring{K}$, with Dirichlet boundary condition and with eigenvalue
$-k^2$.




In addition to (7.41), we obtain from the analysis of $G(k)$ as a pseudodifferential
operator that its principal symbol is independent of $k$, hence

(7.42)  $G(k) - G(0) = D(k) : H^s(\partial K) \longrightarrow H^{s+2}(\partial K).$

By Proposition 7.9, $G(0)$ is invertible. Then

(7.43)  $G(0)^{-1} G(k) = I + G(0)^{-1} D(k) : H^s(\partial K) \longrightarrow H^s(\partial K)$
is holomorphic in $k$, and

(7.44)  $G(0)^{-1} D(k) : H^s(\partial K) \longrightarrow H^{s+1}(\partial K);$


in particular, this operator is compact on $H^s(\partial K)$, for each $s \ge 0$. Since
Proposition 7.9 implies that the operator (7.43) is invertible for Im $k > 0$, we
can apply the general operator result of Proposition 7.4, to obtain:


Proposition 7.10. The operator-valued function
(7.45)  $G(k)^{-1} : H^{s+1}(\partial K) \longrightarrow H^s(\partial K)$


has a meromorphic continuation to $\mathbb{C}$, with poles contained in Im $k < 0$ together
with the set $\{\mu_j\}$ of real numbers specified in Proposition 7.9.


In view of (7.39), the set of poles of (7.45) satisfying Im $k < 0$ contains the set
of scattering poles, and


(7.46)  $B(k) = \mathcal{S}\ell(k)\, G(k)^{-1},$


where $G(k)^{-1}$ is regular. In fact, in parallel with the proofs of Lemma 7.6 and
Proposition 7.7, we easily obtain the following:


Proposition 7.11. If Im $k \ne 0$, then $\mathcal{S}\ell(k) : L^2(\partial K) \to L^2_{\mathrm{loc}}(\Omega)$ is injective.
Therefore, the set of scattering poles is precisely equal to the set of poles for
$G(k)^{-1}$ such that Im $k < 0$. Furthermore, a complex number $k_j$ is a scattering
pole if and only if there is a nonzero $v \in C^\infty(\Omega)$ satisfying (7.35), of the form
(7.47)  $v = \mathcal{S}\ell(k_j) g,$

for some $g \in L^2(\partial K)$. More precisely, for Im $k_j < 0$,

(7.48)  $\mathcal{S}\ell(k_j) : \ker G(k_j) \longrightarrow \{v \text{ satisfying (7.35) and (7.47)}\},$
isomorphically.


From the formula (7.7) for $G(k)$, we see that

(7.49) $G(k)^* = G(-\bar{k})$.


We therefore have the following: 


Corollary 7.12. The set of scattering poles is symmetric about the imaginary 
axis. 
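
The passage from (7.49) to the corollary can be sketched as follows. This is an illustration rather than the author's own argument. It assumes that $G(k)$ is Fredholm of index zero (which is what (7.43)-(7.44) provide, since $G(0)^{-1}G(k)$ is the identity plus a compact operator), so that $G(k)$ fails to be invertible exactly when its kernel is nonzero, and likewise for its adjoint.

```latex
\begin{align*}
k \text{ is a pole of } G(\cdot)^{-1}
 &\iff \ker G(k) \neq 0 \\
 &\iff \ker G(k)^{*} \neq 0 && \text{(index zero)} \\
 &\iff \ker G(-\bar{k}) \neq 0 && \text{by (7.49)} \\
 &\iff -\bar{k} \text{ is a pole of } G(\cdot)^{-1}.
\end{align*}
% Since \operatorname{Im}(-\bar{k}) = \operatorname{Im} k, the reflection
% k \mapsto -\bar{k} about the imaginary axis maps the poles in
% \operatorname{Im} k < 0, i.e. the scattering poles, to themselves.
```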


We can also obtain a characterization of the set of scattering poles which is 
more satisfactory than that of Proposition 7.8 or the last part of Proposition 7.11. 


Proposition 7.13. A complex number $k$ is a scattering pole if and only if there is a nonzero $v \in C^\infty(\overline{\Omega})$ satisfying (7.35), of the form

(7.50) $v = \mathcal{D}\ell(k)g_1 + \mathcal{S}\ell(k)g_2$ on $\Omega$,

for some $g_j \in L^2(\partial K)$.


Proof. For $v$ of the form (7.50), note that

(7.51) $v_+ = \frac{1}{2}(I + N(k))g_1 + G(k)g_2$ on $\partial K$.

In particular, if $v_+ = 0$ on $\partial K$ and $k$ is not a scattering pole, but $\operatorname{Im} k \neq 0$, then $g_1 = -2(I + N(k))^{-1}G(k)g_2$. Now we know that, for $\operatorname{Im} k > 0$, $v_+ = 0$ on $\partial K$ implies $v = 0$ on $\Omega$, so we have the identity

(7.52) $2\,\mathcal{D}\ell(k)(I + N(k))^{-1}G(k) - \mathcal{S}\ell(k) = 0$,

for $\operatorname{Im} k > 0$, as a map from $L^2(\partial K)$ to $C^\infty(\Omega)$. This identity continues analytically to the lower half-plane $\operatorname{Im} k < 0$, excluding the scattering poles, and implies that if $v$ is of the form (7.50), $v_+ = 0$ on $\partial K$, and $k$ is not a scattering pole, then $v = 0$ on $\Omega$. Given the results of Propositions 7.8 and 7.11 when $k = k_j$ is a scattering pole, this finishes the proof.


We can obtain a few more conclusions from (7.52), which we write as

(7.53) $\mathcal{D}\ell(k)M(k) = \mathcal{S}\ell(k)$ on $\Omega$,

valid for all $k \in \mathbb{C}$ at which $I + N(k)$ is invertible, with

(7.54) $M(k) = 2(I + N(k))^{-1}G(k)$.

First, using the injectivity of $\mathcal{D}\ell(k)$ for $\operatorname{Im} k < 0$, as in the proof of Proposition 7.7, we see that $M(k)$ has an analytic continuation to all $\operatorname{Im} k < 0$, including the set of scattering poles. The only poles of $M(k)$ are at the real numbers $\lambda_j$ of Proposition 7.1. Also, $M(k)$ is invertible, except at the real numbers $\mu_j$ of Proposition 7.9; in particular, $M(k)$ is invertible at all the scattering poles. Therefore, when $k = k_j$ is a scattering pole, $M(k_j)$ gives an isomorphism from $\ker G(k_j)$, in (7.48), to $\ker(I + N(k_j))$, in (7.37). Furthermore, any $v$ of the form (7.50), with $k = k_j$, can be written both in the form (7.36) and in the form (7.47) (with different $g$'s).

Another calculation using the representation of the solution to the scattering problem by a single-layer potential (7.39)-(7.40), produces an analysis of the Neumann operator $\mathcal{N}(k)$, which we define as follows, first for $\operatorname{Im} k > 0$. For $f \in C^\infty(\partial K)$, let $v$ be the solution to the scattering problem (7.1), $v = B(k)f$, and define

(7.55) $\mathcal{N}(k)f = \dfrac{\partial v}{\partial\nu}\Big|_{\partial K}$.

By elliptic regularity estimates, we can deduce that, for $s > 1$,

(7.56) $\mathcal{N}(k) : H^s(\partial K) \longrightarrow H^{s-1}(\partial K)$.


We produce a formula for $\mathcal{N}(k)$ using the representation $v = \mathcal{S}\ell(k)g$, $g = G(k)^{-1}f$, valid for $\operatorname{Im} k > 0$. From the formula (7.23) for $\partial\mathcal{S}\ell(k)g/\partial\nu_+$, we see that

(7.57) $\mathcal{N}(k) = \frac{1}{2}\big(N^{\#}(k) - I\big)G(k)^{-1}$,

for $\operatorname{Im} k > 0$. This identity continues analytically to the complement of the set of poles of $G(k)^{-1}$ in $\mathbb{C}$. Note that, complementing (7.49),

(7.58) $N^{\#}(k) = N(-\bar{k})^*$,


so (7.57) can also be written as 


(7.59) $\mathcal{N}(-\bar{k}) = \frac{1}{2}\big(N(k) - I\big)^*\big[G(k)^{-1}\big]^*$.


By the analysis of the scattering problem for $\operatorname{Im} k \ge 0$, we know that $\mathcal{N}(k)$ is a strongly continuous function of $k$, with values in the Banach space

$\mathcal{L}\big(H^s(\partial K), H^{s-1}(\partial K)\big)$,

for $\operatorname{Im} k \ge 0$. Thus $\mathcal{N}(k)$ does not have poles on the real axis; such singularities are therefore removable on the right side of (7.57). The poles of $G(k)^{-1}$ on the real axis must be canceled by a null space of $N^{\#}(k) - I$, for $k = \mu_j$. The occurrence of these real poles of $G(k)^{-1}$ makes (7.57) a tool of limited value in analyzing the Neumann operator $\mathcal{N}(k)$ for real $k$.

We can produce another formula for $\mathcal{N}(k)$, first for $\operatorname{Im} k > 0$, by using the representation (1.4) for $v = B(k)f$, that is,

(7.60) $B(k)f = \mathcal{D}\ell(k)f - \mathcal{S}\ell(k)\mathcal{N}(k)f$.

Evaluating this on $\partial K$, we have

(7.61) $f = \frac{1}{2}(I + N(k))f - G(k)\mathcal{N}(k)f$,

which implies

(7.62) $\mathcal{N}(k) = \frac{1}{2}G(k)^{-1}\big(N(k) - I\big)$,

for $\operatorname{Im} k > 0$. Of course, this identity also continues analytically to all $k \in \mathbb{C}$ outside the set of poles of $G(k)^{-1}$. Comparing (7.62) with (7.57), we see that $N(k)$ and $N^{\#}(k)$ are related by the identity

(7.63) $N(k)G(k) = G(k)N^{\#}(k)$,

for all $k \in \mathbb{C}$. Also, comparing (7.62) with (7.59), we see that

(7.64) $\mathcal{N}(-\bar{k}) = \mathcal{N}(k)^*$;

in particular, $\mathcal{N}(k)$ is self-adjoint when $k$ is purely imaginary. Furthermore, in view of (7.60) and (7.57), we see that the set of poles of $\mathcal{N}(k)$ coincides exactly with the set of scattering poles.
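
The two comparisons above can be written out explicitly. The following is a sketch, assuming the readings $G(k)^* = G(-\bar{k})$ for (7.49) and $N^{\#}(k) = N(-\bar{k})^*$ for (7.58), and writing $\mathcal{N}(k)$ for the Neumann operator and $N(k)$ for the double-layer boundary operator.

```latex
% (7.63): equate the two formulas (7.62) and (7.57) for \mathcal{N}(k).
\begin{align*}
\tfrac12\,G(k)^{-1}\bigl(N(k) - I\bigr) &= \tfrac12\,\bigl(N^{\#}(k) - I\bigr)G(k)^{-1}, \\
\bigl(N(k) - I\bigr)G(k) &= G(k)\bigl(N^{\#}(k) - I\bigr)
   && \text{(apply } 2G(k) \text{ on the left and } G(k) \text{ on the right)}, \\
N(k)G(k) &= G(k)N^{\#}(k).
\end{align*}
% (7.64): take the adjoint of (7.62), then use (7.58), (7.49), and (7.57) at -\bar{k}.
\begin{align*}
\mathcal{N}(k)^{*}
 &= \tfrac12\,\bigl(N(k)^{*} - I\bigr)\bigl(G(k)^{*}\bigr)^{-1}
  = \tfrac12\,\bigl(N^{\#}(-\bar{k}) - I\bigr)G(-\bar{k})^{-1}
  = \mathcal{N}(-\bar{k}).
\end{align*}
% For k = is with s real, -\bar{k} = k, so \mathcal{N}(is)^{*} = \mathcal{N}(is).
```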

Note that the factor $(1/2)(N(k) - I)$ in (7.62) arises from evaluating $\mathcal{D}\ell(k)f$ on $\partial K$ as a limit from the interior region $K$, by (7.7). Thus the analogue of the identity (7.53) which is valid on $K$ is obtained by replacing $M(k)$ by $\mathcal{N}(k)^{-1}$. Equivalently,

(7.65) $\mathcal{D}\ell(k) = \mathcal{S}\ell(k)\mathcal{N}(k)$ on $K$,

where $\mathcal{N}(k)$ is the exterior Neumann operator defined above.

So far we have not established that there actually are scattering poles. We will 
show that in fact there are infinitely many scattering poles on the negative imag- 
inary axis, for any nonempty smooth obstacle K, by a study of G(k). We begin 
with the following result: 


Lemma 7.14. For real $s \ge 0$, $G(is)$ is positive-definite.


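Proof. Given $g \in L^2(\partial K)$, set $v = \mathcal{S}\ell(is)g$ on $\mathbb{R}^3 \setminus \partial K$. Then Green's theorem gives, for $s > 0$,

(7.66)
$(\Delta v, v)_{L^2(\Omega)} + \|dv\|^2_{L^2(\Omega)} = -\displaystyle\int_{\partial K} G(is)g\,\overline{\frac{\partial v}{\partial\nu_+}}\,dS$,
$(\Delta v, v)_{L^2(K)} + \|dv\|^2_{L^2(K)} = \displaystyle\int_{\partial K} G(is)g\,\overline{\frac{\partial v}{\partial\nu_-}}\,dS$.

Recall from (7.25) that $\partial v/\partial\nu_- - \partial v/\partial\nu_+ = g$, so adding the identities above gives

(7.67) $(G(is)g, g)_{L^2(\partial K)} = s^2\big(\|v\|^2_{L^2(\Omega)} + \|v\|^2_{L^2(K)}\big) + \|\nabla v\|^2_{L^2(\Omega)} + \|\nabla v\|^2_{L^2(K)}$,

for $s > 0$, which proves the lemma in this case. Since we know that $G(0)$ is invertible, this is also positive-definite.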


To proceed with the demonstration that $G(is)^{-1}$ is singular for infinitely many negative real $s$, we set

(7.68) $n(s) = \#$ negative eigenvalues of $G(is)$,

for $s < 0$. Our next claim is as follows:

Lemma 7.15. As $s \searrow -\infty$, $n(s) \to \infty$.


Proof. We will show that $(G(is)g_1, g_2)$ defines a negative definite inner product on a vector space $V$ whose dimension can be taken large with $|s|$. Then the lemma follows, by the variational characterization of the spectrum of $G(is)$. Pick points $p, q \in K$ such that $|p - q|$ is maximal. Then, for any $N$, you can pick $p_j$ near $p$ and $q_j$ near $q$, for $1 \le j \le N$, such that

(7.69) $\min\{|p_j - q_j| : 1 \le j \le N\} > \max\{|p_j - q_k| : j \neq k\}$.

Put small disjoint disks $D_j$ about $p_j$, $D_j'$ about $q_j$, all of the same area, within $\partial K$, and define functions $g_j \in L^2(\partial K)$ by

(7.70) $g_j = 1$ on $D_j$, $\;-1$ on $D_j'$, $\;0$ elsewhere.

Then $\{g_j : 1 \le j \le N\}$ is a set of orthogonal functions, all of the same norm. Let $V$ be the linear span of these $g_j$. With $V$ so fixed, of dimension $N$, a simple calculation gives

(7.71)
$(G(is)g_j, g_j) \le -\gamma < 0$,
$|(G(is)g_j, g_k)| \ll \gamma$, for $j \neq k$,

for $s$ large and negative (because $|x - y|^{-1}e^{|s|\,|x-y|}$ is maximal for $x$ and $y$ distant, if you exclude a small neighborhood of $x = y$), and the lemma follows.


In view of (7.41), only finitely many of the eigenvalues of $G(is)$ (all of which are real) can cross from positive to negative at any point $s = s_\ell$, so we have the following conclusion from the last two lemmas.

Proposition 7.16. The operator-valued function $G(k)^{-1}$ has an infinite number of poles on the negative imaginary axis, each of which is a scattering pole.


As we have already mentioned, the scattering amplitude $a(\omega, \theta, k)$ and also the
scattering operators S(k) and S(k) have meromorphic continuations, with poles 
confined to the set of scattering poles. Indeed, by (3.28), $a(\omega, \theta, k)$ is the sum of an entire function and $1/4\pi$ times

(7.72) $\big(\mathcal{N}(k)e_{k\omega}, e_{k\theta}\big)_{L^2(\partial K)}$,

where $e_\xi(y) = e^{iy\cdot\xi}$ for $y \in \partial K$. We now draw a connection between the set of poles of $S(k)$ and the set of scattering characters, described in §6 in terms of the spectrum of $Z(t)$.

First note that since S(k) is unitary for k real, we have 


(7.73) S(k)S(k)* = S(k)*S(k) =I, 


for k in a neighborhood of the real axis. By continuation, knowing that S(k) is 
holomorphic for Im k > 0, we see that a complex number / such that Im k < 0 
is a pole of S(k) if and only if S(k) fails to be invertible. Now, by (3.14)-(3.15), 
and its analogue for S(k), we see that, for Im & > 0, 


(7.74) $S(k) = I + \dfrac{k}{2\pi i}A(k)$,
where, for such $k$, $A(k)$ is a compact operator on $L^2(S^2)$, given by a smooth
integral kernel. Such $S(k)$ is Fredholm of index zero. Thus, for Im k > 0, S(k)
fails to be invertible if and only if it has a nonzero kernel. Furthermore, this hap- 
pens if and only if S(k)* has a nonzero kernel. We are now prepared to establish 
the following result. 


Proposition 7.17. A complex number $\mu$ is a scattering character if and only if $i\mu$ is a pole of $S(k)$.


Proof. j is a scattering character if and only if there exists a nonzero f € Ky 
such that 7, (t)f = e“* f, for t > 0. By (6.41), this implies 


f(s,w) =e “y(w), fors< R, 


(7.75) 
0, fors > R, 


for some nonzero y € L?(S?). By (6.40), such an f belongs to K, if and only 
if S* f is supported in [R, 00). By the Paley—Wiener theorem, we can deduce that 
this will hold if and only if S(k)* f (k) is holomorphic in Im k < 0. Now 


(7.76) f(k) = (20)~ 1/2 ve) 




which has a pole at k = ip, so this analyticity holds if and only if y belongs to 
the kernel of S(k)*, for k = is. This establishes the proposition. 


For a rich treatment of a great deal more material on scattering poles, we rec- 
ommend the monograph [DZ]. 


Exercises 


1. Verify that $G(k)$, defined by (7.7), is an elliptic pseudodifferential operator of order $-1$ on $\partial K$. Compute its principal symbol.
2. Justify (7.69). 


The following exercises deal with an integral-equation attack on the scattering problem for $H = -\Delta + V$ on $\mathbb{R}^3$. Assume $V \in C_0^\infty$. We use (1.57), that is,


(I—V(k))v = RR), 


where V(k) = R(k)(Vv) and 


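with $g(x, y, k) = (4\pi|x - y|)^{-1}e^{ik|x-y|}$.
3. Show that, for $\operatorname{Im} k \ge 0$, $\sigma > 1$,

$R(k) : L^2_{\mathrm{comp}}(\mathbb{R}^3) \longrightarrow L^2(\mathbb{R}^3, \langle x\rangle^{-2\sigma}dx)$ is compact.
4. Show that, for $\operatorname{Im} k \ge 0$, $k^2$ not a negative eigenvalue of $-\Delta + V$, and $\sigma > 1$,

$I - \mathcal{V}(k) : L^2(\mathbb{R}^3, \langle x\rangle^{-2\sigma}dx) \longrightarrow L^2(\mathbb{R}^3, \langle x\rangle^{-2\sigma}dx)$

is injective, hence invertible. (Hint: If $u = \mathcal{V}(k)u = R(k)(Vu)$, show that $u$ satisfies the hypotheses for the uniqueness result of Exercise 6 in §1 when $k \in \mathbb{R}$. When $\operatorname{Im} k > 0$, the argument is easier.)
5. Fix $\kappa \in (0, \infty)$. Show that, for $\operatorname{Im} k > -\kappa$,

$R(k) : L^2_{\mathrm{comp}}(\mathbb{R}^3) \longrightarrow L^2(\mathbb{R}^3, e^{-2\kappa|x|}dx)$ is compact.

Also show that

$I - \mathcal{V}(k) : L^2(\mathbb{R}^3, e^{-2\kappa|x|}dx) \longrightarrow L^2(\mathbb{R}^3, e^{-2\kappa|x|}dx)$

is holomorphic in $\{k : \operatorname{Im} k > -\kappa\}$, and invertible for $\operatorname{Im} k \ge 0$, $k^2 \notin$ point spec $H$. Deduce that its inverse has a meromorphic continuation.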


8. Trace formulas; the scattering phase 


In Proposition 6.7 we showed that, for any $\rho \in C_0^\infty((2R, \infty))$, the operator $Z(\rho) = \int_0^\infty \rho(t)Z(t)\,dt$ is compact. Recall that the proof used the identity



(8.1) $Z(\rho) = P_+[U(\rho) - U_0(\rho)]P_-$, for $\rho \in C_0^\infty((2R, \infty))$.


We then saw that $U(\rho) - U_0(\rho)$ has a smooth, compactly supported integral kernel. It follows that the operator (8.1) is not only compact, but in fact trace class. By a theorem of V. Lidskii, which we will prove in Appendix A at the end of this chapter, it follows that the trace $\operatorname{Tr} Z(\rho)$ is equal to the sum of the eigenvalues of $Z(\rho)$, counted with multiplicity. Thus we have

(8.2) $\operatorname{Tr} Z(\rho) = \sum_j \hat{\rho}(i\mu_j)$,

where the sum is over the set of scattering characters, characterized by (6.36). In view of Proposition 7.17, we can write

(8.3) $\operatorname{Tr} Z(\rho) = \sum_{\text{poles}} \hat{\rho}(z_j)$,

where $\{z_j\}$ is the set of poles of the scattering operator $S(k)$ (counted with multiplicity).

Using (8.1), we will establish the following formula for $\operatorname{Tr} Z(\rho)$, which then sheds light on the right side of (8.3).

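Proposition 8.1. For $\rho \in C_0^\infty((2R, \infty))$, we have

(8.4)
$\operatorname{Tr} Z(\rho) = \operatorname{Tr}\,[U(\rho) - U_0(\rho)]$
$\qquad\quad\;\; = 2\operatorname{Tr}\displaystyle\int_0^\infty \rho(t)\big[\cos t\sqrt{-\Delta} - \cos t\sqrt{-\Delta_0}\big]\,dt$,

where $\Delta$ is the Laplacian on $\Omega = \mathbb{R}^3 \setminus K$, with Dirichlet boundary condition, and $\Delta_0$ the Laplacian on $\mathbb{R}^3$.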


Proof. Using the facts that $\operatorname{Tr} AB = \operatorname{Tr} BA$ and that $P_+P_- = P_-P_+$, we see from (8.1) that, for $\rho \in C_0^\infty((2R, \infty))$,

$\operatorname{Tr} Z(\rho) = \operatorname{Tr} P_-[U(\rho) - U_0(\rho)]P_+$.

Now for any $t \ge 0$, $U(t) = U_0(t)$ on $D_+$, so $[U(\rho) - U_0(\rho)]P_+ = U(\rho) - U_0(\rho)$. Similarly, $P_-[U(\rho) - U_0(\rho)] = U(\rho) - U_0(\rho)$, so we have the first identity in (8.4). The second identity is elementary.


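Combining (8.3) and (8.4), we have the identity

(8.5) $\operatorname{Tr}\displaystyle\int_0^\infty \rho(t)\big[\cos t\sqrt{-\Delta} - \cos t\sqrt{-\Delta_0}\big]\,dt = \frac{1}{2}\sum_{\text{poles}} \hat{\rho}(z_j)$,

valid for any $\rho \in C_0^\infty((2R, \infty))$. This identity has been extended to all $\rho \in C_0^\infty(\mathbb{R}^+)$, by R. Melrose [Me1], using a more elaborate argument.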


Note that (8.4) is equal to the trace of

(8.6) $\varphi(\sqrt{-\Delta}) - \varphi(\sqrt{-\Delta_0})$,

with $\varphi(\lambda) = \hat{\rho}(\lambda) + \hat{\rho}(-\lambda)$. It is useful to note that, for any even $\varphi \in \mathcal{S}(\mathbb{R})$, the operator (8.6), given by an integral formula such as in the last line of (8.4) with
p = , has a Schwartz kernel that is smooth and rapidly decreasing at infinity, so 
that (8.6) is of trace class for this more general class of functions $\varphi$. (See Exercises 1-3 from §4.) Recall from (2.7) that if $\varphi \in C_0^\infty(\mathbb{R})$, then


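(8.7) $\varphi(\sqrt{-\Delta})f(x) = (2\pi)^{-3}\displaystyle\int_{\mathbb{R}^3}\int_\Omega \varphi(|\xi|)\,u_+(x, \xi)\,\overline{u_+(y, \xi)}\,f(y)\,dy\,d\xi$,

where $u_+(x, \xi)$ are the generalized eigenfunctions of $\Delta$ on $\Omega$ defined by (1.32)-(1.33). It follows that, for such $\varphi$, the trace of (8.6) is equal to

(8.8) $\displaystyle\lim_{R\to\infty}\,(2\pi)^{-3}\int_{\mathbb{R}^3}\varphi(|\xi|)\,\tau_R(\xi)\,d\xi$,

with

(8.9) $\tau_R(\xi) = \displaystyle\int_{B_R}\big(|u_+(x, \xi)|^2 - 1\big)\,dx$,

where we set $u_+(x, \xi) = 0$, for $x \in K = \mathbb{R}^3 \setminus \Omega$, and $B_R = \{x : |x| < R\}$.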

In order to evaluate (8.9), we will calculate $\int |u_+(x, \xi)|^2\,dx$ over $\Omega_R = \{x \in \Omega : |x| < R\}$ via Green's theorem. Note that since $(\Delta + k^2)u_+ = 0$, for $|\xi| = k$, we have

(8.10) $(\Delta + k^2)\Big(\dfrac{\partial u_+}{\partial k}\Big) = -2ku_+, \quad |\xi| = k$.


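Hence, via Green's theorem, we have

(8.11)
$\displaystyle\int_{\Omega_R}|u_+|^2\,dx = \frac{1}{2k}\int_{\partial\Omega_R}\Big[\frac{\partial u_+}{\partial k}\frac{\partial\bar{u}_+}{\partial\nu} - \bar{u}_+\frac{\partial^2 u_+}{\partial\nu\,\partial k}\Big]\,dS$
$\qquad\qquad\quad\;\; = \displaystyle\frac{1}{2k}\int_{|x|=R}\Big[\frac{\partial u_+}{\partial k}\frac{\partial\bar{u}_+}{\partial r} - \bar{u}_+\frac{\partial^2 u_+}{\partial r\,\partial k}\Big]\,dS$,

since $u_+ = 0$ on $\partial K$. We want to evaluate the limit of (8.11) as $R \to \infty$. Extending (1.41), we can write

(8.12) $u_+(r\theta, k\omega) = e^{-ikr(\theta\cdot\omega)} + e^{ikr}B(r, \theta, \omega, k)$,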


with

(8.13) $B \sim r^{-1}a(-\omega, \theta, k) + r^{-2}a_2(-\omega, \theta, k) + \cdots, \quad r \to \infty$,

where $a(\omega, \theta, k)$ is the scattering amplitude and $a_j$ are further coefficients. Differentiating (8.12) yields the following (unfortunately rather long) formula for the integrand in (8.11):


Ou, OU OP uy 
Ok Or  Orok 
—OB OB 
_ 2 4 a(O. : a 
= 2kr(0-w)" +i(0-w) — 2kiB DE + irB Dr 
polrB) 0B OB: OB 
ar" Or Ok” Ordk 


(8.14) ED {2kr|B)? ~ kr Be®*l() +19.) — 1] 
— kr Beale) +] (0 ; w) = (0 ; w)?}} 


O(rB) GB 
Or Orok 


4 eikr[(0-w)+1] {iF le 4) ll =4 


fe erle er r {ine we +i(6- Bh 
r 


A primary tool in the analysis of the integral of this quantity over $|x| = R$ will be the stationary phase method, which was established in Appendix B of Chap. 6. We make some preliminary simplification of (8.14), using the fact that (8.11) is clearly real valued. Also, we can throw out some terms in (8.14) that contribute 0 in the limit $R \to \infty$, after being integrated over $|x| = R$. This includes all the terms in the first set of curly brackets above. Also, a stationary phase evaluation of the last two terms in the third set of curly brackets yields a 0 contribution in the limit $R \to \infty$. Thus, we can replace (8.14) by the real part of
2 0B. OB 
2kr(0-w)” — 2kiB Ik +irB ar 


(8.15) + {2kr| Bl? + krBe*ro+N (1 — 9)? 


OB OB 
= eihr lw +t] {ane iG 1] + ir - oor — 16. Bh, 
The first term on the right side of (8.15) integrates to $2k$ times $(4/3)\pi R^3$, exactly canceling out $\int_{|x|\le R} dx$. The contribution of the second and third terms to (8.11) is, in the limit $R \to \infty$,




(8.16) —iG |a\?, integrated with respect to 0. 


i 
Ok 2k 
We can neglect the second term in (8.16), since it is imaginary. 

Terms in (8.15) appearing with a factor $e^{\pm ikr[(\theta\cdot\omega)+1]}$ have an asymptotic behavior as $R \to \infty$ given by the stationary phase method, upon integration with respect to $\theta$. The leading part in the terms within the first set of brackets is seen to be (upon taking the real part)


2k dai | An 
(8.17) ={f la(—w,0,k)|2d0 + $* Im a(—w, wk), 
S2 


which cancels, by the optical theorem, (3.18). This cancelation is necessary since, if (8.17) were nonzero, one would get an infinite contribution to (8.11) as $R \to \infty$. What gives a finite contribution to (8.11) is the $\theta$-integral of the next leading term in this part of (8.15); the contribution to (8.11) one gets from this, as $R \to \infty$, is (again upon taking the real part)


(aa2 + Gaz)(—w, 0, k)d6 
(8.18) g? 


4 8 
+ = Im a2(—w, —w, k) + = Re a(—w, —w, k). 
The rest of the terms in (8.15) also give a finite contribution to (8.11) as $R \to \infty$, via stationary phase, namely $-1/2k$ times


4 
(8.19) de gt, ae Sy. 


plus a term containing an oscillatory factor $e^{-2ikR}$, which disappears after integration with respect to $\xi$. This disappearance is guaranteed, since the limit in (8.8) as $R \to \infty$ does exist. Putting together (8.16)-(8.19), we arrive at a computation of (8.9).

All these contributions are expressed in terms of the scattering amplitude $a$, except for (8.18), which involves also the coefficient $a_2$ appearing in (8.13). Now $a_2$ is related to $a$ in a simple fashion, because $(\Delta + k^2)(e^{ikr}B) = 0$. Expressing $\Delta$ in polar coordinates gives a sequence of relations among the coefficients in the expansion of $B$ as $r \to \infty$. In particular, we get

(8.20) $2ik\,a_2(\omega, \theta, k) = \Delta_2 a(\omega, \theta, k)$,

where $\Delta_2$ denotes the Laplace operator on the sphere $\{|\theta| = 1\}$, applied to the second argument of $a(\omega, \theta, k)$. It follows that $\int a\,\bar{a}_2(-\omega, \theta, k)\,d\theta$ is purely imaginary, so the integral in (8.18) vanishes. In concert with the reciprocity formula (3.31), we can deduce that $4ik\,a_2(\omega, \omega, k) = \Delta_2 a(\omega, \omega, k) + \Delta_1 a(-\omega, -\omega, k)$.
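
The relation (8.20) can be checked by a short computation. The following sketch assumes the expansion $B = r^{-1}a + r^{-2}a_2 + O(r^{-3})$ from (8.13) and the polar form of the Laplacian on $\mathbb{R}^3$; here $\Delta_S$ stands for the Laplace operator on the unit sphere acting in the angular variable $\theta$ (the operator called $\Delta_2$ in the text).

```latex
\begin{align*}
\Delta &= \partial_r^2 + \frac{2}{r}\,\partial_r + \frac{1}{r^2}\,\Delta_S, \\
e^{-ikr}(\Delta + k^2)\bigl(e^{ikr}B\bigr)
  &= \partial_r^2 B + 2ik\,\partial_r B + \frac{2}{r}\,\partial_r B
     + \frac{2ik}{r}\,B + \frac{1}{r^2}\,\Delta_S B = 0 .
\end{align*}
% Insert B = r^{-1}a + r^{-2}a_2 + O(r^{-3}) and collect powers of r:
\begin{align*}
r^{-2}: &\quad -2ik\,a + 2ik\,a = 0, \\
r^{-3}: &\quad 2a - 4ik\,a_2 - 2a + 2ik\,a_2 + \Delta_S a = 0
   \;\Longrightarrow\; 2ik\,a_2 = \Delta_S a .
\end{align*}
```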


Hence

$4ik\displaystyle\int_{S^2} a_2(\omega, \omega, k)\,d\omega = \int_{S^2}(\Delta_1 + \Delta_2)a(\omega, \omega, k)\,d\omega = 0$.

This disposes of the middle term in (8.18), upon integration with respect to $\omega$. Thus, in addition to (8.16) and (8.19), the last term in (8.18) remains. Consequently, we have


oe Se 2 
nO // (\w4(a, kw)|? — 1) da dw 


BrxS? 
42 
(8.21) = Re Haw aoe ( w, 0, k) do — ayes wk) 
Ss 
k Oa 
(one yA WwW, wk) ds 


On the other hand, $-(1/2\pi i)S(k)(dS^*/dk)$ has integral kernel


—1 1 —k 0G 

Tri Dri) + Sapa OH) 

+ wa f aw.r, k)a(r, 0, k) dr 
9 


(8.22) Z 


+5 fe (w, 7, k) oar, 7) ,k) ar}. 


S2 


Noting that the trace of $-(1/2\pi i)S(k)(dS^*/dk)$ must also be real, one sees that (8.21) is equal to the trace of this operator, which proves the following:


Proposition 8.2. For even $\varphi \in C_0^\infty(\mathbb{R})$,


23) Tr [o(V=8)- o(V=Ae)] = - f * so(de)s!(k) dk, 
with

(8.24) $s'(k) = \dfrac{1}{2\pi i}\operatorname{Tr}\big(S(k)^*S'(k)\big) = -\dfrac{1}{2\pi i}\operatorname{Tr}\big(S(k)S'(k)^*\big)$,

where $S(k)$ is the scattering operator (3.7).


An equivalent characterization of (8.24) is $s'(k) = ds(k)/dk$, with

(8.25) $s(k) = \dfrac{1}{2\pi i}\log\det S(k) = \dfrac{1}{2\pi}\arg\det S(k)$.


The quantity $s(k)$ is called the scattering phase. It is real, for $k \in \mathbb{R}$, since the scattering operator is unitary. To give yet another formulation, if we set

(8.26) $D(k) = \det S(k)$,

then

(8.27) $s'(k) = \dfrac{1}{2\pi i}\,\dfrac{D'(k)}{D(k)}$.

By both (8.24) and (8.27) it is clear that $s'(k)$ extends from $k \in \mathbb{R}$ to a meromorphic function in the plane, with poles coinciding precisely with the poles of the scattering operator and their complex conjugates. For complex $k$, one replaces (8.24) by

$s'(k) = \dfrac{1}{2\pi i}\operatorname{Tr}\big(S(\bar{k})^*S'(k)\big)$.

As stated, Propositions 8.1 and 8.2 apply in disjoint situations, but note that the left side of (8.23) is defined for any even $\varphi \in \mathcal{S}(\mathbb{R})$ and defines a continuous linear functional of such $\varphi$. Thus the right side of (8.23) is well defined, at least in a distributional sense; in particular, we have $s' \in \mathcal{S}'(\mathbb{R})$. Also, replacing $\hat{\rho}$ by its even part on the left side of (8.5) leaves this quantity unchanged. We deduce the following.


Proposition 8.3. Let $\rho \in C_0^\infty((2R, \infty))$. Then
with 


(8.28) + be) = / * pt(k)s(h) dk, 


the integral interpreted a priori in the sense of tempered distributions. 


In view of (8.27), this identity can be thought of as a “formal” consequence of 
the residue calculus, but a rigorous proof seems to require arguments as described 
above. 

It can be proved that the integral above is actually absolutely convergent. Indeed, it has been shown that $s(k)$ has the asymptotic behavior

(8.29) $s(k) = C(\operatorname{vol} K)k^3 + O(k^2)$, as $k \to \infty$, in $\mathbb{R}$.

This was established for $K$ strictly convex by A. Majda and J. Ralston [MjR], and for $K$ starshaped by A. Jensen and T. Kato [JeK]. We outline a proof for the starshaped case in the exercises (with a weaker remainder estimate).

The result (8.29) was extended to "nontrapping" $K$ by V. Petkov and G. Popov [PP] and finally to general smooth $K$ by Melrose [Me3]. Also, results of Melrose [Me1] extend (8.28) to all $\rho \in C_0^\infty(\mathbb{R}^+)$.


Exercises 


1. Use the formula (8.24) to establish the following formula for s’(k): 


s'(k) =c [ [(e-v)|F* 0)" a dS (x) 


OK g2 


(8.30) 
= / ((w- v)N (keno N’ (keke) par) 4. 
S2 


2. Conclude that if $K$ is starshaped, so one can arrange $x \cdot \nu \ge 0$, then $s(k)$ is monotone.
3. Set $s(k) = -s(-k)$, $k \in \mathbb{R}$, as in Proposition 8.3. Show that if $K$ is starshaped, $s'(k)$ is a positive function that defines a tempered distribution on $\mathbb{R}$ and hence that $s(k)$ has a polynomial bound in $k$:
$|s(k)| \le C\langle k\rangle^M$.


4. Write (8.23) in the form 


Te[e(V=A) ~ o(V=Ba)] = Ff” "(bs al, 


—oo 


for even $\varphi \in \mathcal{S}(\mathbb{R})$. If $K$ is starshaped, Exercise 3 implies that the integral on the right is absolutely convergent. Use $\varphi(k) = \varphi_t(k) = e^{-tk^2}$ and the results on heat kernel asymptotics of Chap. 7 to deduce that
(8.31) tf et” ks(k) dk = (4nt)~?/? vol K + o(t7*/?), 


as $t \searrow 0$.
5. Show that Karamata's Tauberian theorem (established in §3 of Chap. 8) applies to (8.31) to yield
$s(k) = C(\operatorname{vol} K)k^3 + o(k^3), \quad k \to \infty$.


Evaluate C. 


9. Scattering by a sphere 


In this section we analyze solutions to problems of scattering by the unit sphere $S^2 \subset \mathbb{R}^3$, starting with the scattering problem

(9.1) $(\Delta + k^2)v = 0$ on $\Omega$, $\quad v = f$ on $S^2$, $\quad r(\partial_r v - ikv) \to 0$, as $r \to \infty$,


where $\Omega = \{x \in \mathbb{R}^3 : |x| > 1\}$, the complement of the unit ball. We start by considering real $k$. This problem can be solved by writing the Laplace operator $\Delta$ on $\mathbb{R}^3$ in polar coordinates,

(9.2) $\Delta = \partial_r^2 + 2r^{-1}\partial_r + r^{-2}\Delta_S$,

where $\Delta_S$ is the Laplace operator on the sphere $S^2$. Thus $v$ in (9.1) satisfies

(9.3) $r^2\partial_r^2 v + 2r\partial_r v + (k^2r^2 + \Delta_S)v = 0$,

for $r > 1$. In particular, if $\{\varphi_j\}$ is an orthonormal basis of $L^2(S^2)$ consisting of eigenfunctions of $\Delta_S$, with eigenvalue $-\lambda_j^2$, and we write

(9.4) $v(r\omega) = \sum_j v_j(r)\varphi_j(\omega), \quad r \ge 1$,

then the functions $v_j(r)$ satisfy

(9.5) $r^2v_j''(r) + 2rv_j'(r) + (k^2r^2 - \lambda_j^2)v_j(r) = 0, \quad r > 1$.


As in (1.14), this is a modified Bessel equation, and the solution satisfying the radiation condition $r(v_j'(r) - ikv_j(r)) \to 0$ as $r \to \infty$ is of the form

(9.6) $v_j(r) = a_j r^{-1/2}H^{(1)}_{\nu_j}(kr)$,

where $H^{(1)}_\nu(\lambda)$ is the Hankel function, which arose in the proof of Lemma 1.2. We recall from (6.33) of Chap. 3 the integral formula

(9.7) $H^{(1)}_\nu(z) = \Big(\dfrac{2}{\pi z}\Big)^{1/2}\dfrac{e^{i(z - \pi\nu/2 - \pi/4)}}{\Gamma(\nu + \frac12)}\displaystyle\int_0^\infty e^{-s}s^{\nu-1/2}\Big(1 - \frac{s}{2iz}\Big)^{\nu-1/2}\,ds$.

This is valid for $\operatorname{Re}\nu > -1/2$ and $-\pi/2 < \arg z < \pi$. Also, in (9.6), $\nu_j$ is given by

(9.8) $\nu_j = \Big(\lambda_j^2 + \dfrac14\Big)^{1/2}$.

The coefficients $a_j$ in (9.6) are determined by the boundary condition $v_j(1) = (f, \varphi_j)$, so

(9.9) $a_j = \dfrac{(f, \varphi_j)}{H^{(1)}_{\nu_j}(k)}$.

Using these calculations, we can write the solution operator $B(k)$ to (9.1), $v = B(k)f$, as follows. Introduce the self-adjoint operator

(9.10) $A = \Big(-\Delta_S + \dfrac14\Big)^{1/2}$,

so

(9.11) $A\varphi_j = \nu_j\varphi_j$.

Then

(9.12) $B(k)f(r\theta) = r^{-1/2}\kappa(A, k, kr)f(\theta)$,

where $\kappa(\nu, k, kr) = H^{(1)}_\nu(kr)/H^{(1)}_\nu(k)$ and, for each $k, r$, $\kappa(A, k, kr)$ is regarded as a function of the self-adjoint operator $A$. For convenience, we use the notation

(9.13) $B(k)f(r\theta) = r^{-1/2}\,\dfrac{H^{(1)}_A(kr)}{H^{(1)}_A(k)}\,f(\theta), \quad \theta \in S^2$.

Similar families of functions of the operator $A$ will arise below.
Taking the $r$-derivative of (9.13), we have the following formula for the Neumann operator:

(9.14) $\mathcal{N}(k)f(\theta) = \Big[k\,\dfrac{H^{(1)\prime}_A(k)}{H^{(1)}_A(k)} - \dfrac12\Big]f(\theta)$.

We also denote the operator on the right by $kQ(A, k) - 1/2$, with

(9.15) $Q(\nu, k) = \dfrac{H^{(1)\prime}_\nu(k)}{H^{(1)}_\nu(k)}$.

We will want to look at the Green function and scattering amplitude, but first we derive some properties of the operators (9.13) and (9.14) which follow from the special nature of the operator defined by (9.10). The analysis of the spectrum of the Laplace operator on $S^2$ given in Chap. 8 shows that

(9.16) $\operatorname{spec} A = \{m + \tfrac12 : m = 0, 1, 2, \dots\}$.

Now, as shown in Chap. 3, $H^{(1)}_{m+1/2}(\lambda)$ and the other Bessel functions of order $m + 1/2$ are all elementary functions of $\lambda$. We have

(9.17) $H^{(1)}_{m+1/2}(\lambda) = \Big(\dfrac{2\lambda}{\pi}\Big)^{1/2}h_m(\lambda)$,

where


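(9.18) $h_m(\lambda) = -i(-\lambda)^m\Big(\dfrac{1}{\lambda}\dfrac{d}{d\lambda}\Big)^m\Big(\dfrac{e^{i\lambda}}{\lambda}\Big) = \lambda^{-m-1}p_m(\lambda)e^{i\lambda}$,

and $p_m(\lambda)$ is a polynomial of order $m$ in $\lambda$, given by

(9.19) $p_m(\lambda) = i^{-m-1}\displaystyle\sum_{k=0}^m \Big(\frac{i}{2}\Big)^k\frac{(m+k)!}{k!\,(m-k)!}\,\lambda^{m-k} = i^{-m-1}\lambda^m + \cdots + i^{-1}\,\frac{(2m)!}{2^m\,m!}$.

Consequently,

(9.20) $r^{-1/2}\kappa\big(m + \tfrac12, k, kr\big) = \dfrac{h_m(kr)}{h_m(k)} = r^{-m-1}e^{ik(r-1)}\,\dfrac{p_m(kr)}{p_m(k)}$

and

(9.21) $kQ\big(m + \tfrac12, k\big) = ik - \big(m + \tfrac12\big) + k\,\dfrac{p_m'(k)}{p_m(k)}$.

Each polynomial $p_m(\lambda)$ has $m$ complex zeros $\{\zeta_{m1}, \dots, \zeta_{mm}\}$, by the fundamental theorem of algebra, and the collection of all these $\zeta_{mj}$ is clearly the set of scattering poles for $S^2$. Note that (9.21) can be written as

(9.22) $kQ\big(m + \tfrac12, k\big) = ik - \big(m + \tfrac12\big) + \displaystyle\sum_{j=1}^m \frac{k}{k - \zeta_{mj}}$.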

We now look at the expression for the Green kernel $G(x, y, k)$ for the operator $(\Delta + k^2)^{-1}$, for $k$ real. Thus we look for a solution to

(9.23) $(\Delta + k^2)u = f$ on $\Omega$, $\quad u = 0$ on $\partial K$,

satisfying the radiation condition at infinity, given $f \in C_0^\infty(\Omega)$. If we write

(9.24) $f(r\theta) = \sum_j f_j(r)\varphi_j(\theta)$,

using the eigenfunctions $\varphi_j$ as before, and

(9.25) $u(r\theta) = \sum_j u_j(r)\varphi_j(\theta)$,

then the functions $u_j(r)$ satisfy


(9.26) $r^2u_j''(r) + 2ru_j'(r) + (k^2r^2 - \lambda_j^2)u_j(r) = r^2f_j(r), \quad r > 1$,

together with the boundary condition $u_j(1) = 0$ and, as a consequence of the radiation condition, $r(u_j'(r) - iku_j(r)) \to 0$ as $r \to \infty$. We will write the solution in the form

(9.27) $u_j(r) = \displaystyle\int_1^\infty G_{\nu_j}(r, s, k)f_j(s)s^2\,ds$,

where the kernel $G_\nu(r, s, k)$ remains to be constructed, as the Green kernel for the ordinary differential operator

(9.28) $L_\nu = \dfrac{d^2}{dr^2} + \dfrac{2}{r}\dfrac{d}{dr} + \Big(k^2 - \dfrac{\lambda^2}{r^2}\Big), \quad \nu = \Big(\lambda^2 + \dfrac14\Big)^{1/2}$;

that is,

(9.29) $L_\nu G_\nu(\cdot, s, k) = s^{-2}\delta_s$, on $(1, \infty)$,

satisfying the boundary condition of vanishing at $r = 1$, together with the radiation condition as $r \to \infty$. This operator is self-adjoint on the space $L^2([1, \infty), r^2\,dr)$, and $G_\nu(r, s, k)$ satisfies the symmetry condition

(9.30) $G_\nu(r, s, k) = G_\nu(s, r, k)$.

Thus it suffices to specify $G_\nu(r, s, k)$ for $r > s$. Since $G_\nu(\cdot, s, k)$ is annihilated by $L_\nu$ for $r > s$ and satisfies the radiation condition, we must have

(9.31) $G_\nu(r, s, k) = c_\nu(s, k)\,r^{-1/2}H^{(1)}_\nu(rk)$, for $r > s$,

for some coefficient $c_\nu(s, k)$ that remains to be determined. In view of the symmetry (9.30), $c_\nu(\cdot, k)$ satisfies the same sort of modified Bessel equation, and so is a linear combination of $s^{-1/2}J_\nu(sk)$ and $s^{-1/2}H^{(1)}_\nu(sk)$. The boundary condition gives $c_\nu(s, k) = 0$ at $s = 1$, so we can write

(9.32) $c_\nu(s, k) = b_\nu(k)\,s^{-1/2}\Big(J_\nu(sk) - \dfrac{J_\nu(k)}{H^{(1)}_\nu(k)}H^{(1)}_\nu(sk)\Big)$,

where the coefficient $b_\nu(k)$ remains to be determined. This can be done by plugging (9.32) into (9.31), using (9.30) to write $G_\nu(r, s, k)$ for $r < s$, and examining the jump in the first derivative of $G_\nu$ with respect to $r$ across $r = s$. Achieving (9.29) then specifies $b_\nu(k)$ uniquely. A straightforward calculation shows that $b_\nu(k)$ is the following constant, independent of $\nu$ and $k$, in view of the Wronskian relation:


(9.33) $b_\nu(k) = b = \dfrac{(sk)^{-1}}{J_\nu(sk)H^{(1)\prime}_\nu(sk) - J_\nu'(sk)H^{(1)}_\nu(sk)} = \dfrac{\pi}{2i}$.
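
The "straightforward calculation" can be sketched as follows. This is an illustration under two standard facts assumed here rather than taken from the text: the Wronskian $J_\nu(z)Y_\nu'(z) - J_\nu'(z)Y_\nu(z) = 2/(\pi z)$ and the definition $H^{(1)}_\nu = J_\nu + iY_\nu$. The abbreviation $c$ is introduced only for this computation.

```latex
% Jump of \partial_r G_\nu across r = s. The factor (rs)^{-1/2} is continuous
% there, so its derivative contributes equally on both sides and cancels.
\begin{align*}
\partial_r G_\nu\big|_{r=s+} - \partial_r G_\nu\big|_{r=s-}
 &= \frac{b\,k}{s}\Bigl[\bigl(J_\nu - c\,H^{(1)}_\nu\bigr)H^{(1)\prime}_\nu
      - \bigl(J_\nu' - c\,H^{(1)\prime}_\nu\bigr)H^{(1)}_\nu\Bigr](sk),
   \qquad c = \frac{J_\nu(k)}{H^{(1)}_\nu(k)}, \\
 &= \frac{b\,k}{s}\Bigl[J_\nu H^{(1)\prime}_\nu - J_\nu' H^{(1)}_\nu\Bigr](sk)
  = \frac{b\,k}{s}\cdot\frac{2i}{\pi s k}
  = \frac{2ib}{\pi s^2}.
\end{align*}
% (9.29) requires this jump to equal s^{-2}, hence b = \pi/(2i),
% independent of \nu and k.
```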


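To summarize, $G_\nu(r, s, k)$ is given by

(9.34)
$b\,(rs)^{-1/2}\Big(J_\nu(sk) - \dfrac{J_\nu(k)}{H^{(1)}_\nu(k)}H^{(1)}_\nu(sk)\Big)H^{(1)}_\nu(rk), \quad r \ge s$,
$b\,(rs)^{-1/2}\Big(J_\nu(rk) - \dfrac{J_\nu(k)}{H^{(1)}_\nu(k)}H^{(1)}_\nu(rk)\Big)H^{(1)}_\nu(sk), \quad r < s$.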

In light of this, we can represent the Green kernel for the solution to (9.23) satisfying the radiation condition as follows. Using the Schwartz kernel theorem, we can identify an operator on functions on $\Omega$ with a (generalized) function of $r, s$ with values in the space of operators on functions on the sphere $S^2$. With this identification, we have


1 
(9.35) G(x, y, k) — —Galt,s,§), 
An 


where $|x| = r$, $|y| = s$, and $A$ is given by (9.10). This is also the formula for the resolvent kernel of $(\Delta + k^2)^{-1}$, for $\operatorname{Im} k > 0$.

The formula (9.34) for $G_\nu(r, s, k)$, as a sum of two terms, corresponds to the decomposition (1.30) for $G(x, y, k)$, that is,


(9.36) $G(x, y, k) = g(x, y, k) + h(x, y, k)$,

where, as in (1.5),

(9.37) $g(x, y, k) = \dfrac{e^{ik|x-y|}}{4\pi|x-y|}$.


Now recall from Proposition 1.6 how we can obtain the eigenfunctions

(9.38) $u_+(x, \xi) = e^{-ix\cdot\xi} + v(x, \xi)$

from the asymptotic behavior of $G(x, y, k)$ as $|y| \to \infty$, via

(9.39) $h(x, r\omega, k) = \dfrac{e^{ikr}}{4\pi r}\,v(x, k\omega) + O(r^{-2}), \quad r \to \infty$,

proved in (1.37). We therefore have


(9.40) v(r0, kw) = lim se~**ha(r,s,k), 


8—0O 


where we set 




(9.41) h,(r, 8, k) = 40 b(rs)-0/2 2) 0) (sk) HO (rk). 
HS (k) 


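As before we identify a function of $(\theta, \omega)$ with an operator on $C^\infty(S^2)$, with $A$ acting on functions of $\theta$. To evaluate the limit in (9.40), we can use

(9.42) $H^{(1)}_\nu(\lambda) = \Big(\dfrac{2}{\pi\lambda}\Big)^{1/2}e^{i(\lambda - \pi\nu/2 - \pi/4)} + O(\lambda^{-3/2}), \quad \lambda \to \infty$,

which can be deduced from the integral formula (9.7). We obtain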

(9.43) $v(r\theta, k\omega) = V(A, r, k)$,


where 


2 1/2 . : 
(9.44) -V(v,r,k) = 2074 (—) ea /2ynitvgt/2) F(R) H (rk). 
ark HS” (k) 


We can now evaluate the scattering amplitude, which satisfies

(9.45) $a(-\omega, \theta, k) = \displaystyle\lim_{r\to\infty} r\,e^{-ikr}\,v(r\theta, k\omega)$,

according to (1.41). Using (9.42)-(9.44), we have


(9.46) $a(-\omega, \theta, k) = \dfrac{4\pi}{k}\,\dfrac{J_A(k)}{H^{(1)}_A(k)}\,e^{-\pi iA}$.


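In other words, if the right side is $\Xi(A)$, then

$\Xi(A)f(\theta) = \displaystyle\int_{S^2} a(-\omega, \theta, k)f(\omega)\,dS(\omega)$.

Now, as shown in the study of harmonic analysis on spheres, in (4.44) of Chap. 8,

(9.47) $e^{-\pi iA}f(\omega) = -i\,f(-\omega), \quad f \in L^2(S^2)$,

so we can write

(9.48) $a(\omega, \theta, k) = -\dfrac{4\pi i}{k}\,\dfrac{J_A(k)}{H^{(1)}_A(k)}$.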


Recall that the scattering amplitude $a(\omega, \theta, k)$ is related to the scattering operator $S(k)$ by

(9.49) $S(k) = I + \dfrac{k}{2\pi i}A(k)$,

where $a(\omega, \theta, k)$ is the kernel of $A(k)$, by (3.14)-(3.15). In other words, $A(k)$ is the operator on the right side of (9.48). Therefore, the scattering operator itself has the form

(9.50) $S(k) = -\dfrac{H^{(2)}_A(k)}{H^{(1)}_A(k)}$,

in view of the identity

(9.51) $H^{(1)}_\nu(\lambda) + H^{(2)}_\nu(\lambda) = 2J_\nu(\lambda)$.

We also note that

(9.52) $H^{(2)}_\nu(k) = \overline{H^{(1)}_\nu(k)}$,

for $\nu$ and $k$ real, so (9.50) explicitly displays the unitarity of the scattering operator, for real $k$.
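
The unitarity can be checked numerically on each space of spherical harmonics. The sketch below assumes the reading $S(k) = -H^{(2)}_A(k)/H^{(1)}_A(k)$ of (9.50) together with (9.17)-(9.19), so that on harmonics of degree $m$ (where $A = m + 1/2$) the operator $S(k)$ acts as the scalar $-\overline{h_m(k)}/h_m(k)$ for real $k > 0$. The function names are illustrative only.

```python
import cmath
from math import factorial


def p(m, z):
    """Polynomial p_m(z), using the sum in (9.19)."""
    return 1j ** (-m - 1) * sum(
        (0.5j) ** k * factorial(m + k) / (factorial(k) * factorial(m - k)) * z ** (m - k)
        for k in range(m + 1))


def h(m, z):
    """h_m(z) = z^{-m-1} p_m(z) e^{iz}, as in (9.18)."""
    return z ** (-m - 1) * p(m, z) * cmath.exp(1j * z)


def S_eigenvalue(m, k):
    """Eigenvalue of S(k) on degree-m spherical harmonics, for real k > 0.

    Assumes (9.50) and (9.52): the real prefactor (2k/pi)^{1/2} in (9.17)
    cancels in the ratio, and H^(2) is the complex conjugate of H^(1).
    """
    hm = h(m, k)
    return -hm.conjugate() / hm


if __name__ == "__main__":
    for m in range(4):
        print(m, S_eigenvalue(m, 2.0), abs(S_eigenvalue(m, 2.0)))
```

For $m = 0$ this reduces to $e^{-2ik}$, and for $m = 1$ to $-e^{-2ik}(k - i)/(k + i)$; each has modulus one for real $k$, and the denominator of the $m = 1$ eigenvalue vanishes at the scattering pole $k = -i$.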

The investigation of scattering by a sphere can be carried further, based on these formulas. For example, qualitative information on the zeros of $H^{(1)}_\nu(\lambda)$ yields qualitative information on the scattering poles. Some of the most delicate results on such scattering make use of the uniform asymptotic behavior of $H^{(1)}_\nu(\lambda)$ as $\nu$ and $\lambda$ both tend to $\infty$. A treatment of this in a modern spirit, touching on more general approaches to diffraction problems, is given in [T2] and [MT1], and, in more detail, in Appendix C of [MT2]. Also, [Nus] gives a lengthy analysis of scattering by a sphere, from a more classical perspective.


Exercises 
1. Derive from (9.7) that $H^{(1)}_{m+1/2}(z) = (2z/\pi)^{1/2}z^{-m-1}p_m(z)e^{iz}$, with

$p_m(z) = \dfrac{i^{-m-1}}{m!}\displaystyle\int_0^\infty e^{-s}\Big(zs - \frac{s^2}{2i}\Big)^m\,ds$.

Show that this yields (9.19).
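2. From the material on Bessel functions developed in Chap. 3, show that there is the Wronskian identity

$H^{(1)\prime}_\nu(\lambda)H^{(2)}_\nu(\lambda) - H^{(1)}_\nu(\lambda)H^{(2)\prime}_\nu(\lambda) = \dfrac{C}{\lambda}$,

and evaluate $C$. Using this, prove that $H^{(1)}_\nu(\lambda)$ is not zero for any $\lambda \in (0, \infty)$, $\nu \in (0, \infty)$.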

3. Use results on the location of scattering poles from §7 to show that (9.13) and (9.14) imply $H^{(1)}_\nu(z)$ has zeros only in $\operatorname{Im} z < 0$, for $\nu = m + 1/2$, $m = 0, 1, 2, \dots$. It is known that this property holds for all $\nu \in [0, \infty)$. See [Wat], p. 511. There it is stated in terms of the zeros of $K_\nu(z)$, which is related to the Hankel function by $K_\nu(z) = (\pi i/2)e^{\pi i\nu/2}H^{(1)}_\nu(iz)$.
4. A formula of Nicholson (see [Olv], p. 340, or [Wat], p. 444) implies

$J_\nu(z)^2 + Y_\nu(z)^2 = \dfrac{8}{\pi^2}\displaystyle\int_0^\infty K_0(2z\sinh t)\cosh 2\nu t\,dt$,

for $\operatorname{Re} z > 0$. Here $K_0(r)$ is Macdonald's function, the $\nu = 0$ case of the function mentioned in Exercise 3; cf. (6.50)-(6.54) in Chap. 3. $K_0(r)$ is a decreasing function of $r \in (0, \infty)$, and hence, for fixed $\nu > 0$, $J_\nu(x)^2 + Y_\nu(x)^2$ is a decreasing function of $x \in \mathbb{R}^+$. Show that this implies that

$B(k, r) : L^2(S^2) \longrightarrow L^2(S^2)$,

defined by $B(k, r)f(\theta) = B(k)f(r\theta)$, has operator norm $\le r^{-1/2}$, for $r \ge 1$. Consequently,

(9.53) $\|B(k)f\|_{L^2(\Omega, \langle x\rangle^{-4}dx)} \le \|f\|_{L^2(S^2)}$.


Using the integral formula (6.50) of Chap. 3, show that ro(r) is decreasing on RT, 
hence that |r/?, HS" (r)| is decreasing on R*, for fixed v > 0. Use this to show that 
\|B(k,r)|| <7? for r > 1, and sharpen (9.53). 

5. Let $\mathcal{A} = \{x \in \mathbb{R}^3 : 1 < |x| < 2\}$. With $u = B(k)f$, use $\Delta u = -k^2 u$ and estimates
derivable from Chap. 5, in concert with Exercise 4, to show that

(9.54) $\|B(k)f\|_{H^2(\mathcal{A})} \le C\|f\|_{H^{3/2}(S^2)} + Ck^2 \|f\|_{L^2(S^2)}.$

Deduce that

(9.55) $\|N(k)f\|_{H^{1/2}(S^2)} \le C\|f\|_{H^{3/2}(S^2)} + Ck^2 \|f\|_{L^2(S^2)}.$


6. Show that

$|k\, Q(m + 1/2, k)| \le C(|k| + m + 1),$

for $k \in \mathbb{R}$, $m \ge 0$. Deduce that, for $s \in \mathbb{R}$, $k \in \mathbb{R}$,

(9.56) $\|N(k)f\|_{H^s(S^2)} \le C_s \|f\|_{H^{s+1}(S^2)} + C_s |k| \cdot \|f\|_{H^s(S^2)}.$

Compare this with the bound on $N(k)$ derived in the previous exercise.
(Hint: Consider uniform asymptotic expansions of Bessel and Hankel functions, discussed
in [Erd] and in Chap. 11 of [Olv]. Compare a related analysis in [T2].)

7. Suppose an obstacle $K$ is contained in the unit ball $B_1 = \{|x| < 1\}$. Show that
the solution to the scattering problem (1.1)-(1.3) is uniquely characterized on
$\Omega_1 = (\mathbb{R}^3 \setminus K) \cap B_1$ as the solution to

(9.57) $(\Delta + k^2)v = 0$ on $\Omega_1$, $\quad v = f$ on $\partial K$, $\quad \dfrac{\partial v}{\partial r} = N(k)v$ on $S^2$,

where $N(k)$ is given by (9.14).
8. Derive the formulas of this section, particularly the formula analogous to (9.50) for
$S(k)$, in the case of scattering by a sphere of radius $R$, centered at $p \in \mathbb{R}^3$, displaying
explicitly the dependence of the various quantities on $R$ and $p$.


9. It follows from (9.46)-(9.48) that the scattering amplitude for $S^2$ satisfies

(9.58) $a(\omega,\theta,k) = a(\theta,\omega,k)$ and $a(\omega,\theta,k) = a(-\omega,-\theta,k).$

Demonstrate these identities directly, for $\partial K = S^2$. How much more generally do
they hold? Compare (3.31).
10. Suppose $v \in C^\infty(\mathbb{R}^3)$ solves $(\Delta + k^2)v = 0$. Show that $v(r\theta)$ has the form

$v(r\theta) = \sum_j v_j(r)\, \varphi_j(\theta),$

where $\varphi_j$ is an eigenfunction of $\Delta_S$, as in (9.4), and $v_j(r) = b_j\, j_{\nu_j - 1/2}(kr)$.
(Hint: $v_j(r)$ solves (9.5) and does not blow up as $r \to 0$.)
Deduce that, for some coefficients $\beta_\ell$,

(9.59) $e^{ikr\,\omega\cdot\theta} = \sum_{\ell=0}^{\infty} \beta_\ell\, j_\ell(kr)\, P_\ell(\omega\cdot\theta),$

where $P_\ell(t)$ are the Legendre polynomials, defined in (4.36) of Chap. 8.
As shown in (4.49) of Chap. 8, this formula holds with $\beta_\ell = (2\ell + 1)i^\ell$, so

(9.60) $e^{ist} = \sum_{\ell=0}^{\infty} (2\ell + 1)\, i^\ell\, j_\ell(s)\, P_\ell(t), \quad s \in \mathbb{R},\ t \in [-1,1].$

11. As noted in §4 of Chap. 8, the identity (9.59), with $\beta_\ell = (2\ell + 1)i^\ell$, is equivalent to
the assertion that $e^{ikr\,\omega\cdot\theta}$ is the integral kernel of


(9.61) Ber (A) = 4m et/2)7HA-1/2) 5s (kr). 


Show that this is in turn equivalent to the r = 1 case of (9.43)-(9.44). 
(Hint: Use (9.47).) 

12. Derive explicit formulas for scattering objects (e.g., $S(k)$), in the case of the quantum
scattering problem for $H = -\Delta + V$, when

$V(x) = b$ for $|x| < R$, and $V(x) = 0$ otherwise.

Keep track of the dependence on $b$ and $R$. If you fix $R = 1$ and let $b$ decrease from
$b = 0$ to the first value $b = -\beta_0$, below which $-\Delta + V$ has a negative eigenvalue,
what happens to some of the scattering poles?
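
The expansion (9.60) from Exercise 10 lends itself to a quick numerical sanity check. The sketch below is only an illustration: it evaluates $j_\ell$ from its power series (adequate for moderate $s$), evaluates $P_\ell$ by the three-term recurrence, and compares a partial sum with $e^{ist}$. The helper names and the sample values of $s$, $t$ are assumptions made for the example.

```python
import cmath

def spherical_jn(l, s, terms=60):
    """Spherical Bessel function j_l(s) via its power series.

    j_l(s) = s^l * sum_n (-s^2/2)^n / (n! (2l+2n+1)!!), fine for moderate |s|.
    """
    term = float(s) ** l
    for m in range(1, l + 1):
        term /= (2 * m + 1)                      # s^l / (2l+1)!!
    total = 0.0
    for n in range(terms):
        total += term
        term *= -(s * s / 2.0) / ((n + 1) * (2 * l + 2 * n + 3))
    return total

def legendre(l, t):
    """Legendre polynomial P_l(t) via (n+1)P_{n+1} = (2n+1) t P_n - n P_{n-1}."""
    p_prev, p_curr = 1.0, t
    if l == 0:
        return p_prev
    for n in range(1, l):
        p_prev, p_curr = p_curr, ((2 * n + 1) * t * p_curr - n * p_prev) / (n + 1)
    return p_curr

def plane_wave_partial_sum(s, t, lmax):
    """Partial sum of the right side of (9.60) up to degree lmax."""
    return sum((2 * l + 1) * (1j ** l) * spherical_jn(l, s) * legendre(l, t)
               for l in range(lmax + 1))

s, t = 2.0, 0.3
approx = plane_wave_partial_sum(s, t, 25)
exact = cmath.exp(1j * s * t)
print(abs(approx - exact))
```

The terms decay faster than geometrically once $\ell$ exceeds $s$, so a modest cutoff already reproduces $e^{ist}$ to near machine precision for these sample values.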


10. Inverse problems I 


By “inverse problems” we mean problems of determining a scatterer $\partial K$ in terms
of information on the scattered waves. These problems are of practical interest.
One might be given observations of the scattered wave $v(x, k\omega)$, for $x$ in a region
not far from $\partial K$, $k$ belonging to some restricted set of frequencies (maybe a
single frequency). Or one might have only the far field behavior, defined by the
scattering amplitude $a(\omega, \theta, k)$, which we recall is related to $v(x, k\omega)$ by

(10.1) $v(r\theta, k\omega) \sim \dfrac{e^{ikr}}{r}\, a(-\omega, \theta, k), \quad r \to \infty.$

In this section we examine the question of what scattering data are guaranteed to
specify $\partial K$ uniquely, at least if the data are measured perfectly.

It is useful to begin with the following explicit connection between the scattered
wave $v(x, k\omega)$ and the scattering amplitude.


Proposition 10.1. If $K \subset B_R(0)$, then, for $r \ge R$,

(10.2) $a(-\omega, \theta, k) = -i k^{-1}\, e^{-(1/2)\pi i (A - 1/2)}\, h_{A-1/2}(kr)^{-1}\, g(\theta),$

where $g = g_{r,\omega,k}$ is given by

(10.3) $g(\theta) = v(r\theta, k\omega).$

As in §9, $h_{A-1/2}(kr)^{-1}$ is regarded as a family of functions of the self-adjoint
operator $A$ defined by (9.10). Recall that $h_m(\lambda)$ is given by (9.17)-(9.18).


Proof. This result follows easily from (9.12), which implies

(10.4) $B(k)f(r\theta) = \dfrac{h_{A-1/2}(kr)}{h_{A-1/2}(k)}\, f(\theta),$

for $f \in C^\infty(S^2)$. To prove (10.2)-(10.3), we can suppose without loss of generality
that $R = 1$ and apply (10.4) with $f(\theta) = v(\theta, k\omega)$ to get

$v(r\theta, k\omega) = \dfrac{h_{A-1/2}(kr)}{h_{A-1/2}(k)}\, f(\theta), \quad f(\theta) = v(\theta, k\omega).$

Now compare the asymptotic behavior of both sides as $r \to \infty$. For the left side
we have (10.1), while the behavior of the right side is governed by

(10.5) $h_m(kr) \sim i^{-m-1}\, \dfrac{e^{ikr}}{kr},$

by (9.18)-(9.19), so (10.2) follows.
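
The comparison can be spelled out as follows. This is a sketch that assumes (10.5) is read as $h_m(kr) \sim i^{-m-1} e^{ikr}/(kr)$ and applied with $m$ replaced by $A - 1/2$ on each eigenspace of $A$, using $i^{-m-1} = -i\, e^{-\pi i m/2}$:

```latex
\begin{align*}
v(r\theta, k\omega)
  &= \frac{h_{A-1/2}(kr)}{h_{A-1/2}(k)}\, f(\theta) \\
  &\sim \frac{e^{ikr}}{r}\,\Bigl[ -i k^{-1}\, e^{-\frac{1}{2}\pi i (A - 1/2)}\,
      h_{A-1/2}(k)^{-1} f(\theta) \Bigr],
  \qquad r \to \infty .
\end{align*}
```

Matching the bracketed coefficient of $e^{ikr}/r$ with (10.1) identifies it with $a(-\omega,\theta,k)$ in the normalized case $R = 1$, evaluated on the unit sphere.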
Now we can invert the operator in (10.2), to write

(10.6) $v(r\theta, k\omega) = ik\, e^{(1/2)\pi i (A - 1/2)}\, h_{A-1/2}(kr)\, a(-\omega, \theta, k),$

where the operator acts on functions of $\theta$. The operator $h_{A-1/2}(kr)$ is an unbounded
operator on $L^2(S^2)$; indeed, it is not continuous from $C^\infty(S^2)$ to
$\mathcal{D}'(S^2)$, which has consequences for the inverse problem, as we will see in §11.




FIGURE 10.1 Two Obstacles 


Suppose now that $K_1$ and $K_2$ are two compact obstacles in $\mathbb{R}^3$ giving rise
to scattered waves which both agree with $v(x, k\omega)$ in some open set $\mathcal{O}$ in
$\mathbb{R}^3 \setminus (K_1 \cup K_2)$. In other words $v_j(x, k\omega) = v(x, k\omega)$ for $x \in \mathcal{O}$, where the
functions $v_j$ are solutions to

(10.7) $(\Delta + k^2)v_j = 0$ on $\mathbb{R}^3 \setminus K_j$, $\quad v_j = -e^{-ikx\cdot\omega}$ on $\partial K_j$,

satisfying the radiation condition. We suppose the sets $K_j$ have no “cavities”; that
is, each $\Omega_j = \mathbb{R}^3 \setminus K_j$ has just one connected component. In this case, possibly
the complement of $K_1 \cup K_2$ is not connected; cf. Fig. 10.1. We will let $\mathcal{U}$ denote
the unbounded, connected component of this complement, and consider $\mathbb{R}^3 \setminus \mathcal{U}$,
which we denote by $\widetilde{K}$, so $K_j \subset \widetilde{K}$. This is illustrated in Figs. 10.1 and 10.2.
We assume $\mathcal{O} \subset \mathcal{U}$. Let $\mathcal{R}$ be any connected component of the interior of $\widetilde{K} \setminus K_1$.
(Switch indices if $\widetilde{K} \subset K_1$.)

The functions $v_1$ and $v_2$ described above agree on $\mathcal{U}$, since they are real analytic
and agree on $\mathcal{O}$. Thus $u_1$ and $u_2$ agree on $\mathcal{U}$, where $u_j(x) = v_j(x) + e^{-ikx\cdot\omega}$.
Since each $u_j$ vanishes on $\partial K_j$, it follows that $u = u_1|_{\mathcal{R}}$ vanishes on $\partial\mathcal{R}$, so

(10.8) $(\Delta + k^2)u = 0$ on $\mathcal{R}$, $\quad u = 0$ on $\partial\mathcal{R}$.

In fact, $u \in H^1_0(\mathcal{R})$. However, $u$ does not vanish identically on $\mathcal{R}$. In particular,
$u$ provides an eigenfunction of $\Delta$ on each connected component $\mathcal{R}$ of the interior
of $\widetilde{K} \setminus K_1$, with Dirichlet boundary condition (and with eigenvalue $-k^2$) if $u$
is not identically zero, and if the symmetric difference $K_1 \triangle K_2$ has nonempty
interior. Now, there are circumstances where we can obtain bounds on

(10.9) $\dim \ker\, (\Delta + k^2)\big|_{H^1_0(\mathcal{R})} = d(k);$

for example, if we know the obstacle is contained in a ball $B_R$. We then have the
following uniqueness result:




FIGURE 10.2 Filled Obstacle 


Proposition 10.2. Let $k \in (0,\infty)$ be fixed. Suppose $\Sigma = \{\omega_\ell\}$ is a subset of $S^2$
whose cardinality is known to be greater than $d(k)/2$. (If $\omega_\ell$ and $-\omega_\ell$ both belong
to $\Sigma$, do not count them separately.) Then knowledge of $v(x, k\omega_\ell)$ for $x$ in an
open set $\mathcal{O}$ uniquely determines the obstacle $K$. Hence knowledge of $a(-\omega, \theta, k)$
for $\theta \in S^2$ uniquely determines $K$.


Proof. If $K$ were not uniquely determined, there would be a nonempty set $\mathcal{R}$ such
as described above. The corresponding $u_\ell(x) = v(x, k\omega_\ell) + e^{-ikx\cdot\omega_\ell}$, together with
their complex conjugates, which are all eigenfunctions on $\mathcal{R}$, must be linearly
independent. Indeed, any linear dependence relation valid on $\mathcal{R}$ must continue on
all of $\mathbb{R}^3 \setminus (K_1 \cap K_2)$; but near infinity, $u_\ell(x) = e^{-ikx\cdot\omega_\ell} + O(|x|^{-1})$ guarantees
independence.


We make a few complementary remarks. First, $a(-\omega, \theta, k)$ is analytic in its
arguments, so for any given $\omega$, $k$, it is uniquely determined by its behavior for $\theta$
in any open subset of $S^2$. Next, for $k$ small enough, we can say that $d(k) = 0$,
so uniqueness holds in that case, for a single $\omega = \omega_1$. Note that even when $k^2$ is
an eigenvalue of $-\Delta$ on $\mathcal{R}$, it would be a real coincidence for a corresponding
eigenfunction to happen to continue to $\mathbb{R}^3 \setminus (K_1 \cap K_2)$ with the appropriate
behavior at infinity. It is often speculated that knowledge of $a(-\omega, \theta, k)$, for $\theta \in S^2$
(or an open set) and both $k$ and $\omega$ fixed, always uniquely determines the obstacle
$K$. This remains an interesting open problem.

Furthermore, suppose $a(-\omega, \theta, k)$ is known on $\theta \in S^2$, for a set $\{\omega_\ell\} \subset S^2$
and a set $\{k_m\} \subset \mathbb{R}^+$. Then one has uniqueness provided
card$\{\omega_\ell\} > \min d(k_m)/2$. In particular, if $\{k_m\}$ consists of an interval $I$ (of nonzero length),
then $\min d(k_m) = 0$, so knowledge of $a(-\omega, \theta, k)$ for $\theta \in S^2$, $k \in I$, and a
single $\omega$ uniquely determines $K$.
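
To illustrate how an a priori bound on $d(k)$ can be produced, suppose (hypothetically) that the obstacle is known to lie in a cube of side 1 rather than in a ball. The Dirichlet eigenvalues of $-\Delta$ on that cube are $\pi^2(a^2 + b^2 + c^2)$ with $a, b, c$ positive integers, and by domain monotonicity of Dirichlet eigenvalues the number of them that are $\le k^2$ bounds the number of Dirichlet eigenvalues $\le k^2$ of any region inside the cube, hence bounds $d(k)$. The sketch below just performs this count; the function name and the sample values of $k$ are illustrative.

```python
import math

def cube_dirichlet_count(k, side=1.0):
    """Count Dirichlet eigenvalues of -Laplacian on a cube that are <= k^2.

    The eigenvalues are (pi/side)^2 * (a^2 + b^2 + c^2), a, b, c = 1, 2, ...,
    counted with multiplicity.
    """
    bound = (k * side / math.pi) ** 2
    nmax = int(math.sqrt(bound)) + 1
    count = 0
    for a in range(1, nmax + 1):
        for b in range(1, nmax + 1):
            for c in range(1, nmax + 1):
                if a * a + b * b + c * c <= bound:
                    count += 1
    return count

for k in (5.0, 6.0, 8.0, 9.5):
    print(k, cube_dirichlet_count(k))
```

When the count is zero, no region inside the cube can carry a Dirichlet eigenfunction with eigenvalue $k^2$, which is the situation $d(k) = 0$ described above.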

All of these considerations are subject to the standing assumption made
throughout this chapter on the smoothness of $\partial K$. There are interesting cases
of non-smooth obstacles, not equal to the closure of their interiors, to which the
proof of Proposition 10.2 would not apply. We will discuss this further in §12.

We also mention that the method used to prove Proposition 10.2 is ineffective
when one has the Neumann boundary condition. A uniqueness result in that case,
using a different technique, can be found in [CK2]; see also [Isa].

One study that sheds light on the inverse problem is the linearized inverse problem.
Here, given an obstacle $K$, denote by $B_K(k)$ the solution operator (7.1)-(7.2)
and by $S_K(k)$ the scattering operator (3.7), with corresponding scattering amplitude
$a_K(\omega, \theta, k)$, as in (3.14)-(3.15). We want to compute the “derivative” with
respect to $K$ of these objects, and study their inverses.

More precisely, if $K$ is given, $\partial K$ smooth, we can parameterize nearby smooth
obstacles by a neighborhood of 0 in $C^\infty(\partial K)$, via the correspondence that, to
$\psi \in C^\infty(\partial K)$ (real-valued), we associate the image $\partial K_\psi$ of $\partial K$ under the map

(10.10) $F_\psi(x) = x + \psi(x) N(x), \quad x \in \partial K,$

where $N(x)$ is the unit outward-pointing normal to $\partial K$, at $x$. Then, denote
$B_{K_\psi}(k)$ and $a_{K_\psi}(\omega, \theta, k)$ by $B_\psi(k)$ and $a_\psi(\omega, \theta, k)$. We want to compute

(10.11) $D_\psi B_K(k)f = \dfrac{\partial}{\partial s}\, B_{s\psi}(k)f \Big|_{s=0}$

and $D_\psi a_K(\omega, \theta, k) = \partial_s a_{s\psi}(\omega, \theta, k)\big|_{s=0}$. The following is a straightforward
exercise.


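Proposition 10.3. If $f$ is smooth near $\partial K$ and $v_\psi(x) = \partial_s B_{s\psi}(k)f\big|_{s=0}$ for
$x \in \mathbb{R}^3 \setminus K$, then $v_\psi(x)$ is uniquely characterized by

(10.12) $(\Delta + k^2)v_\psi = 0$ on $\mathbb{R}^3 \setminus K$, $\quad r\Bigl(\dfrac{\partial v_\psi}{\partial r} - ik\, v_\psi\Bigr) \to 0$ as $r \to \infty$,
$\quad v_\psi = \psi(x)\Bigl(N(k)f - \dfrac{\partial f}{\partial \nu}\Bigr)$ on $\partial K$.

Here, $N(k)$ is the Neumann operator, defined by (7.55). In other words,

(10.13) $D_\psi B_K(k)f = B_K(k)\Bigl\{\psi(x)\Bigl(N(k)f - \dfrac{\partial f}{\partial \nu}\Bigr)\Bigr\}.$

The linearized problem is to find $\psi$.

Therefore, for a given smooth obstacle $K$, granted that the operators $B_K(k)$
and $N(k)$ have been constructed (e.g., by methods of §7), we can to some degree
reduce the linearized inverse problem for $\psi$ to the following linear inverse
problem:


Problem. Given (an approximation to) $w = B(k)g(x)$ on $|x| = R_1$ (and assuming
that $K \subset \{x : |x| < R_1\}$), find (an approximation to) $g$ on $\partial K$.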


As for finding $B_K(k)$ and $N(k)$ via an integral-equation method, we mention
that an integral equation of the form (7.18) is preferable to one of the form (7.12),
since it is very inconvenient to deal with the set of values of $k$ for which (7.12) is
not solvable. This point is made in many expositions on the subject, such as [Co].
Solving (7.18) leads to the formula (7.32) for $B_K(k)$.

We note that when we take $f = e^{-ikx\cdot\omega}$, the solution to the linearized inverse
problem is unique:


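Proposition 10.4. Given $K$ nonempty, smooth, and compact, such that $\mathbb{R}^3 \setminus K$ is
connected, define

(10.14) $L_K(k,\omega) : H^s(\partial K) \longrightarrow C^\infty(\mathbb{R}^3 \setminus K)$

by

(10.15) $L_K(k,\omega)\psi = D_\psi B_K(k)f, \quad f(x) = e^{-ikx\cdot\omega}.$

Then $L_K(k,\omega)$ is always injective.

Proof. By (10.13), our claim is that

(10.16) $B_K(k)\Bigl\{\psi(x)\Bigl(N(k)f - \dfrac{\partial f}{\partial \nu}\Bigr)\Bigr\} = 0$ on $\mathbb{R}^3 \setminus K \ \Longrightarrow\ \psi = 0$ on $\partial K$.

Since $B_K(k)g\big|_{\partial K} = g$, the hypothesis in (10.16) implies $\psi(x)\bigl(N(k)f - \partial_\nu f\bigr) = 0$
on $\partial K$, so it suffices to show that

(10.17) $N(k)f - \dfrac{\partial f}{\partial \nu}$ vanishes on no open subset $\mathcal{O}$ of $\partial K$

when $f(x) = e^{-ikx\cdot\omega}$. To see this, consider $w = B(k)f - e^{-ikx\cdot\omega}$, which satisfies

(10.18) $(\Delta + k^2)w = 0$ on $\mathbb{R}^3 \setminus K$, $\quad w = 0$ on $\partial K$.

If $N(k)f - \partial_\nu f = 0$ on $\mathcal{O}$, then

(10.19) $\dfrac{\partial w}{\partial \nu} = 0$ on $\mathcal{O}$.

But if $\mathcal{O}$ is a nonempty, open subset of $\partial K$, then (10.18)-(10.19) imply that $w$
is identically zero, by uniqueness in the Cauchy problem for $\Delta + k^2$. This is
impossible, so the proof is complete.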


Parallel to (10.13), we have

(10.20) $D_\psi a_K(-\omega, \theta, k) = A_K(k)\Bigl\{\psi(x)\Bigl(N(k)f - \dfrac{\partial f}{\partial \nu}\Bigr)\Bigr\}(\theta), \quad f = e^{-ikx\cdot\omega},$

where $A_K(k)$ is as in (3.33)-(3.34). Note that (10.2) extends readily to the identity

(10.21) $A_K(k)f = -i k^{-1}\, e^{-(1/2)\pi i (A - 1/2)}\, h_{A-1/2}(kr)^{-1}\, B_{K,r}(k)f.$

In view of the injectivity of the operator acting on $B_{K,r}(k)f$, on the right side of
(10.21), we see that, under the hypotheses of Proposition 10.4, we have

(10.22) $L_K(k,\omega) : H^s(\partial K) \longrightarrow C^\infty(S^2)$ injective,

for each $k \in \mathbb{R}$, $\omega \in S^2$, where

(10.23) $L_K(k,\omega)\psi = D_\psi A_K(k)f, \quad f = e^{-ikx\cdot\omega}.$


Exercises 


1. Fix $k \in (0,\infty)$. Show that a given obstacle $K$ is a ball centered at 0 if and only if

$a(\omega, \theta, k) = a(\mathcal{R}(\omega), \mathcal{R}(\theta), k),$

for every rotation $\mathcal{R} : S^2 \to S^2$. (Hint: For the “if” part, make use of Proposition 10.2
to compare $K$ and its image under a rotation.)

2. Suppose you are given that $K$ is a ball, but you are not given its radius or its center.
How can you determine these quantities from the scattering amplitude? How little
information on $a(\omega, \theta, k)$ will suffice?

3. The set $\mathcal{R}$ arising in the proof of Proposition 10.2 might not have smooth boundary, so
how do you know that $u = u_1|_{\mathcal{R}}$, which vanishes on $\partial\mathcal{R}$, belongs to $H^1_0(\mathcal{R})$?

4. Suppose $K$ is known to be contained in the unit cube $Q = \{x \in \mathbb{R}^3 : 0 \le x_j \le 1\}$.
Let $\omega \in S^2$, $k \in \mathbb{R}$ be fixed. Show that exact knowledge of $a(-\omega, \theta, k)$, for $\theta \in S^2$,
uniquely determines $K$, as long as

$|k| < \sqrt{6}\, \pi.$

Given $\omega_1, \omega_2 \in S^2$, such that $\omega_1, \omega_2, -\omega_1$, and $-\omega_2$ are distinct, show that
$a(-\omega_j, \theta, k)$, for $k$ fixed, $\theta \in S^2$, $j = 1, 2$, uniquely determines $K$, as long as

$|k| < 3\pi.$

Can you improve these results?
5. Give a detailed proof of Proposition 10.3.


11. Inverse problems II 
In this section we describe some of the methods used to determine an obstacle $K$
(approximately) when given a measurement of scattering data, and we deal with
some aspects of the “ill-posed” nature of such an inverse problem.


For simplicity, suppose you know that $B_1 \subset K \subset B_{R_0}$, where
$B_r = \{x \in \mathbb{R}^3 : |x| \le r\}$. Suppose you have a measurement of $a(-\omega, \theta, k)$ on $\theta \in S^2$, with
$k$ fixed and $\omega$ fixed. One strategy is to minimize

(11.1) $\Phi(f, K) = \|A_1(k)f - a(-\omega, \cdot, k)\|^2_{L^2(S^2)} + \|B_{1,K}(k)f + e^{-ik\omega\cdot x}\|^2_{L^2(\partial K)},$

with $f$ and $K$ varying over certain compact sets, determined by a priori
hypotheses on the scatterer. This is close to some methods of Angell and
Kleinman, Kirsch and Kress, as described at the end of [Co2].
Here we use the following notation: $A_r(k) = A_{B_r}(k)$, where $A_K(k)$ is as in
(10.20), that is,

(11.2) $A_K(k)f(\theta) = \lim_{r \to \infty} r\, e^{-ikr}\, B_K(k)f(r\theta).$

Also, if $K_1$ is contained in the interior of $K_2$, then $g = B_{K_1}(k)f\big|_{\partial K_2}$ defines a
bounded operator

(11.3) $B_{K_1,K_2}(k) : H^s(\partial K_1) \longrightarrow C^\infty(\partial K_2),$

and if either $K_1 = B_r$ or $K_2 = B_\rho$, we use the notation $B_{r,K_2}(k)$ or $B_{K_1,\rho}(k)$; if
both $K_1$ and $K_2$ are such balls, we use the notation $B_{r,\rho}(k)$.

More generally, we might have measurements of $a(-\omega_j, \theta, k)$ on $\theta \in S^2$, for
a sequence of directions $\omega_j$. Then one might take

(11.4) $\Phi = \sum_{j=1}^{N} \|A_1(k)f_j - a(-\omega_j, \cdot, k)\|^2_{L^2(S^2)} + \sum_{j=1}^{N} \|B_{1,K}(k)f_j + e^{-ik\omega_j\cdot x}\|^2_{L^2(\partial K)}$

and minimize over $(f_1, \dots, f_N; K)$. One might also consider weighted sums, and
perhaps stronger norms.
Note that

(11.5) $A_1(k)f_j = A_K(k)\, B_{1,K}(k)f_j.$

The feasibility of approximating the actual scattered wave by such a function
follows from the next lemma, provided $K$ is connected and has connected complement.


Lemma 11.1. If $K_j$ are compact sets in $\mathbb{R}^3$ (with connected complement) such
that $K_1 \subset \mathring{K}_2$, then for any $k \in \mathbb{R}$ the map $B_{K_1,K_2}(k)$ is injective. If also $K_2$ is
connected, this map has dense range.


Proof. If $u = B_{K_1}(k)f$ vanishes on $\partial K_2$, then $u$ restricted to $\mathbb{R}^3 \setminus K_2$ is an
outgoing solution to the basic scattering problem (1.1), with $K = K_2$, so by the
uniqueness of solutions to (1.1)-(1.3), we have $u = 0$ on $\mathbb{R}^3 \setminus K_2$. Then unique
continuation forces $u = 0$ on $\mathbb{R}^3 \setminus K_1$, so the injectivity of (11.3) is established.

As for the second claim, note that if $y \in \mathring{K}_1$, then

(11.6) $|x - y|^{-1} e^{ik|x-y|} = g_y(x) \in$ Range $B_{K_1}(k)$.

Thus if $f \in L^2(\partial K_2)$ is in the orthogonal complement of the range of $B_{K_1,K_2}(k)$,
we deduce that

(11.7) $F(x) = \displaystyle\int_{\partial K_2} f(y)\, g_x(y)\, dS(y)$

is zero for $x \in \mathring{K}_1$, hence for $x \in \mathring{K}_2$ (if $K_2$ is connected). Also material in §7
implies that $F$ is continuous across $\partial K_2$ and is an outgoing solution of (1.1) on
$\mathbb{R}^3 \setminus K_2$. The uniqueness of solutions to (1.1)-(1.3) forces $F = 0$ on $\mathbb{R}^3 \setminus K_2$.
Since, by (7.25), the jump of $\partial_\nu F$ across $\partial K_2$ is proportional to $f$, this implies
$f = 0$, proving denseness.


If $K$ is not known to be connected, one could use several spherical bodies as
domains of $f_j$ in (11.4), provided it is known that each connected component of
$K$ contains one of them, as can be seen by a variant of the proof of Lemma 11.1.

Instead of minimizing (11.4) over $(f_1, \dots, f_N; K)$, an alternative is first to
minimize the first term of (11.4), thus choosing $f_j$, within some compact set of
functions, and then to pick $K$ to minimize the second term, within some compact
set of obstacles.

An attack pursued in [Rog] and [MTW] takes a guess $K_\mu$ of $K$, solves
(approximately) a linearized inverse problem, given $K_\mu$, and applies an iteration,
provided by Newton’s method, to approximate $K$. See also [Kir].

If one has a measurement of the scattered wave $v(x, k\omega)$ on the sphere $|x| = r$,
say for $k$ fixed and $\omega = \omega_1, \dots, \omega_N$, instead of a measurement of $a(-\omega, \theta, k)$,
then parallel to (11.4) one might take

(11.8) $\Phi = \sum_{j=1}^{N} \|B_{1,r}(k)f_j - v(\cdot, k\omega_j)\|^2_{L^2(\{|x| = r\})} + \sum_{j=1}^{N} \|B_{1,K}(k)f_j + e^{-ik\omega_j\cdot x}\|^2_{L^2(\partial K)}$

and minimize over $(f_1, \dots, f_N; K)$. Alternatively, first minimize the first sum
over $f_j$ (in some compact set of functions) and then minimize the second sum
over $K$ (in some compact set of obstacles).

In fact, a number of approaches to the inverse problem, when measurements
of the scattering amplitude $a(-\omega, \theta, k)$ are given, start by first constructing an
approximation to $v(x, k\omega)$ on some sphere $|x| = r$, such that $K \subset B_r$, and then
proceed from there to tackle the problem of approximating $K$. Recall the relation
established in §10:

(11.9) $v(r\theta, k\omega) = ik\, e^{(1/2)\pi i (A - 1/2)}\, h_{A-1/2}(kr)\, a(-\omega, \theta, k),$

where the operator acts on functions of $\theta$. As noted there, the operator $h_{A-1/2}(kr)$
is a seriously unbounded operator on $L^2(S^2)$. In fact, this phenomenon is behind
the ill-posed nature of recovering the near field behavior $v(x, k\omega)$ from the far
field behavior defined by the scattering amplitude $a(-\omega, \theta, k)$. As this is one of the
simplest examples of an ill-posed problem, we will discuss the following problem.
Suppose you know that an obstacle $K$ is contained in a ball $B_{R_0}$. Fix $k \in \mathbb{R}$, $\omega \in S^2$.


Problem A. Given an approximation $b(-\omega, \theta, k)$ to the scattering amplitude,
with

(11.10) $\|a(-\omega, \cdot, k) - b(-\omega, \cdot, k)\|_{L^2(S^2)} \le \varepsilon,$

how well can you approximate the scattered wave $v(x, k\omega)$, for $x$ on the shell
$|x| = R_1$, given $R_1 > R_0$?


As we have said, what makes this problem difficult is the failure of the operator
$h_{A-1/2}(kr)$ appearing in (11.9) to be bounded, even from $C^\infty(S^2)$ to $\mathcal{D}'(S^2)$.
Indeed, for fixed $s \in (0,\infty)$, one has the asymptotic behavior, as $\nu \to +\infty$,

(11.11) $H^{(1)}_\nu(s) \sim -i \Bigl(\dfrac{2}{\pi\nu}\Bigr)^{1/2} \Bigl(\dfrac{2\nu}{es}\Bigr)^{\nu}$

(see the exercises) and hence

(11.12) $h_{\nu-1/2}(s) \sim -i\, (s\nu)^{-1/2} \Bigl(\dfrac{2\nu}{es}\Bigr)^{\nu}.$

Consequently, an attempt to approximate $v(x, k\omega)$ for $x = R_1\theta$ by

(11.13) $v_0(\theta) = ik\, e^{(1/2)\pi i (A - 1/2)}\, h_{A-1/2}(kR_1)\, b(-\omega, \theta, k)$

could lead to nonsense. We will describe a method below that is well behaved.
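
The severity of this growth is easy to see numerically. The sketch below assumes that $h_m$ is normalized like the standard spherical Hankel function $h^{(1)}_m$ (any other normalization differs only by a fixed constant) and uses a hypothetical value $kR_1 = 2$; it compares the computed size with the leading behavior $(s\nu)^{-1/2}(2\nu/es)^\nu$, $\nu = \ell + 1/2$, and shows what happens to a perturbation of size $10^{-8}$ sitting in the mode $\ell = 20$.

```python
import cmath
import math

def spherical_hankel1(l, z):
    """Spherical Hankel function h_l^(1)(z), by upward recurrence (z != 0)."""
    h_prev = -1j * cmath.exp(1j * z) / z               # h_0^(1)(z)
    if l == 0:
        return h_prev
    h_curr = -cmath.exp(1j * z) * (z + 1j) / z**2      # h_1^(1)(z)
    for n in range(1, l):
        h_prev, h_curr = h_curr, (2 * n + 1) / z * h_curr - h_prev
    return h_curr

def leading_size(l, s):
    """(s*nu)^(-1/2) * (2*nu/(e*s))^nu with nu = l + 1/2, cf. (11.12)."""
    nu = l + 0.5
    return (s * nu) ** -0.5 * (2.0 * nu / (math.e * s)) ** nu

s = 2.0                                   # hypothetical value of k * R_1
sizes = {l: abs(spherical_hankel1(l, s)) for l in (0, 10, 20, 30)}
ratio_20 = leading_size(20, s) / sizes[20]
noise = 1e-8                              # hypothetical error in the l = 20 mode of b
amplified = noise * sizes[20]

for l, size in sizes.items():
    print(l, size)
print("asymptotic/computed at l = 20:", ratio_20)
print("amplified noise:", amplified)
```

A measurement error far below any realistic noise floor is magnified into something that swamps the true field, which is why (11.13) cannot be used directly.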


But first we look further into the question of how well we can possibly hope to
approximate $v(x, k\omega)$ on the shell $|x| = R_1$, with the data given.


In fact, it is necessary to have some further a priori bound on $v$ to make
progress here. We will work under the hypothesis that a bound on $v(x, k\omega)$ is
known on the sphere $|x| = R_0$:

(11.14) $\|v(R_0\, \cdot\,, k\omega)\|_{L^2(S^2)} \le E.$

Now, if we are given that (11.10) and (11.14) are both true and we have
$b(-\omega, \theta, k)$ in hand (for $\omega$, $k$ fixed, $\theta \in S^2$), then we can consider the set $\mathcal{F}$
of functions $f(\theta)$ such that

(11.15) $\|f - b(-\omega, \cdot, k)\|_{L^2(S^2)} \le \varepsilon$

and such that

(11.16) $\|k\, h_{A-1/2}(kR_0) f\|_{L^2(S^2)} \le E,$

knowing that $\mathcal{F}$ is nonempty. We know that $a(\theta) = a(-\omega, \theta, k)$ belongs to $\mathcal{F}$,
and that is all we know about $a(-\omega, \theta, k)$, in the absence of further data. The
greatest accuracy of an approximation $v_1(\theta)$ to $v(R_1\theta, k\omega)$ that we can count on,
measured in the $L^2(S^2)$ norm, is

(11.17) $\|v_1(\theta) - v(R_1\theta, k\omega)\|_{L^2(S^2)} \le 2M(\varepsilon, E),$

where $M(\varepsilon, E)$ is defined as follows. Denote by

(11.18) $T_j : \mathcal{F} \longrightarrow L^2(S^2), \quad j = 0, 1,$

the maps

(11.19) $T_j f(\theta) = ik\, e^{(1/2)\pi i (A - 1/2)}\, h_{A-1/2}(kR_j)\, f(\theta).$

Then we set

(11.20) $2M(\varepsilon, E) = \sup\, \{\|T_1 f - T_1 g\|_{L^2(S^2)} : f, g \in \mathcal{F}\},$

that is,

(11.21) $M(\varepsilon, E) = \sup\, \{\|T_1 f\|_{L^2} : \|f\|_{L^2} \le \varepsilon \text{ and } \|T_0 f\|_{L^2} \le E\}.$


One way to obtain as accurate as possible an approximation to $v$ on $|x| = R_1$
would be to pick any $f \in \mathcal{F}$ and evaluate $T_1 f$. However, it might not be
straightforward to obtain elements of $\mathcal{F}$. We describe a method, from [Mr2] and
[MrV], which is effective in producing a “nearly best possible” approximation.

We formulate a more general problem. We have a linear equation

(11.22) $Sv = a,$

where $S$ is a bounded operator on a Hilbert space $H$, which is injective, but $S^{-1}$
is unbounded (with domain a proper linear subspace of $H$). Given an approximate
measurement $b$ of $a$, we want to find an approximation to the solution $v$. This is a
typical ill-posed linear problem. As a priori given information, we assume that

(11.23) $\|b - a\|_H \le \varepsilon, \quad \|T_0 a\|_H \le E.$

$T_0$ is an auxiliary operator. In the example above, $H = L^2(S^2)$, and $T_0$ and $T_1$
are given by (11.19). Generalizing (11.20)-(11.21), we have a basic measurement
of error:

(11.24) $M(\varepsilon, E) = \sup\, \{\|T_1 f\|_H : \|f\|_H \le \varepsilon \text{ and } \|T_0 f\|_H \le E\} = \tfrac{1}{2} \sup\, \{\|T_1 f - T_1 g\|_H : f, g \in \mathcal{F}\},$

where

(11.25) $\mathcal{F} = \{f \in H : \|f - b\|_H \le \varepsilon,\ \|T_0 f\|_H \le E\}.$

Now, if all one knows about $a$ in (11.22) is that it belongs to $\mathcal{F}$, then the greatest
accuracy of an approximation $v_1$ to the solution $v$ of (11.22) one can count on is

(11.26) $\|v_1 - v\|_H \le 2M(\varepsilon, E).$

This recaps the estimates in (11.14)-(11.21). Now we proceed. An approximation
method is called nearly best possible (up to a factor $\gamma$) if it yields a $v_1 \in H$
such that

(11.27) $\|v_1 - v\|_H \le 2\gamma M(\varepsilon, E).$

We now describe one nearly best possible method for approximating $v$, in cases
where $T_0$ is a self-adjoint operator, with discrete spectrum accumulating only
at $+\infty$. Then, pick an orthonormal basis $\{u_j : 1 \le j < \infty\}$ of $H$, consisting
of eigenvectors, such that

(11.28) $T_0 u_j = \alpha_j u_j, \quad \alpha_j \nearrow +\infty.$

When $T_0$ is given by (11.19), this holds as a consequence of (11.12). It is essential
that the $\alpha_j$ be monotonic, so the eigenvectors need to be ordered correctly. Now let

(11.29) $f_\ell = P_\ell b, \quad P_\ell w = \sum_{j=1}^{\ell} (w, u_j)\, u_j.$

Now let $N$ be the first $\ell$ such that

(11.30) $\|f_\ell - b\|_H \le 2\varepsilon.$

We then claim that

(11.31) $\|T_0 f_N\|_H \le 2E.$

This can be deduced from:

Lemma 11.2. If the set $\mathcal{F}$ defined by (11.25) is nonempty, and if $M + 1$ is the first
$j$ such that $\alpha_j > E/\varepsilon$, then

(11.32) $\|f_M - b\|_H \le 2\varepsilon$ and $\|T_0 f_M\|_H \le 2E.$


Proof of lemma. The key facts about $M$ are the following:

(11.33) $\|P_M g\| \le \varepsilon \Longrightarrow \|T_0 P_M g\| \le E, \qquad \|T_0 (1 - P_M) h\| \le E \Longrightarrow \|(1 - P_M) h\| \le \varepsilon.$

We are given that there exists $f$ such that

(11.34) $\|f - b\| \le \varepsilon$ and $\|T_0 f\| \le E.$

The first part of (11.34) implies $\|P_M f - f_M\| \le \varepsilon$, which via the first part of
(11.33) yields the second part of (11.32). The second part of (11.34) implies
$\|T_0 (1 - P_M) f\| \le E$, which by the second part of (11.33) gives $\|(1 - P_M) f\| \le \varepsilon$.
Since $\|f - b\| \le \varepsilon$, this yields the first part of (11.32).


Having the lemma, we see that $N \le M$, so $\|T_0 f_N\| \le \|T_0 f_M\|$, giving
(11.31). Then (11.30) and (11.31), together with (11.23), yield

(11.35) $\|f_N - a\|_H \le 3\varepsilon$ and $\|T_0 (f_N - a)\|_H \le 3E.$

We have established:


Proposition 11.3. Under the hypotheses (11.23), if we set $v_N = T_1 f_N$, where $N$
is the smallest $\ell$ such that (11.30) holds, we have

(11.36) $\|v_N - v\|_H \le 3M(\varepsilon, E).$


Hence this method of approximating $v$ is nearly best possible. Note that the
value of the estimate $E$ of (11.14) does not play an explicit role in the method
described above for producing the approximation $v_N$; it plays a role in estimating
the error $v_N - v$.
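
The truncation rule (11.29)-(11.30) can be sketched in a few lines for a toy diagonal model. Everything specific below is an assumption made for illustration: $H$ is replaced by $\mathbb{R}^{30}$ with the standard basis as eigenvectors, $T_0$ has eigenvalues $\alpha_j = 1.5^j$, $T_1$ is taken to be $T_0^{1/2}$, and the data and noise are made up. In this toy model the Cauchy inequality gives $\|T_1 g\|^2 \le \|g\|\, \|T_0 g\|$, so $M(\varepsilon, E) \le \sqrt{\varepsilon E}$ and (11.36) predicts an error of at most $3\sqrt{\varepsilon E}$.

```python
import math

def norm(v):
    return math.sqrt(sum(x * x for x in v))

def truncate(b, ell):
    """P_ell b: keep the first ell coefficients in the eigenbasis, cf. (11.29)."""
    return [x if j < ell else 0.0 for j, x in enumerate(b)]

def discrepancy_truncation(b, eps):
    """Return (N, f_N), N being the first ell with ||P_ell b - b|| <= 2*eps (11.30)."""
    for ell in range(len(b) + 1):
        f = truncate(b, ell)
        if norm([x - y for x, y in zip(f, b)]) <= 2 * eps:
            return ell, f
    raise RuntimeError("unreachable: ell = len(b) leaves zero residual")

# Toy data (all hypothetical).
n = 30
alpha = [1.5 ** j for j in range(1, n + 1)]            # eigenvalues of T_0, increasing
beta = [math.sqrt(x) for x in alpha]                   # eigenvalues of the toy T_1
a = [1.0 / (j * j * alpha[j - 1]) for j in range(1, n + 1)]          # exact data
noise = [1e-3 * (-1) ** j / math.sqrt(n) for j in range(1, n + 1)]
b = [x + e for x, e in zip(a, noise)]                  # measured data

eps = norm(noise)                                      # so that ||b - a|| <= eps
E = norm([al * x for al, x in zip(alpha, a)])          # so that ||T_0 a|| <= E

N, f_N = discrepancy_truncation(b, eps)
residual = norm([x - y for x, y in zip(f_N, b)])
T0_fN = norm([al * x for al, x in zip(alpha, f_N)])
error = norm([be * (x - y) for be, x, y in zip(beta, f_N, a)])

print("N =", N)
print("||T_0 f_N|| =", T0_fN, " 2E =", 2 * E)
print("||T_1 (f_N - a)|| =", error, " 3*sqrt(eps*E) =", 3 * math.sqrt(eps * E))
```

Note that the selection of $N$ uses only $b$ and $\varepsilon$; the bound $E$ enters only when the outcome is compared with the predicted error, in line with the remark following Proposition 11.3.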

The method described above provides a technique for solving a certain class
of ill-posed problems. Other related problems involve the analytic continuation of
functions and solving backwards heat equations. Further discussions of this and
other techniques can be found in papers of K. Miller [Mr1], [Mr2], and references
given there.

We now turn to the task of estimating $M(\varepsilon, E)$, for our specific problem,
defined by (11.18)-(11.21). Thus, with $R_0 < R_1$, we want to estimate
$\|h_{A-1/2}(kR_1) f\|_{L^2}$, given that $\|f\|_{L^2} \le \varepsilon$ and $\|h_{A-1/2}(kR_0) f\|_{L^2} \le E$. If
we also assume that $k$ lies in a bounded interval, this is basically equivalent to
estimating $\|A^{-1/2} e^{-\beta A} A^A f\|_{L^2}$, given that

$\|f\|_{L^2} \le \varepsilon, \quad \|A^{-1/2} e^{-\alpha A} A^A f\|_{L^2} \le E,$

where $e^{-\alpha} = 2/eR_0$ and $e^{-\beta} = 2/eR_1$, so $\alpha < \beta$. We can get a hold on this
using the inequality

(11.37) pp Eee ay tie Oh) aaa 


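valid for $\nu \ge 1/2$, $0 < x < \infty$, to write

(11.38) $\|T_1 f\|_{L^2} \le C \inf_{x \in \mathbb{R}^+} \bigl(e^{-\gamma x} E + x^x \varepsilon\bigr),$

where $\gamma = \beta - \alpha$, given $\|f\| \le \varepsilon$, $\|T_0 f\| \le E$. While picking $x$ to minimize the
last quantity is not easy, we can obtain a reasonable estimate by picking

(11.39) $x = \dfrac{\log E/\varepsilon}{\log\log E/\varepsilon},$

in which case

$e^{-\gamma x} = \Bigl(\dfrac{\varepsilon}{E}\Bigr)^{\gamma\alpha(E/\varepsilon)}, \quad x^x \varepsilon = E \Bigl(\dfrac{\varepsilon}{E}\Bigr)^{\beta(E/\varepsilon)},$

with $\alpha(E/\varepsilon) = 1/(\log\log E/\varepsilon)$ and $\beta(E/\varepsilon) = (\log\log\log E/\varepsilon)/(\log\log E/\varepsilon)$.


Consequently,

(11.40) $M(\varepsilon, E) \le C E \Bigl[\Bigl(\dfrac{\varepsilon}{E}\Bigr)^{\gamma\alpha(E/\varepsilon)} + \Bigl(\dfrac{\varepsilon}{E}\Bigr)^{\beta(E/\varepsilon)}\Bigr].$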


As for the exponents that appear in (11.40), note the following values (to three
digits):


$\varepsilon/E$      $\alpha(E/\varepsilon)$      $\beta(E/\varepsilon)$

$10^{-2}$      .655      .277
$10^{-3}$      .517      .341
$10^{-4}$      .450      .359
$10^{-5}$      .409      .366
$10^{-6}$      .381      .368
$10^{-7}$      .360      .368


The close agreement of the last two figures in the right column is due to the fact
that $f(y) = (\log\log\log y)/(\log\log y)$ achieves its maximum value of $1/e$ at
$y = e^{e^e} \approx 3.81 \times 10^6$ and is very slowly varying in this region. As for the close
agreement of the two figures corresponding to $\varepsilon = 10^{-7}$, note that $\log\log\log e^{e^e} = 1$.
An estimate similar to (11.40) is also given in [Isa].
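
The tabulated exponents follow directly from the definitions $\alpha(y) = 1/\log\log y$ and $\beta(y) = (\log\log\log y)/(\log\log y)$ with $y = E/\varepsilon$, and can be recomputed with a few lines. The function names below are illustrative.

```python
import math

def alpha_exponent(y):
    """alpha(y) = 1 / log log y, with y = E/eps."""
    return 1.0 / math.log(math.log(y))

def beta_exponent(y):
    """beta(y) = (log log log y) / (log log y), with y = E/eps."""
    loglog = math.log(math.log(y))
    return math.log(loglog) / loglog

for p in range(2, 8):
    y = 10.0 ** p
    print(f"eps/E = 1e-{p}:  alpha = {alpha_exponent(y):.3f}  beta = {beta_exponent(y):.3f}")

y_max = math.exp(math.exp(math.e))        # e^(e^e), where beta attains its maximum
print(y_max, beta_exponent(y_max), 1 / math.e)
```

Both exponents are well below 1, and $\beta$ never exceeds $1/e$, so the bound (11.40) decays only slowly as $\varepsilon/E$ shrinks.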

Even though the analysis in (11.13)-(11.39) does not directly deal with the
problem of describing $\partial K$ given an approximation $b(\omega, \theta, k)$ to the scattering
amplitude $a(\omega, \theta, k)$, to some degree it reduces this problem to that of describing
$\partial K$, given the solution $u = B(k)f$ to the scattering problem (1.1)-(1.3) (i.e.,
to (7.1)), with $u(x)$ evaluated near $|x| = R_1$, for a certain class of boundary
data, namely $f(x) = -e^{-ik\omega\cdot x}$ (where $k$ and $\omega$ belong to a specified subset
of $\mathbb{R}$ and $S^2$, respectively). One assumes it given that $K \subset \{x : |x| < R_0\}$,
where $R_0 < R_1$. This reduction is an intermediate step in many studies of inverse
problems. Thus Problem A is complemented by:


Problem B. Approximate $v = B(k)f$ on $|x| = R_0$, given (an approximation to)
$v$ on $|x| = R_1$ and having some a priori estimate of $v$ on $|x| = R_0$, but not on a
smaller sphere.


Rescaling, we can consider the case $R_0 = 1$, $R_1 = R > 1$. By (10.4), we have

(11.41) $g = v(\theta)$ and $w = v(R\theta) \ \Longrightarrow\ g = \dfrac{h_{A-1/2}(k)}{h_{A-1/2}(kR)}\, w = C_R(k) w,$

where the last identity is the definition of the unbounded operator $C_R(k)$ on
$L^2(S^2)$. In view of (11.12), we have, for fixed $k \in (0,\infty)$, $R > 1$,

(11.42) $\dfrac{h_{\nu-1/2}(k)}{h_{\nu-1/2}(kR)} \sim R^{1/2} e^{\gamma\nu}, \quad \nu \to \infty,$

where $\gamma = \log R > 0$.

Parallel to (11.14)-(11.21), we consider the problem of estimating $C_R(k)w$ in
$L^2(S^2)$, given a small bound on $\|w\|_{L^2(S^2)}$ (estimate on observational error) and
an a priori bound on $C_R(k)w$ in $H^\ell(S^2)$, for some $\ell > 0$. That is, we want to
estimate

(11.43) $M(\varepsilon, E) = \sup\, \{\|C_R(k)w\|_{L^2} : \|w\|_{L^2} \le \varepsilon,\ \|C_R(k)w\|_{H^\ell} \le E\}.$


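Parallel to (11.37)-(11.38), we can attack this by writing

(11.44) $e^{\gamma\nu} \le (\nu x^{-1})^{\ell} e^{\gamma\nu} + e^{\gamma x},$

valid for $\nu, x \in (0,\infty)$. Thus, if $\|g\|_{H^\ell} = \|A^\ell g\|_{L^2}$, we have

(11.45) $\|C_R(k)w\|_{L^2} \le C_k \inf_x \bigl(x^{-\ell} E + e^{\gamma x} \varepsilon\bigr).$

We get a decent upper bound by setting $x = (1/2\gamma) \log(\ell E/\varepsilon\gamma)$. This yields

(11.46) $M(\varepsilon, E) \le C_k (2\gamma)^{\ell} E \Bigl(\log \dfrac{\ell E}{\gamma\varepsilon}\Bigr)^{-\ell} + C_k \Bigl(\dfrac{\ell E \varepsilon}{\gamma}\Bigr)^{1/2}.$

This is bad news; $\varepsilon$ would have to be terribly tiny for $M(\varepsilon, E)$ to be small.
Fortunately, this is not the end of the story.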
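As a preliminary to deriving a more satisfactory estimate, we produce a variant
of the “bad” estimate (11.46). Fix $\psi \in C_0^\infty(\mathbb{R})$, supported on $[-1, 1]$, such that
$\psi(0) = 1$. Instead of (11.43), we estimate

(11.47) $M_\delta(\varepsilon, E) = \sup\, \{\|\psi(\delta A) C_R(k) w\|_{L^2} : \|w\|_{L^2} \le \varepsilon,\ \|C_R(k)w\|_{L^2} \le E\}.$

We proceed via

(11.48) $\psi(\delta\nu) e^{\gamma\nu} \le e^{-\gamma x} \bigl[\psi(\delta\nu) e^{\gamma\nu}\bigr] e^{\gamma\nu} + e^{\gamma x} \psi(\delta\nu),$

to get

(11.49) $\|\psi(\delta A) e^{\gamma A} w\|_{L^2} \le \inf_x \bigl[C(\gamma, \delta)\, e^{-\gamma x} E + e^{\gamma x} \varepsilon\bigr],$

where

(11.50) $C(\gamma, \delta) = \sup_\nu \psi(\delta\nu) e^{\gamma\nu} \le e^{\gamma/\delta} = R^{1/\delta}.$

Using (11.42) again, we have the estimate

(11.51) $M_\delta(\varepsilon, E) \le C_k R^{1/2\delta} \sqrt{\varepsilon E}.$

Now we do want to be able to take $\delta$ small, to make $\psi(\delta A) f$ close to $f$, but
$R^{1/2\delta} = e^{\gamma/2\delta}$ blows up very rapidly as $\delta \searrow 0$, so this gives no real improvement
over (11.46). Compare (11.51) with the estimate

(11.52) $\|\psi(\delta A) C_R(k) w\|_{L^2} \le C_k R^{1/\delta} \varepsilon,$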

when $\|w\|_{L^2} \le \varepsilon$, involving no use of the a priori estimate $\|C_R(k)w\|_{L^2} \le E$.
We now show that a different technique yields a useful bound on the quantity
$M_\delta(\varepsilon, E)$, when $\delta$ lies in the range $\delta > 1/k$.


Proposition 11.4. Let $R > 1$ and $a > 1$ be fixed. Then there is an estimate

(11.53) $\|\psi(\delta A) C_R(k) w\|_{L^2(S^2)} \le C \|w\|_{L^2(S^2)}, \quad$ for $\dfrac{a}{k} \le \delta.$

In particular, $C$ is independent of $k$.


Proof. Since $\psi(\delta A)$ and $C_R(k)$ commute, it suffices to show that

(11.54) $\|C_R(k) w\|_{L^2} \le C \|w\|_{L^2}, \quad$ for $w \in$ Range $\chi(\delta A)$,

given $a/k \le \delta$, where $\chi(\lambda)$ is the characteristic function of $[0, 1]$. Thus

(11.55) $0 \le A \le (k a^{-1}) I$ on Range $\chi(\delta A)$.

Equivalently, we claim that an outgoing solution $u(r, \omega)$ to the reduced wave
equation $(\Delta + k^2)u = 0$ satisfies

(11.56) $\|u(1, \cdot)\|_{L^2(S^2)} \le C \|u(R, \cdot)\|_{L^2(S^2)}, \quad$ for $u(R, \cdot) \in$ Range $\chi(\delta A)$,

given $a/k \le \delta$.
Now $u$ satisfies the equation

(11.57) $\dfrac{\partial^2 u}{\partial r^2} + \dfrac{2}{r} \dfrac{\partial u}{\partial r} + (k^2 - r^{-2} L) u = 0, \quad L = A^2 - \dfrac{1}{4} = -\Delta_S.$

We can replace $u(r, \omega)$ by $v(r, \omega) = r u(r, \omega)$, satisfying

(11.58) $\dfrac{\partial^2 v}{\partial r^2} + (k^2 - r^{-2} L) v = 0,$

and it suffices to establish

(11.59) $\|v(1, \cdot)\|_{L^2(S^2)} \le C \|v(R, \cdot)\|_{L^2(S^2)},$

given $v(R, \cdot) \in$ Range $\chi(\delta A)$, and assuming that $v = ru$, $u$ an outgoing solution
to (11.57); let us denote by $V_{k\delta}$ the vector space of such functions $v$.
It will be convenient to use a family of norms, depending on $r$ and $k$, given by

(11.60) $N_{kr}(v)^2 = \bigl((1 - (kr)^{-2} L) v, v\bigr)_{L^2(S^2)} + k^{-2} \Bigl\|\dfrac{\partial v}{\partial r}\Bigr\|^2_{L^2(S^2)},$

where $v = v(r, \cdot) \in V_{k\delta}$. Note that $\partial v/\partial r = [N_r(k) + r^{-1}] v(r, \cdot)$, where $N_r(k)$
is the Neumann operator (7.55), for the obstacle $\{|x| \le r\}$. By (9.56), extended
to treat balls of radius $r \in [1, R]$,

(11.61) $\|N_r(k) f\|^2_{L^2(S^2)} \le C \|f\|^2_{H^1(S^2)} + C k^2 \|f\|^2_{L^2(S^2)} \le C k^2 \bigl[(k^{-2} L f, f)_{L^2} + \|f\|^2_{L^2}\bigr].$

Now, by (11.55),

(11.62) $0 \le (kr)^{-2} L \le a^{-2} I$ on Range $\chi(\delta A)$,

given $a/k \le \delta$ and $r \ge 1$. Consequently, if $a > 1$, we have constants $C_j \in (0, \infty)$,
independent of $k$, such that

(11.63) $C_0 \|v(r)\|^2_{L^2(S^2)} \le N_{kr}(v(r))^2 \le C_1 \|v(r)\|^2_{L^2(S^2)},$

for all $v \in V_{k\delta}$.

We now show that, for $v \in V_{k\delta}$, $N_{kr}(v(r))^2 = E(r)$ is a monotonically
increasing function of $r \in [1, R]$; this will establish the estimate (11.59) and
hence complete the proof of Proposition 11.4. To see this, write

(11.64) $\dfrac{dE}{dr} = 2\, \mathrm{Re} \Bigl((1 - (kr)^{-2} L) v, \dfrac{\partial v}{\partial r}\Bigr) + \dfrac{2}{k^2 r^3} (Lv, v) + \dfrac{2}{k^2}\, \mathrm{Re} \Bigl(\dfrac{\partial^2 v}{\partial r^2}, \dfrac{\partial v}{\partial r}\Bigr),$

and use (11.58) to replace $k^{-2}\, \partial^2 v/\partial r^2$ by $-(1 - (kr)^{-2} L) v$. We obtain

(11.65) $\dfrac{dE}{dr} = \dfrac{2}{r} \bigl((kr)^{-2} L v, v\bigr) \ge 0,$

and the proof is complete.


We can place the analysis in (11.60)-(11.65) in the following more general context. Suppose

(11.66)  $\dfrac{\partial^2 v}{\partial r^2} + A(r)v = 0,$

where each $A(r)$ is positive-definite, all having the same domain. If we set

(11.67)  $Q_r(v) = (A(r)v, v) + \|\partial_r v\|^2,$

then

(11.68)  $\dfrac{d}{dr}Q_r(v) = 2\,\text{Re}(A(r)v, \partial_r v) + 2\,\text{Re}(\partial_r^2 v, \partial_r v) + (A'(r)v, v) = (A'(r)v, v).$

If $A'(r)$ can be bounded by $A(r)$, then we have an estimate

(11.69)  $\Big|\dfrac{d}{dr}Q_r(v)\Big| \le C\,Q_r(v).$

Of course, if $A'(r)$ is positive-semidefinite, we have monotonicity of $Q_r(v)$, as in (11.65).
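The monotonicity in (11.68) can be checked numerically in the simplest scalar case. The following is a minimal sketch, not part of the argument above: it assumes a single spherical-harmonic mode, so that $L$ acts as a number `ell`, takes $A(r) = k^2 - \ell/r^2$ as in (11.58), and integrates with a hand-written classical Runge-Kutta step. The function name, the initial data, and the parameter values are all illustrative assumptions.

```python
def energy_along_solution(k=10.0, ell=6.0, r0=1.0, r1=3.0, steps=4000):
    """Integrate v'' + A(r) v = 0 with A(r) = k^2 - ell / r^2 by classical RK4
    and record Q_r(v) = A(r) v^2 + (v')^2 after every step (real-valued v)."""
    def A(r):
        return k * k - ell / (r * r)

    def rhs(r, v, p):
        # First-order system: v' = p, p' = -A(r) v.
        return p, -A(r) * v

    h = (r1 - r0) / steps
    r, v, p = r0, 1.0, 0.0  # arbitrary real initial data (an assumption)
    energies = [A(r) * v * v + p * p]
    for _ in range(steps):
        k1v, k1p = rhs(r, v, p)
        k2v, k2p = rhs(r + h / 2, v + h / 2 * k1v, p + h / 2 * k1p)
        k3v, k3p = rhs(r + h / 2, v + h / 2 * k2v, p + h / 2 * k2p)
        k4v, k4p = rhs(r + h, v + h * k3v, p + h * k3p)
        v += h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
        p += h / 6 * (k1p + 2 * k2p + 2 * k3p + k4p)
        r += h
        energies.append(A(r) * v * v + p * p)
    return energies


energies = energy_along_solution()
# Since A'(r) = 2 ell / r^3 >= 0, Q_r should not decrease, up to integration error.
smallest_step = min(b - a for a, b in zip(energies, energies[1:]))
print(energies[0], energies[-1], smallest_step)
```

Here $A'(r) = 2\ell/r^3 \ge 0$, so the recorded values of $Q_r(v)$ should be nondecreasing apart from discretization error; choosing $k^2 > \ell$ keeps $A(r)$ positive on $[1,3]$, matching the role of the hypothesis $a > 1$.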

This result indicates that, using signals of wavelength $\lambda = 1/k$, one can expect to “regularize” inverse problems, to perceive details in an unknown obstacle on a length scale $\sim \lambda$. Further analytical estimates with the goal of making this precise are given in [T4]. This idea is very much consistent with intuition and experience.
For example, a well-known statement on the limitations of an optical microscope 
is that if it has perfect optics, one can use it to examine microscopic detail on 
a length scale approximately equal to, but not smaller than, the wavelength of 
visible light. 

We emphasize that this limitation applies to discerning detail on an obstacle 
whose diameter is much larger than 1/k. If one has a single obstacle whose 
diameter is ~1/k, then one is said to be dealing with an inverse problem in 
the “resonance region,” and, given some a priori hypotheses on the obstacle, 
one can hope to make out some details of its structure to a higher precision than 
one wavelength. This sort of problem is discussed in a number of papers on inverse problems, such as [ACK], [AKR], [JM], and [MTW].

The interest in inverse problems extends beyond the setting of scattering 
theory. Important inverse problems arise in the setting of bounded domains. For 
more on such problems, we mention [AK2LT] and [LTU], and references given in 
these papers. 


Exercises 


1. Using the power series for $J_\nu(z)$ given as (6.19) of Chap. 3, show that, for fixed $s \in (0,\infty)$, as $\nu \to +\infty$,

   $J_\nu(s) \sim (2\pi\nu)^{-1/2}\Big(\dfrac{es}{2\nu}\Big)^{\nu}.$

   Modify this argument to establish (11.11).
2. Generalize Proposition 11.3 to the case where different Hilbert spaces (or even different 
Banach spaces) H; are involved, that is, S : H, — Hz andT; : V; — H;, 7 = 0,1, 
where V; C Hoa, af oNnNV. 


12. Scattering by rough obstacles 


In the previous sections we have restricted attention to scattering of waves by compact obstacles in $\mathbb{R}^3$ with smooth boundary. Here we extend some of this material to the case of compact $K \subset \mathbb{R}^3$, which is not assumed to have smooth boundary. We do assume that $\Omega = \mathbb{R}^3 \setminus K$ is connected.

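The first order of business is to construct the solution (in a suitable sense) to the problem

(12.1)  $(\Delta + k^2)v = 0$ on $\Omega = \mathbb{R}^3 \setminus K$, $\quad v = f$ on $\partial K$,

satisfying the radiation condition

(12.2)  $|r\,v(x)| \le C, \quad r\Big(\dfrac{\partial v}{\partial r} - ikv\Big) \to 0$, as $r \to \infty$,

given $k > 0$. Our analysis will use a method from §5 of Chap. 5; we take compact $K_j$ with smooth boundary such that

(12.3)  $K_1 \supset\supset K_2 \supset\supset \cdots \supset\supset K_j \searrow K.$

Set $\Omega_j = \mathbb{R}^3 \setminus K_j$, so $\Omega_j \nearrow \Omega$.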

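Let us assume $f|_{\partial K}$ is the restriction to $\partial K$ of some $f \in C_0^2(\mathbb{R}^3)$. We will extend the method of proof of Theorem 1.3. For $\varepsilon > 0$, $j \in \mathbb{Z}^+$, let $w_{\varepsilon j} \in L^2(\Omega_j)$ be the solution to

(12.4)  $(\Delta + (k + i\varepsilon)^2)w_{\varepsilon j} = h_\varepsilon$ on $\Omega_j$, $\quad w_{\varepsilon j}|_{\partial K_j} = 0,$

where

(12.5)  $h_\varepsilon = -(\Delta + (k + i\varepsilon)^2)f$, on $\Omega$.

Set $w_{\varepsilon j} = 0$ on $K_j$. Then set $v_{\varepsilon j} = f + w_{\varepsilon j}$, so

(12.6)  $(\Delta + (k + i\varepsilon)^2)v_{\varepsilon j} = 0$ on $\Omega_j$, $\quad v_{\varepsilon j}|_{\partial K_j} = f.$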


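By methods of Chap. 5, §5, for fixed $\varepsilon > 0$, as $j \to \infty$, $w_{\varepsilon j} \to w_\varepsilon$ in $H_0^1(\Omega)$, the domain of $(-\Delta)^{1/2}$, when $\Delta$ is the self-adjoint operator on $L^2(\Omega)$ with the Dirichlet boundary condition on $\partial\Omega = \partial K$, and

(12.7)  $w_\varepsilon = (\Delta + (k + i\varepsilon)^2)^{-1}h_\varepsilon \in H_0^1(\Omega).$

We have $w_\varepsilon|_{\partial K} = 0$ in a generalized sense. It follows that $v_{\varepsilon j} \to v_\varepsilon = w_\varepsilon + f$ in $H^1(\Omega)$, and

(12.8)  $(\Delta + (k + i\varepsilon)^2)v_\varepsilon = 0$ on $\Omega$, $\quad v_\varepsilon|_{\partial K} = f,$

the boundary condition holding in a generalized sense. Furthermore, $v_\varepsilon$ is the unique solution to (12.8) with the property that $v_\varepsilon - f \in H_0^1(\Omega)$.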

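If $f \in C_0^2(\mathbb{R}^3)$ is supported in $B_A = \{|x| \le A\}$, then elliptic estimates imply $w_{\varepsilon j} \to w_\varepsilon$ in $C^\infty(\mathbb{R}^3 \setminus B_A)$, hence $v_{\varepsilon j} \to v_\varepsilon$ in $C^\infty(\mathbb{R}^3 \setminus B_A)$. It follows that, for any fixed $A_1 > A$, if $\Sigma = \{|x| = A_1\}$, then

(12.9)  $v_\varepsilon(x) = \displaystyle\int_\Sigma \Big[v_\varepsilon(y)\dfrac{\partial g_\varepsilon}{\partial \nu_y} - g_\varepsilon\dfrac{\partial v_\varepsilon}{\partial \nu}\Big]\,dS(y), \quad |x| > A_1,$

where $g_\varepsilon = g(x, y, k + i\varepsilon)$ is given by (1.6). Compare with the identity (1.24).
We now state a result parallel to Theorem 1.3.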


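Theorem 12.1. For $v_\varepsilon$ constructed above, we have, as $\varepsilon \searrow 0$, a unique limit

(12.10)  $v_\varepsilon \to v = B(k)f,$

satisfying (12.1)-(12.2). Convergence occurs in the norm topology of the space $L^2(\Omega, \langle x\rangle^{-1-\delta}\,dx)$, for any $\delta > 0$, as well as weakly in $H^1(\Omega \cap \{|x| < R\})$, for any $R < \infty$, and the limit $v$ satisfies the identity

(12.11)  $v(x) = \displaystyle\int_\Sigma \Big[v(y)\dfrac{\partial g}{\partial \nu_y} - g\dfrac{\partial v}{\partial \nu}\Big]\,dS(y), \quad |x| > A_1.$

More generally, we can replace the boundary condition on $v_\varepsilon$ in (12.8) by $v_\varepsilon = f_\varepsilon$ on $\partial K$, with $f_\varepsilon \to f$ in $C_0^2(\mathbb{R}^3)$.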


As in §1, we begin the proof by establishing a uniqueness result.

Lemma 12.2. Given $k > 0$, if $v$ satisfies (12.1)-(12.2) with $f = 0$, then $v = 0$.

Proof. Here, to say $v = 0$ on $\partial K$ means $\chi v \in H_0^1(\Omega)$, for some $\chi \in C_0^\infty(\mathbb{R}^3)$, chosen so $\chi(x) = 1$ for $|x| \le A$. The proof of Proposition 1.1 works here, with a minor change in the identity (1.8). Namely, write the equation

(12.12)  $(\Delta + k^2)v = 0$ on $B_R \setminus K$, $\quad v|_{\partial K} = 0, \quad \dfrac{\partial v}{\partial r}\Big|_{S_R} = \psi$

in the weak form

(12.13)  $\displaystyle\int_{B_R \setminus K} \big[-\langle dv, d\varphi\rangle + k^2 v\varphi\big]\,dx = -\int_{S_R} \psi\varphi\,dS,$

for all $\varphi \in H^1(B_R \setminus K)$ such that $\varphi|_{\partial K} = 0$ and $\varphi$ is smooth near $S_R$. This applies to $\varphi = \bar v$, yielding

(12.14)  $\displaystyle\int_{B_R \setminus K} \big[-\langle dv, d\bar v\rangle + k^2 v\bar v\big]\,dx = -\int_{S_R} \bar v\,v_r\,dS.$

Also, we can interchange the roles of $v$ and $\bar v$ and subtract the resulting identity from (12.14), obtaining

(12.15)  $\displaystyle\int_{S_R} (v\bar v_r - \bar v v_r)\,dS = 0,$

as in (1.8). The rest of the proof proceeds exactly as in the proof of Proposition 1.1.


We continue with the proof of Theorem 12.1. Pick $R > A_1$, and set $O_R = \Omega \cap \{|x| < R\}$. Parallel to Lemma 1.4, we have

Lemma 12.3. Assume $v_\varepsilon|_{O_R}$ is bounded in $L^2(O_R)$ as $\varepsilon \searrow 0$. Then the conclusions of Theorem 12.1 hold.


Proof. Fix $S \in (A_1, R)$. Elliptic estimates imply that if $\|v_\varepsilon\|_{L^2(O_R)}$ is bounded, then

(12.16)  $\|v_\varepsilon\|_{H^1(O_S)} \le C.$

Passing to a subsequence, which we continue to denote by $v_\varepsilon$, we have

(12.17)  $v_\varepsilon \to v$ weakly in $H^1(O_S)$.

Also, $w_\varepsilon = v_\varepsilon - f_\varepsilon \to w = v - f$, and for $\chi \in C_0^\infty(|x| < S)$ such that $\chi = 1$ on a neighborhood of $K$, we have $\chi w_\varepsilon \to \chi w$ weakly in $H_0^1(O_S)$. Thus $v|_{\partial K} = f$, in our current sense.

Since $(\Delta + (k + i\varepsilon)^2)v_\varepsilon = 0$ on $O_R$, elliptic estimates imply that if $v_\varepsilon|_{O_R}$ is bounded in $L^2(O_R)$, then $v_\varepsilon$ is bounded in $C^\infty(A < |x| < R)$. Thus we obtain (12.11) from (12.9), and (12.11) implies the radiation condition (12.2) and also the convergence $v_\varepsilon \to v$ in $L^2(\Omega, \langle x\rangle^{-1-\delta}\,dx)$.


So far we have convergence for subsequences, but by Lemma 12.2 this limit is 
unique, so Lemma 12.3 is proved. The proof of Theorem 12.1 is completed by: 


Lemma 12.4. The hypotheses of Theorem 12.1 imply that the family $\{v_\varepsilon\}$ is bounded in the space $L^2(\Omega, \langle x\rangle^{-1-\delta}\,dx)$, for any $\delta > 0$.

The proof is the same as that of Lemma 1.5.
The fact that, for each $j \in \mathbb{Z}^+$, $v_{\varepsilon j}$ converges as $\varepsilon \to 0$ to a limit $v_j$ solving the scattering problem

(12.18)  $(\Delta + k^2)v_j = 0$ on $\Omega_j$, $\quad v_j = f$ on $\partial K_j$,

plus the radiation condition, is a consequence of Theorem 1.3. The following approximation result is useful. Extend $v_j$ to be equal to $f$ on $\Omega \setminus \Omega_j$.


Proposition 12.5. For any $R \in (A, \infty)$, $\delta > 0$, we have

(12.19)  $v_j \to v$ in $C^\infty(\Omega) \cap L^2(\Omega, \langle x\rangle^{-1-\delta}\,dx)$.

More generally, we can replace the boundary condition by $v_j = f_j$ on $\partial K_j$, where $f_j \to f$ in $C_0^2(\mathbb{R}^3)$. Furthermore,

(12.20)  $v_j \to v$ in $H^1(O_R)$, in norm.


Proof. To establish (12.19), an argument parallel to that used for Lemmas 1.4 and 12.3 shows that it suffices to demonstrate that $\{v_j\}$ is bounded in the space $L^2(O_R)$, and then an argument parallel to that used for Lemmas 1.5 and 12.4 shows that indeed $\{v_j\}$ is bounded in $L^2(\Omega, \langle x\rangle^{-1-\delta}\,dx)$.

Arguments such as those used to prove Theorems 1.3 and 12.1 also show that $v_j \to v$ weakly in $H^1(O_R)$. To get the norm convergence stated in (12.20), note that, parallel to (12.14),

(12.21)  $\displaystyle\int_{O_R} \big[-\langle dv_j, d\bar v_j\rangle + k^2|v_j|^2\big]\,dx = -\int_{S_R} \bar v_j(\partial_r v_j)\,dS.$

Since $v_j \to v$ in $L^2(O_R)$ and $v_j \to v$ in $C^\infty$ on a neighborhood of $S_R = \{|x| = R\}$, we have $\int_{O_R} k^2|v_j|^2\,dx \to \int_{O_R} k^2|v|^2\,dx$ and $\int_{S_R} \bar v_j(\partial_r v_j)\,dS \to \int_{S_R} \bar v(\partial_r v)\,dS$. Consequently,

$\displaystyle\int_{O_R} |dv_j|^2\,dx \longrightarrow \int_{O_R} |dv|^2\,dx,$

so

(12.22)  $\|v_j\|_{H^1(O_R)} \longrightarrow \|v\|_{H^1(O_R)}.$

This, together with weak convergence, yields (12.20).


It is useful to note that if we extend $v_j$ to be $f_j$ on $K_j$ and extend $v$ to be $f$ on $K$, then (12.20) can be sharpened to

(12.23)  $v_j \longrightarrow v$ in $H^1(B_R)$, in norm.

Now, we have a well-defined operator

(12.24)  $B_K(k) : C^2(K) \longrightarrow C^\infty(\mathbb{R}^3 \setminus K),$

for any $k > 0$, any compact $K \subset \mathbb{R}^3$, extending (1.19). By (12.11), we have asymptotic results on $B_K(k)f(x)$, as $|x| \to \infty$, of the same nature as derived in §1. In particular, the scattering amplitude $a_K(-\omega, \theta, k)$ is defined as before, in terms of the asymptotic behavior of $B_K(k)f(x)$, when $f(x) = -e^{-ik\omega\cdot x}$ on $\partial K$.

We next want to discuss the uniqueness of the scatterer, when $K$ is not required to be smooth. A special case of Proposition 10.2 is that if $K \subset B_R$ is assumed to be smooth and one has fixed $k \in (0,\infty)$ and sufficiently many $\omega_\ell \in S^2$, then the knowledge that $a_K(-\omega_\ell, \theta, k) = 0$, $\forall\,\theta \in S^2$, implies $K$ is empty. The appropriate statement of this result when $K$ is not required to have any smoothness is the following:

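Proposition 12.6. Given compact $K \subset B_R$, fixed $k \in (0,\infty)$, and $\omega_\ell \in S^2$, then if

(12.25)  $a_K(-\omega_\ell, \theta, k) = 0, \quad \forall\,\theta \in S^2,$

for a single $\omega_\ell \in S^2$, it follows that

(12.26)  $\text{cap}\,K = 0.$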


Here “cap” is the Newtonian capacity of $K$, which is discussed in detail in §6 of Chap. 11. One characterization is

(12.27)  $\text{cap}\,K = \inf\Big\{\displaystyle\int |\nabla f(x)|^2\,dx : f \in C_0^\infty(\mathbb{R}^3),\ f = 1 \text{ on nbd of } K\Big\}.$

One can derive the estimate that if $f \in \text{Lip}(\mathbb{R}^3)$ has compact support and $\lambda > 0$, then

(12.28)  $\text{cap}(\{x \in \mathbb{R}^3 : |f(x)| \ge \lambda\}) \le \lambda^{-2}\|\nabla f\|^2_{L^2}.$

See (6.64)-(6.65) of Chap. 11.
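To get a feel for the variational characterization (12.27), one can evaluate the Dirichlet integral of an explicit radial trial function for a closed ball of radius $a$. The sketch below is illustrative only: the trial function is Lipschitz rather than smooth and equals 1 on the ball itself rather than on a neighborhood of it, so it approximates, but does not literally belong to, the admissible class in (12.27). The function name, the outer cutoff radius `R`, and the logarithmic quadrature grid are assumptions made for this example.

```python
import math


def trial_energy(a, R, n=20000):
    """Midpoint-rule value of the Dirichlet integral over R^3 of the radial
    trial function f(r) = 1 for r <= a, f(r) = (a/r - a/R) / (1 - a/R) for
    a <= r <= R, and f(r) = 0 for r >= R (requires 0 < a < R)."""
    c = a / (1.0 - a / R)          # f'(r) = -c / r^2 on (a, R)
    dt = math.log(R / a) / n       # uniform grid in t = log(r / a)
    total = 0.0
    for i in range(n):
        r = a * math.exp((i + 0.5) * dt)
        dr = r * dt
        total += 4.0 * math.pi * (c / (r * r)) ** 2 * r * r * dr
    return total


for a in (1.0, 0.1, 0.01):
    # Compare with 4 * pi * a, the standard capacity of a ball of radius a
    # in the normalization of (12.27).
    print(a, trial_energy(a, R=1000.0), 4.0 * math.pi * a)
```

In closed form this energy is $4\pi a/(1 - a/R)$, which tends to $4\pi a$ as $R \to \infty$. The values shrink linearly with $a$, consistent with the fact that a single point has capacity zero.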
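To prove the proposition, first note that, as in the proof of Proposition 10.2, the hypothesis (12.25) implies

(12.29)  $u(x, k\omega_\ell) = e^{-ik\omega_\ell\cdot x},$

for $x \in \mathbb{R}^3 \setminus \widetilde{K}$, the unbounded, connected component of $\mathbb{R}^3 \setminus K$; we may as well suppose that $\widetilde{K} = K$. Fix $\varphi \in C_0^\infty(\mathbb{R}^3)$ so that $\varphi = 1$ on a neighborhood of $K$. Then (12.29) implies

(12.30)  $\varphi(x)e^{-ik\omega_\ell\cdot x} \in H_0^1(\mathbb{R}^3 \setminus K).$

Hence

(12.31)  $\varphi \in H_0^1(\mathbb{R}^3 \setminus K).$

We claim this implies $\text{cap}\,K = 0$. Indeed, take $\varphi_\nu \in C_0^\infty(\mathbb{R}^3 \setminus K)$ so that $\varphi_\nu \to \varphi$ in $H^1$-norm. Then $f_\nu = \varphi - \varphi_\nu \in C_0^\infty(\mathbb{R}^3)$, $f_\nu = 1$ on a neighborhood of $K$, and $\int |\nabla f_\nu(x)|^2\,dx \to 0$. By (12.27), this implies $\text{cap}\,K = 0$, so the proposition is proved.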

We now want to compare two nonempty obstacles, $K_1$ and $K_2$, with identical scattering data $a(-\omega, \theta, k)$, perhaps for $(\omega, k)$ running over some set. Our next step is to push the arguments used in the proof of Proposition 10.2 to show that under certain conditions the symmetric difference $K_1 \triangle K_2$ has empty interior. After doing that, we will take up the question of whether $\text{cap}(K_1 \triangle K_2) = 0$.

So, as in the proof of Proposition 10.2, suppose $K_1$ and $K_2$ are two compact obstacles in $\mathbb{R}^3$ giving rise to scattered waves $v_j$ that agree on an open set $O$ in the unbounded, connected component of $\mathbb{R}^3 \setminus (K_1 \cup K_2)$. In other words, $u_j = e^{-ik\omega\cdot x} + v_j(x, k\omega)$ has the properties

(12.32)  $(\Delta + k^2)u_j = 0$ on $\mathbb{R}^3 \setminus K_j$, $\quad \varphi u_j \in H_0^1(\mathbb{R}^3 \setminus K_j),$

and $v_j = u_j - e^{-ik\omega\cdot x}$ satisfies the radiation condition. Here, we fix $\varphi \in C_0^\infty(\mathbb{R}^3)$ such that $\varphi = 1$ on a ball containing $K_1 \cup K_2$ in its interior. We suppose the sets $K_j$ have no cavities, so each $\Omega_j = \mathbb{R}^3 \setminus K_j$ has just one connected component. As before, $\mathbb{R}^3 \setminus (K_1 \cup K_2)$ might not be connected, so let $U$ be its unbounded component, and consider $K = \mathbb{R}^3 \setminus U$. If $K_1 \ne K_2$, then either $K_1$ or $K_2$ is a proper subset of $K$. Let us suppose $K_1$ is.

Note that the functions $u_1$ and $u_2$ agree on $U$, since they are real analytic and agree on $O$.


Proposition 12.7. For any $(\omega, k)$ for which $K_1$ and $K_2$ have identical scattering data (for all $\theta$), so do $K_1$ and $K$.

Proof. By the uniqueness result, Lemma 12.2, it suffices to show that

(12.33)  $u = u_1 = u_2$ on $U$

has the property that $\varphi u \in H_0^1(U)$, for any $\varphi \in C_0^\infty(\mathbb{R}^3)$ equal to 1 on a neighborhood of $K$. In turn, this is a consequence of the following general result.

Lemma 12.8. Let $\Omega_j$ be open in $\mathbb{R}^n$, $f_j \in H_0^1(\Omega_j)$. Let $O$ be a connected component of $\Omega_1 \cap \Omega_2$. Then

(12.34)  $f_1 = f_2 = f$ on $O \Longrightarrow f \in H_0^1(O).$


Proof. It suffices to assume that the functions $f_j$ are real-valued. The hypotheses imply $f_j^\pm \in H_0^1(\Omega_j)$ and $f_1^\pm = f_2^\pm = f^\pm$ on $O$. Thus it suffices to assume in addition that $f_j \ge 0$ on $\Omega_j$. Now we can find $g_\nu \in C_0^\infty(\Omega_1)$ and $h_\nu \in C_0^\infty(\Omega_2)$ such that $g_\nu \to f_1$ in $H^1(\Omega_1)$ and $h_\nu \to f_2$ in $H^1(\Omega_2)$. Hence $g_\nu^+ \to f_1$ and $h_\nu^+ \to f_2$ in $H^1$-norm. Now

$\psi_\nu = \min(g_\nu^+, h_\nu^+)\big|_O$

has the properties

$\psi_\nu \in H_0^1(O), \quad \psi_\nu \to f$ in $H^1$-norm,

so (12.34) is proved.


Thus we replace $K_2$ by $K$ (which we relabel as $K_2$), and we investigate whether $K_1 \subset K_2$ can have identical scattering data, for $(k, \omega)$ belonging to some set. We return to the considerations of the functions $u_j$, as in (12.32). (Now, $U = \Omega_2$.)

Suppose $K_2 \setminus K_1$ has nonempty interior $R$. Note that $\partial R \setminus \partial\Omega_1 \subset \Omega_1 \cap \partial\Omega_2$. We claim that $w = u_1|_R$ has the property

(12.35)  $w \in H_0^1(R).$

This is a consequence of the following general result.


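Lemma 12.9. Let $R \subset \Omega$ be open. Then

(12.36)  $f \in H_0^1(\Omega) \cap C(\Omega), \quad f = 0$ on $\partial R \setminus \partial\Omega \Longrightarrow f|_R \in H_0^1(R).$

Proof. It suffices to assume $f$ is real-valued. Then the hypotheses apply to $f^+$ and $f^-$, so it suffices to assume $f \ge 0$ on $\Omega$. Take $f_\nu \in C_0^\infty(\Omega)$, $f_\nu \to f$ in $H^1$-norm. Then $f_\nu^+ \to f$ in $H^1$-norm. Also, if we define $\eta_\varepsilon(s)$ for $s \ge 0$ to be

(12.37)  $\eta_\varepsilon(s) = 0$ if $0 \le s \le \varepsilon$, $\quad \eta_\varepsilon(s) = s - \varepsilon$ if $s \ge \varepsilon$,

and extend $\eta_\varepsilon$ to be an odd function, we have $\eta_\varepsilon(f) \to f$ in $H^1$-norm. Now set

(12.38)  $g_\nu = \min\big(f_\nu^+, \eta_{1/\nu}(f)\big)\big|_R.$

We see that $g_\nu \in H_0^1(R)$ and $g_\nu \to f$ in $H^1$-norm, so we have (12.36).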


Now return to $w = u_1|_R$. We claim this function satisfies the hypotheses of (12.36), with $\Omega = \Omega_1$. To see that $w$ vanishes on $\partial R \setminus \partial\Omega_1$, we use the fact that $u_1 = u_2$ on $\Omega_2$ and argue that $u_2$ vanishes pointwise on a dense subset of $\partial R \setminus \partial\Omega_1$. In fact, a dense subset satisfies an exterior sphere condition (with respect to $\Omega_2$). Hence, a barrier construction (applied to the harmonic function $e^{ky}u_2$) gives this fact. Thus we have (12.35). Also,

(12.39)  $(\Delta + k^2)w = 0$ on $R$.

Of course, $w$ is not identically zero on $R$, so $k^2$ must be an eigenvalue of $-\Delta$ on $R$. Hence we have the following parallel to Proposition 10.2. Suppose we have a bound on

(12.40)  $\dim \ker (\Delta + k^2)\big|_{H_0^1(R)} = d(k).$


Proposition 12.10. Let $K_1$ and $K_2$ be arbitrary compact obstacles, with no cavities. Let $k \in (0,\infty)$ be fixed. Suppose $\Sigma = \{\omega_\ell\}$ is a subset of $S^2$ whose cardinality is known to be greater than $d(k)/2$. (If $\omega_\ell$ and $-\omega_\ell$ both belong to $\Sigma$, do not count them separately.) Then

(12.41)  $a_{K_1}(-\omega_\ell, \theta, k) = a_{K_2}(-\omega_\ell, \theta, k), \quad \forall\,\omega_\ell \in \Sigma,\ \theta \in S^2$

implies that $K_1 \triangle K_2$ has empty interior.

We next show that under stronger hypotheses we can draw a stronger conclusion.

Proposition 12.11. If $K_1 \subset K_2$ are compact sets without cavities in $\mathbb{R}^3$ and

(12.42)  $a_{K_1}(-\omega, \theta, k) = a_{K_2}(-\omega, \theta, k), \quad \forall\,\omega, \theta \in S^2,\ k \in (0,\infty),$

then every compact subset of $K_2 \setminus K_1$ is negligible.


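Proof. What we need to show is that if $L$ is a compact subset of $K_2 \setminus K_1$, and if $\beta \in H^{-1}(\mathbb{R}^3)$ is supported on $L$, then $\beta = 0$.

By Proposition 12.10, the current hypotheses imply that $K_2 \setminus K_1$ has empty interior. Hence $K_2 \setminus K_1 \subset \partial\Omega_2$, so $K_2 \setminus K_1 = \Omega_1 \cap \partial\Omega_2$. Also, as in the considerations above, we have $u_1(x, k\omega) = u_2(x, k\omega)$ for all $x \in \Omega_2$; this time, for all $k\omega \in \mathbb{R}^3 \setminus 0$. We claim that this implies, for all compact $L \subset \Omega_1 \cap \partial\Omega_2$,

(12.43)  $\beta \in H^{-1}(\mathbb{R}^3), \quad \text{supp}\,\beta \subset L \Longrightarrow \langle u_1(\cdot, k\omega), \beta\rangle = 0.$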


To see this, we argue as follows (suppressing the parameters $k, \omega$): Pick $\varphi \in C_0^\infty(\Omega_1)$, equal to 1 on a neighborhood of $L$. Then $\varphi u_2 \in H_0^1(\Omega_2)$, so we can take a sequence $f_\nu \in C_0^\infty(\Omega_2)$ such that $f_\nu \to \varphi u_2$ in $H^1(\Omega_2)$-norm. We can also regard $f_\nu$ as an element of $C_0^\infty(\Omega_1)$, and of course $(f_\nu)$ is Cauchy in $H_0^1(\Omega_1)$. We claim that

(12.44)  $f_\nu \longrightarrow \varphi u_1$ in $H_0^1(\Omega_1)$.

Indeed, we have $f_\nu \to w$ for some $w \in H_0^1(\Omega_1)$, and hence $w = \varphi u_2$ on $\Omega_2$. We want to show that $w = \varphi u_1$ on $\Omega_1$. Since $u_1 = u_2$ on $\Omega_2$, we have $w = \varphi u_1$ on $\Omega_2$, so

(12.45)  $\text{supp}\,(w - \varphi u_1) \subset \Omega_1 \cap K_2,$

a set that, in the current setting, is equal to $\Omega_1 \cap \partial\Omega_2$. Of course, if $\Omega_1 \cap \partial\Omega_2$ has three-dimensional Lebesgue measure 0, we can deduce $w = \varphi u_1$. If it has positive measure, we argue as follows. First, the characterization $w = \lim f_\nu$ clearly implies $w = 0$ on $\Omega_1 \cap K_2$. Furthermore, material on regular points discussed in Chap. 11, §6, applied to the harmonic functions $e^{ky}u_j(x)$, implies that $\lim_{y \to x} u_2(y, k\omega) = 0$ for all $x \in \partial\Omega_2$ except for a set of interior capacity zero; in the current situation this implies $u_1 = 0$ a.e. on $\Omega_1 \cap \partial\Omega_2$. Hence we again have $w = \varphi u_1$, so (12.44) holds. On the other hand, (12.43) follows from (12.44).

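Having (12.43), for all $\xi = k\omega \in \mathbb{R}^3 \setminus 0$, we deduce that, given $F \in C_0^\infty(\mathbb{R}^3 \setminus 0)$, if we set

(12.46)  $g(x) = \displaystyle\int u_1(x, \xi)F(\xi)\,d\xi,$

then

(12.47)  $\beta \in H^{-1}(\mathbb{R}^3), \quad \text{supp}\,\beta \subset L \Longrightarrow \langle g, \beta\rangle = 0.$

However, the set of functions of the form (12.46) is dense in $H_0^1(\Omega_1)$, by the isomorphism (4.10), which continues to hold in this setting. Thus

(12.48)  $\beta \in H^{-1}(\mathbb{R}^3), \quad \text{supp}\,\beta \subset L \Longrightarrow \beta = 0.$

As discussed in Chap. 5, this means $L$ is negligible.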


A consequence of material in §6 of Chap. 11 is that if a compact set is negligible, then its capacity is zero. Thus, by Proposition 12.11 (in conjunction with Proposition 12.7), when (12.42) holds, $K_1 \triangle K_2$ has “inner capacity” zero; see §6 of Chap. 11 for further discussion of inner capacity.


Exercises 
1. Extend results of §§1—6 to obstacles considered here, with particular attention to results 
needed in the proof of Proposition 12.11. 


2. Show that, for any open Q C R", the map u +> |u| is continuous on H'(Q). Use this 
to justify the limiting arguments made in the proofs of Lemmas 12.8 and 12.9. 


A. Lidskii’s trace theorem 


The purpose of this appendix is to prove the following result of V. Lidskii, which 
is used for (8.2): 


Theorem A.1. If $A$ is a trace class operator on a Hilbert space $H$, then

(A.1)  $\text{Tr}\,A = \sum_j (\dim V_j)\lambda_j,$

where $\{\lambda_j : j \ge 1\} = \text{Spec}\,A \setminus \{0\}$ and $V_j$ is the generalized $\lambda_j$-eigenspace of $A$.


We will make use of elementary results about trace class operators, established in §6 of Appendix A, Functional Analysis. In particular, if $\{u_j\}$ is any orthonormal basis of $H$, then

(A.2)  $\text{Tr}\,A = \sum_j (Au_j, u_j),$

the result being independent of the choice of orthonormal basis, provided $A$ is trace class.
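In finite dimensions both (A.1) and (A.2) reduce to linear algebra, and a small numerical check may help fix ideas. The sketch below is an illustration under stated assumptions, not part of the proof: the $2 \times 2$ matrix is an arbitrary choice, the orthonormal bases are rotations of the standard basis, and the helper names are hypothetical.

```python
import cmath
import math

A = [[1.0, 2.0], [3.0, 4.0]]  # arbitrary non-symmetric test matrix (an assumption)


def trace_in_basis(A, theta):
    """Sum of (A u_j, u_j) over the orthonormal basis of R^2 obtained by
    rotating the standard basis through the angle theta."""
    basis = [(math.cos(theta), math.sin(theta)),
             (-math.sin(theta), math.cos(theta))]
    total = 0.0
    for u in basis:
        Au = (A[0][0] * u[0] + A[0][1] * u[1],
              A[1][0] * u[0] + A[1][1] * u[1])
        total += Au[0] * u[0] + Au[1] * u[1]
    return total


def eigenvalues_2x2(A):
    """Roots of the characteristic polynomial t^2 - (tr A) t + det A."""
    tr = A[0][0] + A[1][1]
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    disc = cmath.sqrt(tr * tr - 4.0 * det)
    return (tr + disc) / 2.0, (tr - disc) / 2.0


lam1, lam2 = eigenvalues_2x2(A)
# The basis-dependent sums and the sum of the eigenvalues should all agree.
print(trace_in_basis(A, 0.0), trace_in_basis(A, 0.7), (lam1 + lam2).real)
```

The content of Lidskii's theorem is that the same identity persists for trace class operators on an infinite-dimensional Hilbert space, where the matrix of $A$ need not be triangularizable on all of $H$.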

To begin the proof, let $E_\ell = \bigoplus_{j \le \ell} V_j$, and let $P_\ell = Q_1 + \cdots + Q_\ell$ denote the orthogonal projection of $H$ onto $E_\ell$. Thus

$AP_\ell = P_\ell AP_\ell,$

restricted to $E_\ell$, has spectrum $\{\lambda_j : 1 \le j \le \ell\}$. We will choose an orthonormal set $\{u_j : j \ge 1\}$ according to the following prescription: $\{u_j : 1 + \dim E_{\ell-1} \le j \le \dim E_\ell\}$ will be an orthonormal basis of $R(Q_\ell)$, with the property that $Q_\ell AQ_\ell$ (restricted to $R(Q_\ell)$) is upper triangular. That this can be done is proved in Theorem 4.7 of Chap. 1. Note that $\{u_j : 1 \le j \le \dim E_\ell\}$ is then an orthonormal basis of $E_\ell$, with respect to which $AP_\ell = P_\ell AP_\ell$ is upper triangular. It follows that the diagonal entries of $P_\ell AP_\ell|_{E_\ell}$ with respect to this basis are precisely $\lambda_j$, $1 \le j \le \ell$, counted with multiplicity $\dim V_j$. Inductively, we conclude that the diagonal entries of each block $Q_\ell AQ_\ell$ consist of $\dim V_\ell$ copies of $\lambda_\ell$.

Let $H_0$ denote the closed linear span of $\{u_j : j \ge 1\}$, and $H_1$ the orthogonal complement of $H_0$ in $H$, and let $R_\nu$ be the orthogonal projection of $H$ on $H_\nu$. We can write $A$ in block form

(A.3)  $A = \begin{pmatrix} A_0 & B \\ 0 & A_1 \end{pmatrix},$

where $A_\nu = R_\nu AR_\nu$, restricted to $H_\nu$. Clearly, $A_0$ and $A_1$ are trace class and, by the construction above plus (A.2), we have

(A.4)  $\text{Tr}\,A_0 = \sum_j (\dim V_j)\lambda_j.$

Thus (A.1) will follow if we can show that $\text{Tr}\,A_1 = 0$. If $H_1 = 0$, there is no problem.


Lemma A.2. If $H_1 \ne 0$, then $\text{Spec}\,A_1 = \{0\}$.

Proof. Suppose $\text{Spec}\,A_1$ contains an element $\mu \ne 0$. Since $A_1$ is compact on $H_1$, there must exist a unit vector $v \in H_1$ such that $A_1v = \mu v$. Let $\widetilde{H} = H_0 + (v)$. Note that

$Av = \mu v + w, \quad w \in H_0.$

Hence $\widetilde{H}$ is invariant under $A$; let $\widetilde{A}$ denote $A$ restricted to $\widetilde{H}$. Of course, $H_0$ is invariant under $A$, and $A$ restricted to $H_0$ is $A_0$.

Note that both $T_\mu = A_0 - \mu I$ (on $H_0$) and $\widetilde{T}_\mu = \widetilde{A} - \mu I$ (on $\widetilde{H}$) are Fredholm operators of index zero, and that

$\text{Codim}\,\widetilde{T}_\mu(\widetilde{H}) = 1 + \text{Codim}\,T_\mu(H_0).$

Hence

$\text{Dim Ker}(\widetilde{A} - \mu I) = 1 + \text{Dim Ker}(A_0 - \mu I).$

It follows that the $\mu$-eigenspace of $\widetilde{A}$ is bigger than the $\mu$-eigenspace of $A_0$. But this is impossible, since by construction, for any $\mu \ne 0$, the $\mu$-eigenspace of $A_0$ is the entire $\mu$-eigenspace of $A$. Thus the lemma is proved.


A linear operator $K$ is said to be quasi-nilpotent provided $\text{Spec}\,K = \{0\}$. If this holds, then $(I + zK)^{-1}$ is an entire holomorphic function of $z$. The convergence of its power series implies

(A.5)  $\sup_j |z|^j\|K^j\| < \infty, \quad \forall\,z \in \mathbb{C},$

a condition that is in fact equivalent to $\text{Spec}\,K = \{0\}$. To prove Theorem A.1, it suffices to demonstrate the following.

Lemma A.3. If $K$ is a trace-class operator on a Hilbert space and $K$ is quasi-nilpotent, then $\text{Tr}\,K = 0$.


To prove Lemma A.3, we use results on the determinant established in §6 of Appendix A, Functional Analysis. Thus, we consider the entire holomorphic function

(A.6)  $\varphi(z) = \det(I + zK),$

which is well defined for trace class $K$. By (6.45) of Appendix A,

(A.7)  $|\varphi(z)| \le C_\varepsilon e^{\varepsilon|z|}, \quad \forall\,\varepsilon > 0.$

Also, by Proposition 6.16 of Appendix A, $\varphi(z) \ne 0$ whenever $I + zK$ is invertible. Now, if $K$ is quasi-nilpotent, then, as remarked above, $I + zK$ is invertible for all $z \in \mathbb{C}$. Hence $\varphi(z)$ is nowhere vanishing, so we can write

(A.8)  $\varphi(z) = e^{f(z)},$

with $f(z)$ holomorphic on $\mathbb{C}$. Now (A.7) implies $\text{Re}\,f(z) \le C_\varepsilon + \varepsilon|z|$ for all $\varepsilon > 0$, and a Harnack inequality argument applied to this gives

(A.9)  $|\text{Re}\,f(z)| \le C'_\varepsilon + \varepsilon|z|, \quad \forall\,\varepsilon > 0.$

See Chap. 3, §2, Exercises 13-16. The estimate (A.9) in turn (e.g., by Proposition 4.6 of Chap. 3) implies that $\text{Re}\,f$ is constant, so $f$ is constant, and hence $\varphi$ is constant. But, by (6.41) of Appendix A, we have

(A.10)  $\text{Tr}\,K = \varphi'(0),$

so the lemma is proved. Hence the proof of Theorem A.1 is complete.
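The mechanism behind Lemma A.3 is easy to see in finite dimensions, where quasi-nilpotent means nilpotent. The following sketch assumes a specific strictly upper triangular $3 \times 3$ matrix (chosen arbitrarily for illustration) and checks that $\varphi(z) = \det(I + zK)$ is constant, so that $\varphi'(0) = \text{Tr}\,K = 0$, in line with (A.10). The helper names are hypothetical.

```python
def det3(M):
    """Determinant of a 3x3 matrix by cofactor expansion along the first row."""
    return (M[0][0] * (M[1][1] * M[2][2] - M[1][2] * M[2][1])
            - M[0][1] * (M[1][0] * M[2][2] - M[1][2] * M[2][0])
            + M[0][2] * (M[1][0] * M[2][1] - M[1][1] * M[2][0]))


# Strictly upper triangular, hence nilpotent, hence Spec K = {0}.
K = [[0, 2, 5], [0, 0, 3], [0, 0, 0]]


def phi(z):
    """phi(z) = det(I + zK) for the 3x3 matrix K above; z may be complex."""
    return det3([[(1 if i == j else 0) + z * K[i][j] for j in range(3)]
                 for i in range(3)])


trace_K = sum(K[i][i] for i in range(3))
h = 1e-6
phi_prime_0 = (phi(h) - phi(-h)) / (2 * h)  # central difference for phi'(0)
print([phi(z) for z in (-2, 0.5, 3 + 4j)], trace_K, phi_prime_0)
```

In the infinite-dimensional case $\varphi$ is no longer a polynomial, which is why the proof needs the growth bound (A.7) and the Harnack argument to conclude that $\varphi$ is constant.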

A proof of Lidskii’s theorem that avoids the first part of the argument given above and simply uses determinants, but makes heavier use of complex function theory, is given in [Si2].


References 


[Ag] S. Agmon, Lectures on Elliptic Boundary Problems, Van Nostrand, New York, 1964.
[AK2LT] M. Anderson, A. Katsuda, Y. Kurylev, M. Lassas, and M. Taylor, Boundary regularity for the Ricci equation, geometric convergence, and Gel'fand's inverse boundary problem, Invent. Math. 158(2004), 261-321.
[ACK] T. Angell, D. Colton, and A. Kirsch, The three dimensional inverse scattering problem for acoustic waves, J. Diff. Eq. 46(1982), 46-58.
[AKR] T. Angell, R. Kleinman, and G. Roach, An inverse transmission problem for the Helmholtz equation, Inverse Probl. 3(1987), 149-180.
[Ber] A. M. Berthier, Spectral Theory and Wave Operators for the Schrödinger Equation, Pitman, Boston, 1982.
[Bor] D. Borthwick, Spectral Theory of Infinite-area Hyperbolic Spaces, Birkhäuser, Boston, 2007.
[Bs] V. Buslaev, Scattered plane waves, spectral asymptotics, and trace formulas in exterior problems, Dokl. Akad. Nauk. SSSR 197(1971), 1067-1070.
[Co] D. Colton, The inverse scattering problem for time-harmonic acoustic waves, SIAM Rev. 26(1984), 323-350.
[Co2] D. Colton, Partial Differential Equations, an Introduction, Random House, New York, 1988.
[CK] D. Colton and R. Kress, Integral Equation Methods in Scattering Theory, Wiley, New York, 1983.
[CK2] D. Colton and R. Kress, Inverse Acoustic and Electromagnetic Scattering Theory, Springer, New York, 1992.
[CH] R. Courant and D. Hilbert, Methods of Mathematical Physics II, J. Wiley, New York, 1966.
[DZ] S. Dyatlov and M. Zworski, Mathematical Theory of Scattering Resonances, Amer. Math. Soc., Providence RI, 2019.
[Ei] D. Eidus, On the principle of limiting absorption, Mat. Sb. 57(1962), 13-44.
[Erd] A. Erdélyi, Asymptotic Expansions, Dover, New York, 1956.
[HeR] J. Helton and J. Ralston, The first variation of the scattering matrix, J. Diff. Equations 21(1976), 378-394.
[HiP] E. Hille and R. Phillips, Functional Analysis and Semigroups, Colloq. Publ. Vol. 31, AMS, Providence, RI, 1957.
[Ho] L. Hörmander, The Analysis of Linear Partial Differential Operators, Vols. 3 and 4, Springer, New York, 1985.
[Isa] V. Isakov, Uniqueness and stability in multi-dimensional inverse problems, Inverse Probl. 9(1993), 579-621.
[JeK] A. Jensen and T. Kato, Asymptotic behavior of the scattering phase for exterior domains, Comm. PDE 3(1978), 1165-1195.
[JM] D. Jones and X. Mao, The inverse problem in hard acoustic scattering, Inverse Probl. 5(1989), 731-748.
[Kt] T. Kato, Perturbation Theory for Linear Operators, Springer, New York, 1966.
[Kir] A. Kirsch, The domain derivative and two applications in inverse scattering theory, Inverse Probl. 9(1993), 81-96.
[KZ] A. Kirsch, R. Kress, P. Monk, and A. Zinn, Two methods for solving the inverse scattering problem, Inverse Probl. 4(1988), 749-770.
[KV] G. Kristensson and C. Vogel, Inverse problems for acoustic waves using the penalised likelihood method, Inverse Probl. 2(1984), 461-479.
[LTU] M. Lassas, M. Taylor, and G. Uhlmann, The Dirichlet-to-Neumann map for complete Riemannian manifolds with boundary, Comm. in Analysis and Geometry 11(2003), 207-221.
[LP1] P. Lax and R. Phillips, Scattering Theory, Academic, New York, 1967.
[LP2] P. Lax and R. Phillips, Scattering theory, Rocky Mountain J. Math. 1(1971), 173-223.
[LP3] P. Lax and R. Phillips, Scattering theory for the acoustic equation in an even number of space dimensions, Indiana University Math. J. 22(1972), 101-134.
[LP4] P. Lax and R. Phillips, The time delay operator and a related trace formula, in Topics in Func. Anal., I. Gohberg and M. Kac, eds., Academic, New York, 1978, pp. 197-215.
[LP5] P. Lax and R. Phillips, Scattering Theory for Automorphic Functions, Princeton University Press, Princeton, NJ, 1976.
[MjR] A. Majda and J. Ralston, An analogue of Weyl's theorem for unbounded domains, Duke Math. J. 45(1978), 513-536.
[MjT] A. Majda and M. Taylor, Inverse scattering problems for transparent obstacles, electromagnetic waves, and hyperbolic systems, Comm. PDE 2(1977), 395-438.
[Me1] R. Melrose, Scattering theory and the trace of the wave group, J. Func. Anal. 45(1982), 29-40.
[Me2] R. Melrose, Polynomial bound on the number of scattering poles, J. Func. Anal. 53(1983), 287-303.
[Me3] R. Melrose, Weyl asymptotics for the phase in obstacle scattering, Comm. PDE 11(1988), 1431-1439.
[Me4] R. Melrose, Geometric Scattering Theory, Cambridge University Press, Cambridge, 1995.
[MT1] R. Melrose and M. Taylor, Near peak scattering and the corrected Kirchhoff approximation for convex bodies, Adv. Math. 55(1985), 242-315.
[MT2] R. Melrose and M. Taylor, Boundary Problems for Wave Equations with Grazing and Gliding Rays. Monograph, Preprint, 2010.
[Mr1] K. Miller, Least squares methods for ill-posed problems with a prescribed bound, SIAM J. Math. Anal. 1(1970), 52-74.
[Mr2] K. Miller, Efficient numerical methods for backward solutions of parabolic problems with variable coefficients, in Improperly Posed Boundary Value Problems, A. Carasso and A. Stone, eds., Pitman, London, 1975.
[MrV] K. Miller and G. Viano, On the necessity of nearly-best possible methods for analytic continuation of scattering data, J. Math Phys. 14(1973), 1037-1048.
[Mw] C. Morawetz, The limiting amplitude principle, Comm. Pure Appl. Math. 15(1962), 349-361.
[Mw2] C. Morawetz, Exponential decay of solutions to the wave equation, Comm. Pure Appl. Math. 19(1966), 439-444.
[MRS] C. Morawetz, J. Ralston, and W. Strauss, Decay of solutions of the wave equation outside nontrapping obstacles, Comm. Pure Appl. Math. 30(1977), 447-508.
[MTW] R. Murch, D. Tan, and D. Wall, Newton-Kantorovich method applied to two-dimensional inverse scattering for an exterior Helmholtz problem, Inverse Probl. 4(1988), 1117-1128.
[New] R. G. Newton, Scattering Theory of Waves and Particles, Springer, New York, 1966.
[Nus] H. Nussenzveig, High frequency scattering by an impenetrable sphere, Ann. Phys. 34(1965), 23-95.
[Olv] F. Olver, Asymptotics and Special Functions, Academic, New York, 1974.
[PP] V. Petkov and G. Popov, Asymptotic behavior of the scattering phase for nontrapping obstacles, Ann. Inst. Fourier (Grenoble) 32(1982), 111-149.
[Rl] J. Ralston, Propagation of singularities and the scattering matrix, in Singularities in Boundary Value Problems, H. Garnir, ed., D. Reidel, Dordrecht, 1981, pp. 169-184.
[Rm] A. Ramm, Scattering by Obstacles, D. Reidel, Dordrecht, 1986.
[Rau] J. Rauch, Partial Differential Equations, Springer, New York, 1991.
[RT] J. Rauch and M. Taylor, Potential and scattering theory on wildly perturbed domains, J. Func. Anal. 18(1975), 27-59.
[RS] M. Reed and B. Simon, Methods of Mathematical Physics, Academic, New York, Vols. 1,2, 1975; Vols. 3,4, 1978.
[Rog] A. Roger, Newton-Kantorovich algorithm applied to an electromagnetic inverse problem, IEEE Trans. Antennas Propagat. 29(1981), 232-238.
[Si] B. Simon, Quantum Mechanics for Hamiltonians Defined as Quadratic Forms, Princeton University Press, Princeton, NJ, 1971.
[Si2] B. Simon, Trace Ideals and Their Applications, London Math. Soc. Lecture Notes no. 35, Cambridge University Press, Cambridge, 1979.
[Stk] I. Stakgold, Boundary Value Problems of Mathematical Physics, Macmillan, New York, 1968.
[T1] M. Taylor, Propagation, reflection, and diffraction of singularities of solutions to wave equations, Bull. AMS 84(1978), 589-611.
[T2] M. Taylor, Fourier integral operators and harmonic analysis on compact manifolds, Proc. Symp. Pure Math. 35(pt. 2)(1979), 113-136.
[T3] M. Taylor, Pseudodifferential Operators, Princeton University Press, Princeton, NJ, 1981.
[T4] M. Taylor, Estimates for approximate solutions to acoustic inverse scattering problems, Inverse problems in wave propagation, 463-499, IMA Vol. Math. Appl. 90, Springer NY, 1997.
[VV] V. K. Varadan and V. V. Varadan (eds.), Acoustic, Electromagnetic, and Elastic Wave Scattering: Focus on the T matrix Approach, Pergamon, New York, 1980.
[Vas] A. Vasy, Propagation of singularities in three-body scattering, Astérisque #262, 2000.
[Wat] G. Watson, Theory of Bessel Functions, Cambridge University Press, Cambridge, 1945.
[Wil] C. Wilcox, Scattering Theory for the d'Alembert Equation in Exterior Domains, LNM no. 442, Springer, New York, 1975.
[Yo] K. Yosida, Functional Analysis, Springer, New York, 1965.


10

Dirac Operators and Index Theory 


Introduction 


The physicist P. A. M. Dirac constructed first-order differential operators whose 
squares were Laplace operators, or more generally wave operators, for the purpose 
of extending the Schrodinger—Heisenberg quantum mechanics to the relativistic 
setting. Related operators have been perceived to have central importance in 
the interface between PDE and differential geometry, and we discuss some of 
this here. 

We define various classes of “Dirac operators,” some arising on arbitrary 
Riemannian manifolds, some requiring some special geometrical structure, such 
as a spin structure, discussed in §3, or a spin$^c$ structure, discussed in §8. 

Dirac operators on compact Riemannian manifolds are elliptic and have an index. 
Evaluating this index, in terms of an integrated “curvature,” is the essence 
of the famous Atiyah-Singer index theorem. We present a proof of this index 
formula here, using a “heat-equation” method of proof. Such a proof was first 
suggested in [McS], but it seemed difficult to carry out, as it required 
understanding of a coefficient in the asymptotic expansion of the traces of 
$e^{-tL_j}$, for a pair of positive, second-order, elliptic operators $L_j$, well 
below the principal term. Ingenious arguments, beginning with V. Patodi 
[Pt1, Pt2], led to a proof in Atiyah-Bott-Patodi [ABP]. Later, physicists, 
motivated by ideas from “supersymmetry,” proposed more direct heat-equation 
proofs. Such proposals were first made by E. Witten [Wit]; particularly elegant 
mathematical treatments were given by E. Getzler in [Gt1] and [Gt2]. We present 
a heat-equation proof in §6, using Getzler’s method of exploiting an analogue of 
Mehler’s formula for the exponential of the harmonic oscillator Hamiltonian. 
Our analytical details differ from Getzler’s; instead of introducing a 
noncommutative symbol calculus as in [Gt1], or the dilation argument of [Gt2], 
we fit the analysis more into a “classical” examination of heat-equation 
asymptotics, such as dealt with in Chap. 7. One major achievement of Getzler’s 
approach is to make the appearance of the (rather subtle) $\hat{A}$-genus of $M$ 
in the index formula arise quite naturally. 

We present two specific examples of the index formula here. In §7 we derive 
the Chern-Gauss-Bonnet formula, giving the Euler characteristic $\chi(M)$ 


© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 
M. E. Taylor, Partial Differential Equations II, Applied Mathematical Sciences 116, 
https://doi.org/10.1007/978-3-031-33700-0_10 




of a compact, orientable Riemannian manifold in terms of an integral of the 
“Pfaffian” applied to its curvature tensor. In §9 (following a discussion in §8 
of spin$^c$ structures) we discuss the Riemann-Roch formula, a tool for 
understanding holomorphic (and meromorphic) sections of line bundles over Riemann 
surfaces (of real dimension 2), which is important in the study of the structure and 
function theory of Riemann surfaces. These are the simplest applications of the 
Atiyah-Singer formula; both were established well before the general formula. 
From a technical point of view, both have in common that the $\hat{A}$-genus of $M$ is 
effectively discarded. 

Other examples of the index formula include higher-dimensional 
Riemann-Roch formulas and signature formulas. Further material on these can 
be found in [Pal] and [Gil]. There is also an operator associated with “self-dual” 
connections on bundles over 4-manifolds, whose index plays an important role in 
the study of the Yang—Mills equations; see [AHS] and [FU]. For a variant, arising 
from the Seiberg—Witten equations, see [D] and [Mor]. 

The heat-equation proof of the Chern—Gauss—Bonnet theorem was Patodi’s 
[Pt1] first step in this circle of results. An exposition of the heat-equation proof 
of this result due to B. Simon is given in the last chapter of [CS]. Another proof 
of the Chern—Gauss—Bonnet theorem, celebrating closely physicists’ ideas about 
supersymmetry, is given in [Rog]. 

Due to the low dimension, one can give a direct proof of the Riemann—Roch 
theorem, using techniques of [McS]; such a proof is given in [Kot]. Such a direct 
approach, with a good bit more effort, could be expected to be effective in other 
low dimensions (e.g., complex dimension 2); in a sense, the sort of analysis required 
to accomplish this is what was done in [Ko1]. In §10, we give a direct proof 
of an index formula for first-order, elliptic differential operators of Dirac type on 
a 2-manifold M, in terms of a direct calculation of the second term in the expan- 
sion of the heat kernel, carried out in §14 of Chap. 7, using the Weyl calculus. We 
show how this formula yields the Gauss—Bonnet formula and the Riemann—Roch 
formula. 

There are also other heat-equation proofs of the index theorem, particularly 
[Bi1] and [BV]. In [Bi2] the heat equation method is applied to families of operators; 
see also [Don] and [BiC]. There are several books devoted to expositions of 
heat-equation proofs of the index theorem, including [BGV, Gil, Mel], and [Roe]. 

A systematic “blow up” of the original proof of the Atiyah—Singer index 
theorem has led to the interesting subject of operator K-theory. An introductory 
exposition is given in Blackadar [Bl]. Further developments are described in 
[BDT, Con], and [BHS]. 

In §11 we change course, and produce an index formula for a class of elliptic 
$k \times k$ systems on Euclidean space $\mathbb{R}^n$. We do this for the class of 
pseudodifferential operators of harmonic oscillator type, introduced in §15 of Chap. 7. The 
proof here makes no use of heat-equation techniques. It uses some results from 
topology, particularly results on the homotopy groups of the unitary groups U(k), 
including the Bott periodicity theorem, results for which we refer to [Mil] for 
proof. Section 11 can be read independently of the other sections of this chapter. 


1. Operators of Dirac type 


Let $M$ be a Riemannian manifold, $E_j \to M$ vector bundles with Hermitian metrics. 
A first-order, elliptic differential operator 

(1.1)    $D : C^\infty(M, E_0) \longrightarrow C^\infty(M, E_1)$ 

is said to be of Dirac type provided $D^*D$ has scalar principal symbol. This implies 

(1.2)    $\sigma_{D^*D}(x, \xi) = g(x, \xi)\, I : E_{0x} \longrightarrow E_{0x}$, 

where $g(x, \xi)$ is a positive quadratic form on $T_x^*M$. Thus $g$ itself arises from 
a Riemannian metric on $M$. Now the calculation of (1.2) is independent of the 
choice of Riemannian metric on $M$. We will suppose $M$ is endowed with the 
Riemannian metric inducing the form $g(x, \xi)$ on $T^*M$. 

If $E_0 = E_1$ and $D = D^*$, we say $D$ is a symmetric Dirac-type operator. Given 
a general operator $D$ of Dirac type, if we set $E = E_0 \oplus E_1$ and define $\tilde{D}$ on 
$C^\infty(M, E)$ as 

(1.3)    $\tilde{D} = \begin{pmatrix} 0 & D^* \\ D & 0 \end{pmatrix}$, 

then $\tilde{D}$ is a symmetric Dirac-type operator. 

Let $\vartheta(x, \xi)$ denote the principal symbol of a symmetric Dirac-type operator. 
With $x \in M$ fixed, set $\vartheta(\xi) = \vartheta(x, \xi)$. Thus $\vartheta$ is a linear map from 
$T_x^*M = \{\xi\}$ into $\mathrm{End}(E_x)$, satisfying 

(1.4)    $\vartheta(\xi) = \vartheta(\xi)^*$ 

and 

(1.5)    $\vartheta(\xi)^2 = \langle \xi, \xi \rangle I$. 

Here, $\langle\ ,\ \rangle$ is the inner product on $T_x^*M$; let us denote this vector space by $V$. 
We will show how $\vartheta$ extends from $V$ to an algebra homomorphism, defined on a 
Clifford algebra $\mathrm{Cl}(V, g)$, which we now proceed to define. 

Let $V$ be a finite-dimensional, real vector space, $g$ a quadratic form on $V$. 
We allow $g$ to be definite or indefinite if nondegenerate; we even allow $g$ to be 
degenerate. The Clifford algebra $\mathrm{Cl}(V, g)$ is the quotient algebra of the tensor 
algebra 

(1.6)    $\bigotimes V = \mathbb{R} \oplus V \oplus (V \otimes V) \oplus (V \otimes V \otimes V) \oplus \cdots$ 

by the ideal $\mathcal{I} \subset \bigotimes V$ generated by 

(1.7)    $\{ v \otimes w + w \otimes v - 2\langle v, w \rangle \cdot 1 : v, w \in V \}$, 

where $\langle\ ,\ \rangle$ is the symmetric bilinear form on $V$ arising from $g$. Thus, in 
$\mathrm{Cl}(V, g)$, $V$ occurs naturally as a linear subspace, and there is the 
anticommutation relation 

(1.8)    $vw + wv = 2\langle v, w \rangle \cdot 1$ in $\mathrm{Cl}(V, g)$, $\quad v, w \in V$. 


We will look more closely at the structure of Clifford algebras in the next section. 
Now if $\vartheta : V \to \mathrm{End}(E)$ is a linear map of $V$ into the space of 
endomorphisms of a vector space $E$, satisfying (1.5), i.e., 

(1.9)    $\vartheta(v)^2 = \langle v, v \rangle I, \quad v \in V$, 

it follows from expanding $\vartheta(v + w)^2 = [\vartheta(v) + \vartheta(w)]^2$ that 

(1.10)    $\vartheta(v)\vartheta(w) + \vartheta(w)\vartheta(v) = 2\langle v, w \rangle I, \quad v, w \in V$. 

Then, from the construction of $\mathrm{Cl}(V, g)$, it follows that $\vartheta$ extends uniquely to an 
algebra homomorphism 

(1.11)    $\vartheta : \mathrm{Cl}(V, g) \longrightarrow \mathrm{End}(E), \quad \vartheta(1) = I$. 


This gives $E$ the structure of a module over $\mathrm{Cl}(V, g)$, or a Clifford module. If $E$ 
has a Hermitian metric and (1.4) also holds, that is, 

(1.12)    $\vartheta(v) = \vartheta(v)^*, \quad v \in V$, 

we call $E$ a Hermitian Clifford module. For this notion to be useful, we need $g$ to 
be positive-definite. 

In the case where $E = E_0 \oplus E_1$ is a direct sum of Hermitian vector spaces, we 
say a homomorphism $\vartheta : \mathrm{Cl}(V, g) \to \mathrm{End}(E)$ gives $E$ the structure of a graded 
Clifford module provided $\vartheta(v)$ interchanges $E_0$ and $E_1$, for $v \in V$, in addition to 
the hypotheses above. The principal symbol of (1.3) has this property if $D$ is of 
Dirac type. 
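
The relations (1.9), (1.10), and (1.12) can be checked numerically in a small case. The following is a minimal sketch, assuming $V = \mathbb{R}^3$ with its Euclidean inner product and taking $\vartheta(v) = \sum_j v_j \sigma_j$ with the Pauli matrices $\sigma_j$ acting on $E = \mathbb{C}^2$; this choice of module, and the names `theta` and `check_clifford_relations`, are illustrative assumptions and do not come from the text.

```python
import numpy as np

# Pauli matrices: Hermitian 2x2 matrices with sigma_j^2 = I (illustrative choice of module).
SIGMA = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]


def theta(v):
    """Linear map V = R^3 -> End(C^2), v -> sum_j v_j sigma_j."""
    return sum(vj * s for vj, s in zip(v, SIGMA))


def check_clifford_relations(v, w, tol=1e-12):
    """Return True if (1.9), (1.10), and (1.12) hold for the pair v, w."""
    identity = np.eye(2)
    square_ok = np.allclose(theta(v) @ theta(v), np.dot(v, v) * identity, atol=tol)
    anticommutator = theta(v) @ theta(w) + theta(w) @ theta(v)
    anti_ok = np.allclose(anticommutator, 2 * np.dot(v, w) * identity, atol=tol)
    hermitian_ok = np.allclose(theta(v), theta(v).conj().T, atol=tol)
    return square_ok and anti_ok and hermitian_ok


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    v, w = rng.standard_normal(3), rng.standard_normal(3)
    print(check_clifford_relations(v, w))
```

Since $\vartheta$ here is linear, Hermitian, and satisfies (1.9), the construction of $\mathrm{Cl}(V, g)$ makes $\mathbb{C}^2$ a Hermitian Clifford module in the sense above.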

Let us give some examples of operators of Dirac type. If $M$ is a Riemannian 
manifold, the exterior derivative operator 

(1.13)    $d : \Lambda^j M \longrightarrow \Lambda^{j+1} M$ 

has a formal adjoint 

(1.14)    $\delta = d^* : \Lambda^{j+1} M \longrightarrow \Lambda^j M$, 

discussed in Chap. 2, §10, and in Chap. 5, §§8 and 9. Thus we have 

(1.15)    $d + \delta : \Lambda^* M \longrightarrow \Lambda^* M$, 

where, with $n = \dim M$, 

    $\Lambda^* M = \bigoplus_{j=0}^{n} \Lambda^j M$. 

As was shown in Chap. 2, $(d + \delta)^*(d + \delta) = d^*d + dd^*$ is the negative of the 
Hodge Laplacian on each $\Lambda^j M$, so (1.15) is a symmetric Dirac-type operator. 
There is more structure. Indeed, we have 

(1.16)    $d + \delta : \Lambda^{\mathrm{even}} M \longrightarrow \Lambda^{\mathrm{odd}} M$. 

If $D$ is this operator, then $D^* = d + \delta : \Lambda^{\mathrm{odd}} M \to \Lambda^{\mathrm{even}} M$, and an operator 
of type (1.3) arises. If $M$ is compact, the operator (1.16) is Fredholm, with index 
equal to the Euler characteristic of $M$, in view of the Hodge decomposition. 
A calculation of this index in terms of an integrated curvature gives rise to the 
generalized Gauss-Bonnet formula, as will be seen in §7. 

Computations implying that (1.15) is of Dirac type were done in §10 of 
Chap. 2, leading to (10.22) there. If we define 

(1.17)    $\wedge_v : \Lambda^j V \longrightarrow \Lambda^{j+1} V, \quad \wedge_v(v_1 \wedge \cdots \wedge v_j) = v \wedge v_1 \wedge \cdots \wedge v_j$, 

on a vector space $V$ with a positive-definite inner product, and then define 

(1.18)    $\iota_v : \Lambda^{j+1} V \longrightarrow \Lambda^j V$ 

to be its adjoint, then the principal symbol of $d + \delta$ on $V = T_x^*M$ is $1/i$ times 
$\wedge_v - \iota_v$. That is to say, 

(1.19)    $iM(v) = \wedge_v - \iota_v$ 

defines a linear map from $V$ into $\mathrm{End}(\Lambda^*_{\mathbb{C}} V)$ which extends to an algebra 
homomorphism 

    $M : \mathrm{Cl}(V, g) \longrightarrow \mathrm{End}(\Lambda^*_{\mathbb{C}} V)$. 

Given $\wedge_v \wedge_w = -\wedge_w \wedge_v$ and its analogue for $\iota$, the anticommutation relation 

(1.20)    $M(v)M(w) + M(w)M(v) = 2\langle v, w \rangle I$ 

follows from the identity 

(1.21)    $\wedge_v \iota_w + \iota_w \wedge_v = \langle v, w \rangle I$. 

In this context we note the role that (1.21) played as the algebraic identity behind 
Cartan’s formula for the Lie derivative of a differential form: 

(1.22)    $\mathcal{L}_X \alpha = d(\alpha \rfloor X) + (d\alpha) \rfloor X$; 

cf. Chap. 1, Proposition 13.1, and especially (13.51). 
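
The identities (1.20) and (1.21) lend themselves to a direct matrix check. The sketch below assumes $V = \mathbb{R}^n$ with its standard inner product and indexes the basis $e_{i_1} \wedge \cdots \wedge e_{i_k}$ (with $i_1 < \cdots < i_k$) of $\Lambda^* V$ by bitmasks; the function names are hypothetical and the encoding is just one convenient choice.

```python
import numpy as np


def wedge_basis(j, n):
    """Matrix of e_j ^ (.) on Lambda^* R^n, basis vectors e_S indexed by bitmasks S."""
    dim = 1 << n
    A = np.zeros((dim, dim))
    for S in range(dim):
        if S & (1 << j):
            continue  # e_j ^ e_S = 0 when j already occurs in S
        # Moving e_j into ascending position costs one sign per index of S below j.
        sign = (-1) ** bin(S & ((1 << j) - 1)).count("1")
        A[S | (1 << j), S] = sign
    return A


def wedge(v):
    """Exterior multiplication by v, as in (1.17)."""
    n = len(v)
    return sum(v[j] * wedge_basis(j, n) for j in range(n))


def interior(v):
    """Interior product: the adjoint of wedge(v) in the orthonormal basis e_S, as in (1.18)."""
    return wedge(v).T


def M(v):
    """The map defined by (1.19): i M(v) = wedge(v) - interior(v)."""
    return -1j * (wedge(v) - interior(v))


if __name__ == "__main__":
    v = np.array([1.0, -2.0, 0.5])
    w = np.array([0.3, 1.0, 2.0])
    I = np.eye(8)
    print(np.allclose(wedge(v) @ interior(w) + interior(w) @ wedge(v), np.dot(v, w) * I))
    print(np.allclose(M(v) @ M(w) + M(w) @ M(v), 2 * np.dot(v, w) * I))
```

The first check is (1.21) and the second is (1.20); `M(v)` is also Hermitian, matching (1.12).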

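Another Dirac-type operator arises from (1.15) as follows. Suppose $\dim M = 
n = 2k$ is even. Recall from Chap. 5, §8, that $d^* = \delta$ is given in terms of the 
Hodge star operator on $\Lambda^j M$ by 

(1.23)    $d^* = (-1)^{j(n-j)+j} * d * \;=\; * d * \quad$ if $n = 2k$. 

Also recall that, on $\Lambda^j M$, 

(1.24)    $*^2 = (-1)^{j(n-j)} = (-1)^j \quad$ if $n = 2k$. 

Now, on the complexification $\Lambda^*_{\mathbb{C}} M$ of the real vector bundle $\Lambda^* M$, define 

(1.25)    $\alpha : \Lambda^j_{\mathbb{C}} M \longrightarrow \Lambda^{n-j}_{\mathbb{C}} M$ 

by 

(1.26)    $\alpha = i^{j(j-1)+k} *$ on $\Lambda^j_{\mathbb{C}} M$. 

It follows that 

(1.27)    $\alpha^2 = 1$ 

and 

(1.28)    $\alpha(d + \delta) = -(d + \delta)\alpha$. 

Thus we can write 

(1.29)    $\Lambda^*_{\mathbb{C}} M = \Lambda^+ M \oplus \Lambda^- M$, with $\alpha = \pm I$ on $\Lambda^{\pm} M$, 

and we have 

(1.30)    $D_H^{\pm} = d + \delta : C^\infty(M, \Lambda^{\pm}) \longrightarrow C^\infty(M, \Lambda^{\mp})$. 

Thus $D_H^+$ is an operator of Dirac type, with adjoint $D_H^-$. This operator is called 
the Hirzebruch signature operator, and its index is called the Hirzebruch signature 
of $M$. 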
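
The identity (1.27) can be confirmed from (1.24) and (1.26) by tracking powers of $i$. The following short computation is a sketch supplied for the reader and is not taken from the text; it uses only that $\alpha$ maps $\Lambda^j_{\mathbb{C}} M$ to $\Lambda^{n-j}_{\mathbb{C}} M$ and that $n = 2k$.

```latex
\begin{aligned}
\alpha^2\big|_{\Lambda^j_{\mathbb{C}} M}
  &= i^{(n-j)(n-j-1)+k}\; i^{j(j-1)+k}\; *^2 \\
  &= i^{\,2j^2 - 4kj + 4k^2}\,(-1)^j \qquad (n = 2k,\ \text{using } (1.24)) \\
  &= (-1)^{j^2}\,(-1)^j \;=\; 1 ,
\end{aligned}
```

since $j^2$ and $j$ have the same parity.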

Other examples of operators of Dirac type will be considered in the following 
sections. 




Both of the examples just discussed give rise to Hermitian Clifford modules. 
We now show conversely that generally such modules produce operators of Dirac 
type. More precisely, if $M$ is a Riemannian manifold, $T_x^*M$ has an induced inner 
product, giving rise to a bundle $\mathrm{Cl}(M) \to M$ of Clifford algebras. We suppose 
$E \to M$ is a Hermitian vector bundle such that each fiber is a Hermitian 
$\mathrm{Cl}_x(M)$-module (in a smooth fashion). Let $E \to M$ have a connection $\nabla$, so 

(1.31)    $\nabla : C^\infty(M, E) \longrightarrow C^\infty(M, T^* \otimes E)$. 

Now if $E_x$ is a $\mathrm{Cl}_x(M)$-module, the inclusion $T_x^* \subset \mathrm{Cl}_x$ gives rise to a linear 
map 

(1.32)    $m : C^\infty(M, T^* \otimes E) \longrightarrow C^\infty(M, E)$, 

called “Clifford multiplication.” We compose these two operators; set 

(1.33)    $D = i\, m \circ \nabla : C^\infty(M, E) \longrightarrow C^\infty(M, E)$. 

We see that, for $v \in E_x$, 

(1.34)    $\sigma_D(x, \xi)v = m(\xi \otimes v) = \xi \cdot v$, 

so $\sigma_D(x, \xi)$ is $|\xi|$ times an isometry on $E_x$. Hence $D$ is of Dirac type. 

If $U$ is an open subset of $M$, on which we have an orthonormal frame $\{e_j\}$ 
of smooth vector fields, with dual orthonormal frame $\{v_j\}$ of 1-forms, then, for a 
section $\varphi$ of $E$, 

(1.35)    $D\varphi = i \sum_j v_j \cdot \nabla_{e_j} \varphi$ on $U$. 

Note that $\sigma_D(x, \xi)^* = \sigma_D(x, \xi)$, so $D$ can be made symmetric by altering it at 
most by a zero-order term. Given a little more structure, we have more. We say $\nabla$ 
is a “Clifford connection” on $E$ if $\nabla$ is a metric connection that is also compatible 
with Clifford multiplication, in that 

(1.36)    $\nabla_X(v \cdot \varphi) = (\nabla_X v) \cdot \varphi + v \cdot \nabla_X \varphi$, 

for a vector field $X$, a 1-form $v$, and a section $\varphi$ of $E$. Here, of course, $\nabla_X v$ arises 
from the Levi-Civita connection on $M$. 


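Proposition 1.1. If $\nabla$ is a Clifford connection on $E$, then $D$ is symmetric. 

Proof. Let $\varphi, \psi \in C_0^\infty(M, E)$. We want to show that 

(1.37)    $\int_M \big[ \langle D\varphi, \psi \rangle - \langle \varphi, D\psi \rangle \big]\, dV = 0$. 

We can suppose $\varphi, \psi$ have compact support in a set $U$ on which local orthonormal 
frames $e_j$, $v_j$ as above are given. Define a vector field $X$ on $U$ by 

    $\langle X, v \rangle = \langle \varphi, v \cdot \psi \rangle, \quad v \in \Lambda^1 U$. 

If we show that, pointwise in $U$, 

(1.38)    $i\, \mathrm{div}\, X = \langle D\varphi, \psi \rangle - \langle \varphi, D\psi \rangle$, 

then (1.37) will follow from the divergence theorem. Indeed, starting with 

(1.39)    $\mathrm{div}\, X = \sum_j \langle \nabla_{e_j} X, v_j \rangle$, 

and using the metric and derivation properties of $\nabla$, we have 

    $\mathrm{div}\, X = \sum_j \big[ e_j \cdot \langle X, v_j \rangle - \langle X, \nabla_{e_j} v_j \rangle \big]$ 
    $\phantom{\mathrm{div}\, X} = \sum_j \big[ e_j \cdot \langle \varphi, v_j \cdot \psi \rangle - \langle \varphi, (\nabla_{e_j} v_j) \cdot \psi \rangle \big]$. 

Looking at the last quantity, we expand the first part into a sum of three terms, 
one of which cancels the last part, and obtain 

(1.40)    $\mathrm{div}\, X = \sum_j \big[ \langle \nabla_{e_j} \varphi, v_j \cdot \psi \rangle + \langle \varphi, v_j \cdot \nabla_{e_j} \psi \rangle \big]$, 

which, since Clifford multiplication by $v_j$ is self-adjoint and $D$ is given by (1.35), 
gives (1.38) and completes the proof. 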


If $E = E_0 \oplus E_1$ is a graded Hermitian $\mathrm{Cl}(M)$-module, if $E_0$ and $E_1$ are each 
provided with metric connections, and if (1.36) holds, then the construction above 
gives an operator of Dirac type, of the form (1.3). 

The examples in (1.15) and (1.30) described above can be obtained from 
Hermitian Clifford modules via Clifford connections. The Clifford module is 
$\Lambda^*_{\mathbb{C}} M \to M$, with natural inner product on each factor $\Lambda^j M$ and $\mathrm{Cl}(M)$-module 
structure given by (1.19). The connection is the natural connection on $\Lambda^* M$, 
extending that on $T^*M$, so that the derivation identity 

(1.41)    $\nabla_X(\varphi \wedge \psi) = (\nabla_X \varphi) \wedge \psi + \varphi \wedge (\nabla_X \psi)$ 

holds for a $j$-form $\varphi$ and a $k$-form $\psi$. In this case it is routine to verify the 
compatibility condition (1.36) and to see that the construction (1.33) gives rise to the 
operator $d + d^*$ on differential forms. 

We remark that it is common to use Clifford algebras associated to 
negative-definite forms rather than positive-definite ones. The two types of algebras are 
simply related. If a linear map $\vartheta : V \to \mathrm{End}(E)$ extends to an algebra 
homomorphism $\mathrm{Cl}(V, g) \to \mathrm{End}(E)$, then $i\vartheta$ extends to an algebra homomorphism 
$\mathrm{Cl}(V, -g) \to \mathrm{End}(E)$. If one uses a negative form, the condition (1.12) that $E$ 
be a Hermitian Clifford module should be changed to $\vartheta(v) = -\vartheta(v)^*$, $v \in V$. 
In such a case, we should drop the factor of $i$ in (1.33) to associate the Dirac-type 
operator $D$ to a $\mathrm{Cl}(M)$-module $E$. In fact, getting rid of this factor of $i$ in (1.33) 
and (1.35) is perhaps the principal reason some people use the negative-definite 
quadratic form to construct Clifford algebras. 


Exercises 


1. Let $E$ be a $\mathrm{Cl}(M)$-module with connection $\nabla$. If $\varphi$ is a section of $E$ and $f$ is a scalar 
function, show that 

    $D(f\varphi) = f\, D\varphi + i(df) \cdot \varphi$, 

where the last term involves a Clifford multiplication. 
2. If $\nabla$ is a Clifford connection on $E$ and $u$ is a 1-form, show that 

    $D(u \cdot \varphi) = -u \cdot D\varphi + 2i \nabla_U \varphi + (\mathcal{D}u) \cdot \varphi$, 

where $U$ is the vector field corresponding to $u$ via the metric tensor on $M$, and 

    $\mathcal{D} : C^\infty(M, \Lambda^1) \longrightarrow C^\infty(M, \mathrm{Cl})$ 

is given by 

    $\mathcal{D}u = i \sum_j v_j \cdot \nabla_{e_j} u$, 

with respect to local dual orthonormal frames $e_j$, $v_j$, and $\nabla$ arising from the Levi-Civita 
connection. 
3. Show that $\mathcal{D}(df) = i \Delta f$. 
Note: Compare with Exercise 6 of §2. 
4. If $D$ arises from a Clifford connection on $E$, show that 

    $D^2(f\varphi) = f\, D^2\varphi - 2\nabla_{\mathrm{grad} f}\, \varphi - (\Delta f)\varphi$. 


2. Clifford algebras 


In this section we discuss some further results about the structure of Clifford 
algebras, which were defined in §1. 

First we note that, by construction, $\mathrm{Cl}(V, g)$ has the following universal 
property. Let $A_0$ be any associative algebra over $\mathbb{R}$, with unit, containing $V$ as a linear 
subset, generated by $V$, such that the anticommutation relation (1.8) holds in $A_0$, 
for all $v, w \in V$; that is, $vw + wv = 2\langle v, w \rangle \cdot 1$ in $A_0$. Then there is a natural 
surjective homomorphism 

(2.1)    $\alpha : \mathrm{Cl}(V, g) \longrightarrow A_0$. 

If $\{e_1, \dots, e_n\}$ is a basis of $V$, any element of $\mathrm{Cl}(V, g)$ can be written as a 
polynomial in the $e_j$. Since $e_j e_k = -e_k e_j + 2\langle e_j, e_k \rangle \cdot 1$ and in particular 
$e_j^2 = \langle e_j, e_j \rangle \cdot 1$, we can, starting with terms of highest order, rearrange each monomial 
in such a polynomial so the $e_j$ appear with $j$ in ascending order, and no exponent 
greater than 1 occurs on any $e_j$. In other words, each element $w \in \mathrm{Cl}(V, g)$ can 
be written in the form 

(2.2)    $w = \sum_{i_\nu = 0 \text{ or } 1} a_{i_1 \cdots i_n}\, e_1^{i_1} \cdots e_n^{i_n}$, 

with real coefficients $a_{i_1 \cdots i_n}$. 

Denote by $\mathcal{A}$ the set of formal expressions of the form (2.2), a real vector space 
of dimension $2^n$; we have a natural inclusion $V \subset \mathcal{A}$. We can define a “product” 
$\mathcal{A} \otimes \mathcal{A} \to \mathcal{A}$ in which a product of monomials 
$(e_1^{i_1} \cdots e_n^{i_n}) \cdot (e_1^{j_1} \cdots e_n^{j_n})$, with each 
$i_\nu$ and each $j_\mu$ equal to either 0 or 1, is a linear combination of monomials of such 
a form, by pushing each $e_\mu^{j_\mu}$ past the $e_\nu^{i_\nu}$ for $\nu > \mu$, invoking the anticommutation 
relations. It is routine to verify that this gives $\mathcal{A}$ the structure of an associative 
algebra, generated by $V$. The universal property mentioned above implies that $\mathcal{A}$ 
is isomorphic to $\mathrm{Cl}(V, g)$. Thus each $w \in \mathrm{Cl}(V, g)$ has a unique representation in 
the form (2.2), and $\dim \mathrm{Cl}(V, g) = 2^n$ if $\dim V = n$. 
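
The “pushing past” rule for multiplying ordered monomials can be made concrete. The following is a minimal sketch, assuming an orthonormal basis for a positive-definite $g$ (so $e_j^2 = 1$ and $e_j e_k = -e_k e_j$ for $j \neq k$); a monomial $e_1^{i_1} \cdots e_n^{i_n}$ is encoded as a bitmask and a general element as a dictionary from bitmasks to coefficients. The encoding and all function names are illustrative assumptions.

```python
from itertools import product


def monomial_product(A, B):
    """Product of ordered monomials e_A * e_B for an orthonormal basis with e_j^2 = 1.

    Returns (sign, mask): each pair (a in A, b in B) with a > b costs one
    transposition, and repeated generators cancel via e_j^2 = 1.
    """
    swaps = 0
    shifted = A >> 1
    while shifted:
        swaps += bin(shifted & B).count("1")
        shifted >>= 1
    return (-1) ** swaps, A ^ B


def multiply(x, y):
    """Bilinear extension of monomial_product to elements {mask: coefficient}."""
    out = {}
    for (A, a), (B, b) in product(x.items(), y.items()):
        sign, C = monomial_product(A, B)
        out[C] = out.get(C, 0) + sign * a * b
    return {mask: c for mask, c in out.items() if c != 0}


def basis_monomials(n):
    """All 2^n ordered monomials in e_1, ..., e_n, as bitmasks."""
    return list(range(1 << n))


def is_associative(n):
    """Check (e_A e_B) e_C = e_A (e_B e_C) on all triples of basis monomials."""
    for A, B, C in product(basis_monomials(n), repeat=3):
        s1, m1 = monomial_product(A, B)
        s2, m2 = monomial_product(m1, C)
        t1, q1 = monomial_product(B, C)
        t2, q2 = monomial_product(A, q1)
        if (s1 * s2, m2) != (t1 * t2, q2):
            return False
    return True


if __name__ == "__main__":
    e1, e2 = {0b01: 1}, {0b10: 1}
    print(multiply(e1, e2), multiply(e2, e1), multiply(e1, e1))
    print(len(basis_monomials(4)), is_associative(3))
```

In this encoding `multiply(e1, e2)` and `multiply(e2, e1)` differ by a sign and `multiply(e1, e1)` is the unit, which are the relations (1.8) for an orthonormal pair.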

Recall from §1 the algebra homomorphism $M : \mathrm{Cl}(V, g) \to \mathrm{End}(\Lambda^* V)$, 
defined there provided $g$ is positive-definite (which can be extended to include 
general $g$). Then, we can define a linear map 

(2.3)    $\tilde{M} : \mathrm{Cl}(V, g) \longrightarrow \Lambda^* V; \quad \tilde{M}(w) = M(w)(1)$, 


for $w \in \mathrm{Cl}(V, g)$. Note that if $v \in V \subset \mathrm{Cl}(V, g)$, then $\tilde{M}(v) = v$. Comparing 
the anticommutation relations of $\mathrm{Cl}(V, g)$ with those of $\Lambda^* V$, we see that if 
$w \in \mathrm{Cl}(V, g)$ is one of the monomials in (2.2), say $w = e_1^{j_1} \cdots e_n^{j_n}$, all $j_\nu$ either 0 or 1, 
$k = j_1 + \cdots + j_n$, then 

(2.4)    $\tilde{M}(e_1^{j_1} \cdots e_n^{j_n}) - e_1^{j_1} \wedge \cdots \wedge e_n^{j_n} \in \bigoplus_{\ell < k} \Lambda^{\ell} V$. 

It follows easily that (2.3) is an isomorphism of vector spaces. This observation 
also shows that the representation of an element of $\mathrm{Cl}(V, g)$ in the form (2.2) is 
unique. If $g$ is positive-definite and $\{e_j\}$ is an orthonormal basis of $V$, the difference 
in (2.4) vanishes. 

In the case $g = 0$, the anticommutation relation (1.8) becomes $vw = -wv$, for 
$v, w \in V$, and we have the exterior algebra 

    $\mathrm{Cl}(V, 0) = \Lambda^* V$. 

Through the remainder of this section we will restrict attention to the case where 
$g$ is positive-definite. We denote $\langle v, v \rangle$ by $|v|^2$. For $V = \mathbb{R}^n$ with $g$ its standard 
Euclidean inner product, we denote $\mathrm{Cl}(V, g)$ by $\mathrm{Cl}(n)$. 

It is useful to consider the complexified Clifford algebra 

    $\mathbb{C}l(n) = \mathbb{C} \otimes \mathrm{Cl}(n)$, 

as it has a relatively simple structure, specified as follows. 


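Proposition 2.1. There are isomorphisms of complex algebras 

(2.5)    $\mathbb{C}l(1) \approx \mathbb{C} \oplus \mathbb{C}, \quad \mathbb{C}l(2) \approx \mathrm{End}(\mathbb{C}^2)$, 

and 

(2.6)    $\mathbb{C}l(n + 2) \approx \mathbb{C}l(n) \otimes \mathbb{C}l(2)$; 

hence, with $\kappa = 2^k$, 

(2.7)    $\mathbb{C}l(2k) \approx \mathrm{End}(\mathbb{C}^\kappa), \quad \mathbb{C}l(2k + 1) \approx \mathrm{End}(\mathbb{C}^\kappa) \oplus \mathrm{End}(\mathbb{C}^\kappa)$. 

Proof. The isomorphisms (2.5) are simple exercises. To prove (2.6), imbed $\mathbb{R}^{n+2}$ 
into $\mathbb{C}l(n) \otimes \mathbb{C}l(2)$ by picking an orthonormal basis $\{e_1, \dots, e_{n+2}\}$ and taking 

(2.8)    $e_j \mapsto i\, e_j \otimes e_{n+1} e_{n+2}$, for $1 \le j \le n$, 
         $e_j \mapsto 1 \otimes e_j$, for $j = n + 1$ or $n + 2$. 

Then the universal property of $\mathbb{C}l(n + 2)$ leads to the isomorphism (2.6). Given 
(2.5) and (2.6), (2.7) follows by induction. 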
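
To apply the universal property, one needs the images in (2.8) to satisfy the relations (1.8) for an orthonormal basis. The verification below is a sketch supplied here, not taken from the text; it writes $f_j$ for the image of $e_j$ and $\omega = e_{n+1} e_{n+2}$ (both names are introduced only for this computation), and uses the ordinary tensor product of algebras, $(a \otimes b)(a' \otimes b') = aa' \otimes bb'$.

```latex
\begin{aligned}
\omega^2 &= e_{n+1}e_{n+2}e_{n+1}e_{n+2} = -e_{n+1}^2 e_{n+2}^2 = -1,
  \qquad \omega e_m = -e_m \omega \quad (m = n+1,\ n+2), \\
f_j^2 &= i^2\, e_j^2 \otimes \omega^2 = (-1)(1)\otimes(-1) = 1 \otimes 1
  \qquad (1 \le j \le n), \\
f_j f_l + f_l f_j &= i^2\,(e_j e_l + e_l e_j) \otimes \omega^2 = 0
  \qquad (1 \le j \ne l \le n), \\
f_j f_m + f_m f_j &= i\, e_j \otimes (\omega e_m + e_m \omega) = 0
  \qquad (1 \le j \le n,\ m = n+1,\ n+2), \\
f_m f_{m'} + f_{m'} f_m &= 1 \otimes (e_m e_{m'} + e_{m'} e_m) = 2\delta_{mm'}\, 1 \otimes 1
  \qquad (m, m' \in \{n+1,\ n+2\}).
\end{aligned}
```

The resulting homomorphism $\mathbb{C}l(n+2) \to \mathbb{C}l(n) \otimes \mathbb{C}l(2)$ is onto, since its image contains the generators of both factors, and both sides have complex dimension $2^{n+2}$.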


While, parallel to (2.5), one has $\mathrm{Cl}(1) = \mathbb{R} \oplus \mathbb{R}$ and $\mathrm{Cl}(2) = \mathrm{End}(\mathbb{R}^2)$, other 
algebras $\mathrm{Cl}(n)$ are more complicated than their complex analogues; in place of 
(2.6) one has a form of periodicity with period 8. We refer to [LM] for more on 
this. 

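It follows from Proposition 2.1 that $\mathbb{C}^{2^k}$ has the structure of an irreducible 
$\mathbb{C}l(2k)$-module, though making the identification (2.7) explicit involves some 
untangling, in a way that depends strongly on a choice of basis. It is worthwhile to 
note the following explicit, invariant construction, for $V$, a vector space of real 
dimension $2k$, with a positive inner product $\langle\ ,\ \rangle$, endowed with one other piece 
of structure, namely a complex structure $J$. Assume $J$ is an isometry for $\langle\ ,\ \rangle$. 
Denote the complex vector space $(V, J)$ by $\mathcal{V}$, which has complex dimension $k$. 
On $\mathcal{V}$ we have a positive Hermitian form 

(2.9)    $(u, v) = \langle u, v \rangle + i \langle u, Jv \rangle$. 

Form the complex exterior algebra 

(2.10)    $\Lambda^*_{\mathbb{C}} \mathcal{V} = \bigoplus_{j=0}^{k} \Lambda^j_{\mathbb{C}} \mathcal{V}$, 

with its natural Hermitian form. For $v \in \mathcal{V}$, one has the exterior product 
$v \wedge : \Lambda^j_{\mathbb{C}} \mathcal{V} \to \Lambda^{j+1}_{\mathbb{C}} \mathcal{V}$; denote its adjoint, the interior product, by 
$j_v : \Lambda^{j+1}_{\mathbb{C}} \mathcal{V} \to \Lambda^j_{\mathbb{C}} \mathcal{V}$. Set 

(2.11)    $i\mu(v)\varphi = v \wedge \varphi - j_v \varphi, \quad v \in \mathcal{V}, \ \varphi \in \Lambda^*_{\mathbb{C}} \mathcal{V}$. 

Note that $v \wedge \varphi$ is $\mathbb{C}$-linear in $v$ and $j_v \varphi$ is conjugate linear in $v$, so $\mu(v)$ is only 
$\mathbb{R}$-linear in $v$. As in (1.20), we obtain 

(2.12)    $\mu(u)\mu(v) + \mu(v)\mu(u) = 2\langle u, v \rangle \cdot I$, 

so $\mu : V \to \mathrm{End}(\Lambda^*_{\mathbb{C}} \mathcal{V})$ extends to a homomorphism of algebras 

(2.13)    $\mu : \mathrm{Cl}(V, g) \longrightarrow \mathrm{End}(\Lambda^*_{\mathbb{C}} \mathcal{V})$, 

hence to a homomorphism of $\mathbb{C}$-algebras 

(2.14)    $\mu : \mathbb{C}l(V, g) \longrightarrow \mathrm{End}(\Lambda^*_{\mathbb{C}} \mathcal{V})$, 

where $\mathbb{C}l(V, g)$ denotes $\mathbb{C} \otimes \mathrm{Cl}(V, g)$. 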


Proposition 2.2. The homomorphism (2.14) is an isomorphism when $V$ is a real 
vector space of dimension $2k$, with complex structure $J$, $\mathcal{V}$ the associated complex 
vector space. 

Proof. We already know that both $\mathbb{C}l(V, g)$ and $\mathrm{End}(\Lambda^*_{\mathbb{C}} \mathcal{V})$ are isomorphic to 
$\mathrm{End}(\mathbb{C}^\kappa)$, $\kappa = 2^k$. We will make use of the algebraic fact that this is a complex 
algebra with no proper two-sided ideals. Now the kernel of $\mu$ in (2.14) would 
have to be a two-sided ideal, so either $\mu = 0$ or $\mu$ is an isomorphism. But for 
$v \in V$, $\mu(v) \cdot 1 = v$, so $\mu \neq 0$; thus $\mu$ is an isomorphism. 
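
Proposition 2.2 can be illustrated numerically for small $k$. The sketch below assumes $\mathcal{V} = \mathbb{C}^k$ with standard basis $f_1, \dots, f_k$, identifies the real orthonormal basis of $V$ as $e_j = f_j$, $Je_j = i f_j$, realizes $\mu$ from (2.11) by matrices on $\Lambda^*_{\mathbb{C}} \mathbb{C}^k$, and checks that products of the $\mu(e_j)$ span all $2^k \times 2^k$ matrices. The bitmask encoding and the function names are hypothetical choices made for this example.

```python
import numpy as np
from itertools import combinations


def creation(j, k):
    """Matrix of f_j ^ (.) on Lambda^*_C C^k (dimension 2^k), basis indexed by bitmasks."""
    dim = 1 << k
    A = np.zeros((dim, dim), dtype=complex)
    for S in range(dim):
        if not S & (1 << j):
            sign = (-1) ** bin(S & ((1 << j) - 1)).count("1")
            A[S | (1 << j), S] = sign
    return A


def mu(z):
    """mu(v) = -i (v ^ . - j_v) for v ~ z in C^k, following (2.11).

    The exterior product is C-linear in z; its adjoint j_v is conjugate-linear.
    """
    k = len(z)
    ext = sum(z[j] * creation(j, k) for j in range(k))
    return -1j * (ext - ext.conj().T)


def generators(k):
    """mu(e_1), ..., mu(e_2k) for the real basis e_j ~ f_j, J e_j ~ i f_j."""
    basis = np.eye(k, dtype=complex)
    return [mu(b) for b in basis] + [mu(1j * b) for b in basis]


def span_rank(k):
    """Rank of the span of all ordered products of distinct generators."""
    gens = generators(k)
    dim = 1 << k
    rows = []
    for r in range(2 * k + 1):
        for idx in combinations(range(2 * k), r):
            P = np.eye(dim, dtype=complex)
            for i in idx:
                P = P @ gens[i]
            rows.append(P.ravel())
    return np.linalg.matrix_rank(np.array(rows))


if __name__ == "__main__":
    for k in (1, 2):
        print(k, span_rank(k), (1 << k) ** 2)
```

The $2^{2k}$ products tested correspond to the monomials (2.2); full rank $(2^k)^2$ means the image of (2.14) is all of $\mathrm{End}(\Lambda^*_{\mathbb{C}} \mathcal{V})$ in these cases.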


We next mention that a grading can be put on $\mathrm{Cl}(V, g)$. Namely, let $\mathrm{Cl}^0(V, g)$ 
denote the set of sums of the form (2.2) with $i_1 + \cdots + i_n$ even, and let $\mathrm{Cl}^1(V, g)$ 
denote the set of sums of that form with $i_1 + \cdots + i_n$ odd. It is easy to see that 
this specification is independent of the choice of basis $\{e_j\}$. Also we clearly have 

(2.15)    $u \in \mathrm{Cl}^j(V, g), \ w \in \mathrm{Cl}^k(V, g) \Longrightarrow uw \in \mathrm{Cl}^{j+k}(V, g)$, 

where $j$ and $k$ are each 0 or 1, and we compute $j + k$ mod 2. If $(V, g)$ is $\mathbb{R}^n$ with 
its standard Euclidean metric, we denote $\mathrm{Cl}^j(V, g)$ by $\mathrm{Cl}^j(n)$, $j = 0$ or 1. 
We note that there is an isomorphism 

(2.16)    $j : \mathrm{Cl}(2k - 1) \longrightarrow \mathrm{Cl}^0(2k)$ 

uniquely specified by the property that, for $v \in \mathbb{R}^{2k-1}$, $j(v) = v e_{2k}$, where 
$\{e_1, \dots, e_{2k-1}\}$ denotes the standard basis of $\mathbb{R}^{2k-1}$, with $e_{2k}$ added to form a 
basis of $\mathbb{R}^{2k}$. This will be useful in the next section for constructing spinors on 
odd-dimensional spaces. 

We can construct a finer grading on $\mathrm{Cl}(V, g)$. Namely, set 

(2.17)    $\mathrm{Cl}^{[k]}(V, g)$ = set of sums of the form (2.2), with $i_1 + \cdots + i_n = k$. 


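Thus $\mathrm{Cl}^{[0]}(V, g)$ is the set of scalars and $\mathrm{Cl}^{[1]}(V, g)$ is $V$. If we insist that $\{e_j\}$ 
be an orthonormal basis of $V$, then $\mathrm{Cl}^{[k]}(V, g)$ is invariantly defined, for all $k$. In 
fact, using the isomorphism (2.3), we have 

(2.18)    $\mathrm{Cl}^{[k]}(V, g) = \tilde{M}^{-1}(\Lambda^k V)$. 

Note that 

    $\mathrm{Cl}^0(V, g) = \bigoplus_{k \text{ even}} \mathrm{Cl}^{[k]}(V, g)$ and $\mathrm{Cl}^1(V, g) = \bigoplus_{k \text{ odd}} \mathrm{Cl}^{[k]}(V, g)$. 

Let us also note that $\mathrm{Cl}^{[2]}(V, g)$ has a natural Lie algebra structure. In fact, if 
$\{e_j\}$ is orthonormal, 

(2.19)    $[e_i e_j, e_k e_\ell] = e_i e_j e_k e_\ell - e_k e_\ell e_i e_j$ 
          $\phantom{[e_i e_j, e_k e_\ell]} = 2(\delta_{jk} e_i e_\ell - \delta_{j\ell} e_i e_k + \delta_{ik} e_\ell e_j - \delta_{\ell i} e_k e_j)$. 

The construction (2.17) makes $\mathrm{Cl}(V, g)$ a graded vector space, but not a graded 
algebra, since typically $\mathrm{Cl}^{[j]}(V, g) \cdot \mathrm{Cl}^{[k]}(V, g)$ is not contained in $\mathrm{Cl}^{[j+k]}(V, g)$, 
as (2.19) illustrates. We can set 

(2.20)    $\mathrm{Cl}^{(k)}(V, g) = \bigoplus \{ \mathrm{Cl}^{[j]}(V, g) : j \le k, \ j = k \bmod 2 \}$, 

and then $\mathrm{Cl}^{(j)}(V, g) \cdot \mathrm{Cl}^{(k)}(V, g) \subset \mathrm{Cl}^{(j+k)}(V, g)$. As $k$ ranges over the even or 
the odd integers, the spaces (2.20) provide filtrations of $\mathrm{Cl}^0(V, g)$ and $\mathrm{Cl}^1(V, g)$. 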
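
The commutator formula (2.19) can be tested in a concrete matrix model. The following sketch assumes $n = 4$ and uses Kronecker products of Pauli matrices as an illustrative realization of four anticommuting generators with $e_j^2 = 1$; neither this realization nor the helper names come from the text.

```python
import numpy as np
from itertools import product

sx = np.array([[0, 1], [1, 0]], dtype=complex)
sy = np.array([[0, -1j], [1j, 0]], dtype=complex)
sz = np.array([[1, 0], [0, -1]], dtype=complex)
I2 = np.eye(2, dtype=complex)

# Four Hermitian 4x4 matrices with E_j^2 = I and E_j E_k = -E_k E_j for j != k.
E = [np.kron(sx, I2), np.kron(sy, I2), np.kron(sz, sx), np.kron(sz, sy)]


def delta(a, b):
    """Kronecker delta."""
    return 1.0 if a == b else 0.0


def commutator_identity_holds(i, j, k, l):
    """Compare both sides of (2.19) for the index quadruple (i, j, k, l)."""
    lhs = E[i] @ E[j] @ E[k] @ E[l] - E[k] @ E[l] @ E[i] @ E[j]
    rhs = 2 * (
        delta(j, k) * (E[i] @ E[l])
        - delta(j, l) * (E[i] @ E[k])
        + delta(i, k) * (E[l] @ E[j])
        - delta(l, i) * (E[k] @ E[j])
    )
    return np.allclose(lhs, rhs)


all_ok = all(commutator_identity_holds(*idx) for idx in product(range(4), repeat=4))

if __name__ == "__main__":
    print(all_ok)
```

The right side of (2.19) is a combination of products $e_a e_b$, so for $i \neq j$ and $k \neq \ell$ the commutator of two elements of $\mathrm{Cl}^{[2]}$ stays in $\mathrm{Cl}^{[2]}$ (using $e_a e_b = -e_b e_a$ when $a \neq b$ and cancellation of the scalar terms).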


Exercises 

1. Let $V$ have an oriented orthonormal basis $\{e_1, \dots, e_n\}$. Set 

(2.21)    $\nu = e_1 \cdots e_n \in \mathrm{Cl}(V, g)$. 

Show that $\nu$ is independent of the choice of such a basis. 
Note: $\tilde{M}(\nu) = e_1 \wedge \cdots \wedge e_n \in \Lambda^n V$, with $\tilde{M}$ as in (2.3). 
2. Show that $\nu^2 = (-1)^{n(n-1)/2}$. 
3. Show that, for all $u \in V$, $\nu u = (-1)^{n-1} u \nu$. 
4. With $\mu$ as in (2.11)-(2.14), show that 

    $\mu(\nu)^* = (-1)^{n(n-1)/2} \mu(\nu)$ and $\mu(\nu)^* \mu(\nu) = I$. 

5. Show that 

    $\tilde{M}(\nu w) = c_{nk} * \tilde{M}(w)$, 

for $w \in \mathrm{Cl}^{[k]}(V, g)$, where $* : \Lambda^k V \to \Lambda^{n-k} V$ is the Hodge star operator. Find the 
constants $c_{nk}$. 
6. Let $\mathcal{D} : C^\infty(M, T^*) \to C^\infty(M, \mathrm{Cl})$ be as in Exercise 2 of §1, namely, 

    $\mathcal{D}v = i \sum_j v_j \cdot \nabla_{e_j} v$, 

where $\{e_j\}$ is a local orthonormal frame of vector fields, $\{v_j\}$ the dual frame. Show 
that 

    $\tilde{M}(\mathcal{D}v) = -i(d + d^*)v$. 

7. Show that $\mathrm{End}(\mathbb{C}^m)$ has no proper two-sided ideals. (Hint: Suppose $M_0 \neq 0$ belongs 
to such an ideal $\mathcal{I}$ and $v_0 \neq 0$ belongs to the range of $M_0$. Show that every $v \in \mathbb{C}^m$ 
belongs to the range of some $M \in \mathcal{I}$, and hence that every one-dimensional projection 
belongs to $\mathcal{I}$.) 


3. Spinors 


We define the spinor groups $\mathrm{Pin}(V, g)$ and $\mathrm{Spin}(V, g)$, for a vector space $V$ with 
a positive-definite quadratic form $g$; set $|v|^2 = g(v, v) = \langle v, v \rangle$. We set 

(3.1)    $\mathrm{Pin}(V, g) = \{ v_1 \cdots v_k \in \mathrm{Cl}(V, g) : v_j \in V, \ |v_j| = 1 \}$, 

with the induced multiplication. Since $(v_1 \cdots v_k)(v_k \cdots v_1) = 1$, it follows that 
$\mathrm{Pin}(V, g)$ is a group. We can define an action of $\mathrm{Pin}(V, g)$ on $V$ as follows. If 
$u \in V$ and $x \in V$, then $ux + xu = 2\langle x, u \rangle \cdot 1$ implies 

(3.2)    $uxu = -xuu + 2\langle x, u \rangle u = -|u|^2 x + 2\langle x, u \rangle u$. 

If also $y \in V$, 

(3.3)    $\langle uxu, uyu \rangle = |u|^4 \langle x, y \rangle = \langle x, y \rangle$ if $|u| = 1$. 

Thus if $u = v_1 \cdots v_k \in \mathrm{Pin}(V, g)$ and if we define a conjugation on $\mathrm{Cl}(V, g)$ by 

(3.4)    $u^* = v_k \cdots v_1, \quad v_j \in V$, 

it follows that 

(3.5)    $x \mapsto uxu^*, \quad x \in V$, 

is an isometry on $V$ for each $u \in \mathrm{Pin}(V, g)$. It will be more convenient to use 

(3.6)    $u^\# = (-1)^k u^*, \quad u = v_1 \cdots v_k$. 

Then we have a group homomorphism 

(3.7)    $\tau : \mathrm{Pin}(V, g) \longrightarrow O(V, g)$, 

defined by 

(3.8)    $\tau(u)x = uxu^\#, \quad x \in V, \ u \in \mathrm{Pin}(V, g)$. 

Note that if $v \in V$, $|v| = 1$, then, by (3.2), 

(3.9)    $\tau(v)x = x - 2\langle x, v \rangle v$ 

is the reflection across the hyperplane in $V$ orthogonal to $v$. It is easy to show 
that any orthogonal transformation $T \in O(V, g)$ is a product of a finite number of 
such reflections, so the group homomorphism (3.7) is surjective. 
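
The action (3.8) and the reflection formula (3.9) can be tried out in a small matrix model. The sketch below assumes $V = \mathbb{R}^3$ and represents $x \in V$ by $\sum_j x_j \sigma_j$ with the Pauli matrices; this is an illustrative (non-faithful on all of $\mathrm{Cl}(3)$, but injective on $V$) realization chosen here, and the helper names are hypothetical.

```python
import numpy as np

SIGMA = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]


def to_clifford(x):
    """Represent x in R^3 as the matrix sum_j x_j sigma_j."""
    return sum(xj * s for xj, s in zip(x, SIGMA))


def from_clifford(X):
    """Recover the vector from its matrix, using tr(sigma_j sigma_k) = 2 delta_jk."""
    return np.array([0.5 * np.trace(s @ X).real for s in SIGMA])


def tau(vectors, x):
    """Apply tau(u) x = u x u^# for u = v_1 ... v_k, each v_j a unit vector, as in (3.8)."""
    k = len(vectors)
    u = np.eye(2, dtype=complex)
    u_star = np.eye(2, dtype=complex)
    for v in vectors:
        u = u @ to_clifford(v)
        u_star = to_clifford(v) @ u_star  # accumulates v_k ... v_1, i.e. u^* of (3.4)
    return from_clifford((-1) ** k * u @ to_clifford(x) @ u_star)


if __name__ == "__main__":
    v = np.array([1.0, 2.0, 2.0]) / 3.0
    w = np.array([0.0, 1.0, 0.0])
    x = np.array([0.5, -1.0, 4.0])
    print(np.allclose(tau([v], x), x - 2 * np.dot(x, v) * v))   # reflection (3.9)
    R = np.column_stack([tau([v, w], e) for e in np.eye(3)])
    print(np.allclose(R.T @ R, np.eye(3)), np.isclose(np.linalg.det(R), 1.0))
```

A single unit vector acts as the reflection (3.9); a product of two unit vectors acts as a composition of two reflections, hence as an orthogonal map of determinant 1.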


Note that each isometry (3.9) is orientation reversing. Thus, if we define 

(3.10)    $\mathrm{Spin}(V, g) = \{ v_1 \cdots v_k \in \mathrm{Cl}(V, g) : v_j \in V, \ |v_j| = 1, \ k \text{ even} \}$ 
          $\phantom{\mathrm{Spin}(V, g)} = \mathrm{Pin}(V, g) \cap \mathrm{Cl}^0(V, g)$, 

then 

(3.11)    $\tau : \mathrm{Spin}(V, g) \longrightarrow SO(V, g)$ 

and in fact $\mathrm{Spin}(V, g)$ is the inverse image of $SO(V, g)$ under $\tau$ in (3.7). We now 
show that $\tau$ is a 2-fold covering map. 

Proposition 3.1. $\tau$ is a 2-fold covering map. In fact, $\ker \tau = \{\pm 1\}$. 

Proof. Note that $\pm 1 \in \mathrm{Spin}(V, g) \subset \mathrm{Cl}(V, g)$ and $\pm 1$ acts trivially on $V$, via 
(3.8). Now, if $u = v_1 \cdots v_k \in \ker \tau$, $k$ must be even, since $\tau(u)$ must preserve 
orientation, so $u^\# = u^*$. Since $uxu^* = x$ for all $x \in V$, we have $ux = xu$, so 
$xux = |x|^2 u$, $x \in V$. If we pick an orthonormal basis $\{e_1, \dots, e_n\}$ of $V$ and 
write $u \in \ker \tau$ in the form (2.2), each $i_1 + \cdots + i_n$ even, since $e_j u e_j = u$ for 
each $j$, we deduce that, for each $j$, 

    $u = \sum (-1)^{i_j} a_{i_1 \cdots i_n}\, e_1^{i_1} \cdots e_n^{i_n}$ if $u \in \ker \tau$. 

Hence $i_j = 0$ for all $j$ in every term with nonzero coefficient, so $u$ is a scalar; hence $u = \pm 1$. 
We next consider the connectivity properties of $\mathrm{Spin}(V, g)$. 

Proposition 3.2. $\mathrm{Spin}(V, g)$ is the connected 2-fold cover of $SO(V, g)$, provided 
$g$ is positive-definite and $\dim V \ge 2$. 

Proof. It suffices to connect $-1 \in \mathrm{Spin}(V, g)$ to the identity element 1 via a 
continuous curve in $\mathrm{Spin}(V, g)$. In fact, pick orthonormal $e_1, e_2$, and set 

    $\gamma(t) = e_1 \cdot [-(\cos t) e_1 + (\sin t) e_2], \quad 0 \le t \le \pi$. 
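
Expanding the product makes the endpoints of this curve explicit. The lines below are a short check added here, using only $e_1^2 = 1$ and the definition (3.10):

```latex
\gamma(t) = -(\cos t)\, e_1^2 + (\sin t)\, e_1 e_2 = -\cos t + (\sin t)\, e_1 e_2,
\qquad \gamma(0) = -1, \quad \gamma(\pi) = 1 .
```

Each $\gamma(t)$ is a product of the two unit vectors $e_1$ and $-(\cos t) e_1 + (\sin t) e_2$, so the curve lies in $\mathrm{Spin}(V, g)$.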


If $V = \mathbb{R}^n$ with its standard Euclidean inner product $g$, denote $\mathrm{Spin}(V, g)$ by 
$\mathrm{Spin}(n)$. It is a known topological fact that $SO(n)$ has fundamental group $\mathbb{Z}_2$, and 
$\mathrm{Spin}(n)$ is simply connected, for $n \ge 3$. Though we make no use of this result, 
we mention that one route to it is via the “homotopy exact sequence” (see [BTu]) 
for $S^n = SO(n + 1)/SO(n)$. This leads to $\pi_1(SO(n + 1)) \approx \pi_1(SO(n))$ for 
$n \ge 3$. Meanwhile, one sees directly that $SU(2)$ is a double cover of $SO(3)$, and 
it is homeomorphic to $S^3$. 

We next produce representations of $\mathrm{Pin}(V, g)$ and $\mathrm{Spin}(V, g)$, arising from 
the homomorphism (2.13). First assume $V$ has real dimension $2k$, with complex 
structure $J$; let $\mathcal{V} = (V, J)$ be the associated complex vector space, of complex 
dimension $k$, and set 

(3.12)    $S(V, g, J) = \Lambda^*_{\mathbb{C}} \mathcal{V}$, 

with its induced Hermitian metric, arising from the metric (2.9) on $\mathcal{V}$. The 
inclusion $\mathrm{Pin}(V, g) \subset \mathrm{Cl}(V, g) \subset \mathbb{C}l(V, g)$ followed by (2.14) gives the representation 

(3.13)    $\rho : \mathrm{Pin}(V, g) \longrightarrow \mathrm{Aut}(S(V, g, J))$. 

Proposition 3.3. The representation $\rho$ of $\mathrm{Pin}(V, g)$ is irreducible and unitary. 

Proof. Since the $\mathbb{C}$-subalgebra of $\mathbb{C}l(V, g)$ generated by $\mathrm{Pin}(V, g)$ is all of 
$\mathbb{C}l(V, g)$, the irreducibility follows from the fact that $\mu$ in (2.14) is an 
isomorphism. For unitarity, it follows from (2.11) that $\mu(v)$ is self-adjoint for $v \in V$; by 
(2.12), $\mu(v)^2 = |v|^2 I$, so $v \in V$, $|v| = 1$ implies that $\rho(v)$ is unitary, and unitarity 
of $\rho$ on $\mathrm{Pin}(V, g)$ follows. 


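The restriction of $\rho$ to $\mathrm{Spin}(V, g)$ is not irreducible. In fact, set 

(3.14)    $S_+(V, g, J) = \Lambda^{\mathrm{even}}_{\mathbb{C}} \mathcal{V}, \quad S_-(V, g, J) = \Lambda^{\mathrm{odd}}_{\mathbb{C}} \mathcal{V}$. 

Under $\rho$, the action of $\mathrm{Spin}(V, g)$ preserves both $S_+$ and $S_-$. In fact, we have 
(2.14) restricting to 

(3.15)    $\mu : \mathbb{C}l^0(V, g) \longrightarrow \mathrm{End}_{\mathbb{C}}(S_+(V, g, J)) \oplus \mathrm{End}_{\mathbb{C}}(S_-(V, g, J))$, 

this map being an isomorphism. On the other hand, 

(3.16)    $z \in \mathbb{C}l^1(V, g) \Longrightarrow \mu(z) : S_\pm \longrightarrow S_\mp$. 

From (3.15) we get representations 

(3.17)    $D^{\pm}_{1/2} : \mathrm{Spin}(V, g) \longrightarrow \mathrm{Aut}(S_\pm(V, g, J))$, 

which are irreducible and unitary. 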


If $V = \mathbb{R}^{2k}$ with its standard Euclidean metric, standard orthonormal basis 
$e_1, \dots, e_{2k}$, we impose the complex structure $Je_i = e_{i+k}$, $Je_{i+k} = -e_i$, 
$1 \le i \le k$, and set 

(3.18)    $S(2k) = S(\mathbb{R}^{2k}, |\cdot|^2, J), \quad S_\pm(2k) = S_\pm(\mathbb{R}^{2k}, |\cdot|^2, J)$. 

Then (3.17) defines representations 

(3.19)    $D^{\pm}_{1/2} : \mathrm{Spin}(2k) \longrightarrow \mathrm{Aut}(S_\pm(2k))$. 

We now consider the odd dimensional case. If $V = \mathbb{R}^{2k-1}$, we use the 
isomorphism 

(3.20)    $\mathrm{Cl}(2k - 1) \longrightarrow \mathrm{Cl}^0(2k)$ 

produced by the map 

(3.21)    $v \mapsto v e_{2k}, \quad v \in \mathbb{R}^{2k-1}$. 

Then the inclusion $\mathrm{Spin}(2k - 1) \subset \mathrm{Cl}(2k - 1)$ composed with (3.20) gives an 
inclusion 

(3.22)    $\mathrm{Spin}(2k - 1) \hookrightarrow \mathrm{Spin}(2k)$. 

Composing with $D^+_{1/2}$ from (3.19) gives a representation 

(3.23)    $D^+_{1/2} : \mathrm{Spin}(2k - 1) \longrightarrow \mathrm{Aut}\, S_+(2k)$. 

We also have a representation $D^-_{1/2}$ of $\mathrm{Spin}(2k - 1)$ on $S_-(2k)$, but these two 
representations are equivalent. They are intertwined by the map 

(3.24)    $\mu(e_{2k}) : S_+(2k) \longrightarrow S_-(2k)$. 


We now study spinor bundles on an oriented Riemannian manifold $M$, with 
metric tensor $g$. Over $M$ lies the bundle of oriented orthonormal frames, 

(3.25)    $P \longrightarrow M$, 

a principal $SO(n)$-bundle, $n = \dim M$. A spin structure on $M$ is a “lift,” 

(3.26)    $\tilde{P} \longrightarrow M$, 

a principal $\mathrm{Spin}(n)$-bundle, such that $\tilde{P}$ is a double covering of $P$ in such a way 
that the action of $\mathrm{Spin}(n)$ on the fibers of $\tilde{P}$ is compatible with the action of 
$SO(n)$ on the fibers of $P$, via the covering homomorphism $\tau : \mathrm{Spin}(n) \to SO(n)$. 
Endowed with such a spin structure, $M$ is called a spin manifold. There are 
topological obstructions to the existence of a spin structure, which we will not discuss 
here (see [LM]). It turns out that there is a naturally defined element of $H^2(M, \mathbb{Z}_2)$ 
whose vanishing guarantees the existence of a lift, and when such lifts exist, 
equivalence classes of such lifts are parameterized by elements of $H^1(M, \mathbb{Z}_2)$. 
Given a spin structure as in (3.26), spinor bundles are constructed via the 
representations of $\mathrm{Spin}(n)$ described above. Two cases arise, depending on 
whether $n = \dim M$ is even or odd. If $n = 2k$, we form the bundle of spinors 

(3.27)    $S(\tilde{P}) = \tilde{P} \times_\rho S(2k)$, 

where $\rho = D^+_{1/2} \oplus D^-_{1/2}$ is the sum of the representations in (3.19); this is a sum 
of the two vector bundles 

(3.28)    $S_\pm(\tilde{P}) = \tilde{P} \times_{D^{\pm}_{1/2}} S_\pm(2k)$. 

Recall that, as in §6 of Appendix C, on Connections and Curvature, the sections 
of $S(\tilde{P})$ are in natural correspondence with the functions $f$ on $\tilde{P}$, taking values 
in the vector space $S(2k)$, which satisfy the compatibility conditions 

(3.29)    $f(p \cdot g) = \rho(g)^{-1} f(p), \quad p \in \tilde{P}, \ g \in \mathrm{Spin}(2k)$, 

where we write the $\mathrm{Spin}(n)$-action on $\tilde{P}$ as a right action. 
Recall that $S(2k)$ is a $\mathrm{Cl}(2k)$-module, via (2.13). This result extends to the 
bundle level. 


Proposition 3.4. The spinor bundle $S(\tilde P)$ is a natural $\mathrm{Cl}(M)$-module.


Proof. Given a section $u$ of $\mathrm{Cl}(M)$ and a section $\varphi$ of $S(\tilde P)$, we need to define
$u\cdot\varphi$ as a section of $S(\tilde P)$. We regard $u$ as a function on $\tilde P$ with values in $\mathrm{Cl}(n)$
and $\varphi$ as a function on $\tilde P$ with values in $S(n)$. Then $u\cdot\varphi$ is a function on $\tilde P$ with
values in $S(n)$; we need to verify the compatibility condition (3.29). Indeed, for
$p \in \tilde P$, $g \in \mathrm{Spin}(2k)$,

(3.30)    $u\cdot\varphi(p\cdot g) = g^* u(p)\, g\, g^* \varphi(p) = g^* u(p)\varphi(p),$

since $gg^* = 1$ for $g \in \mathrm{Spin}(n)$. This completes the proof.


Whenever $(M, g)$ is an oriented Riemannian manifold, the Levi-Civita connection
provides a connection on the principal $SO(n)$-bundle of frames $P$. If $M$
has a spin structure, this choice of horizontal space for $P$ lifts in a unique natural
fashion to provide a connection on $\tilde P$. Thus the spinor bundle constructed above
has a natural connection, which we will call the Dirac-Levi-Civita connection.


Proposition 3.5. The Dirac-Levi-Civita connection $\nabla$ on $S(\tilde P)$ is a Clifford
connection.


Proof. Clearly, $\nabla$ is a metric connection, since the representation $\rho$ of $\mathrm{Spin}(2k)$
on $S(2k)$ is unitary. It remains to verify the compatibility condition (1.36),
namely,

(3.31)    $\nabla_X(v\cdot\varphi) = (\nabla_X v)\cdot\varphi + v\cdot\nabla_X\varphi,$

for a vector field $X$, a 1-form $v$, and a section $\varphi$ of $S(\tilde P)$. To see this, we first
note that as stated in (3.30), the bundle $\mathrm{Cl}(M)$ can be obtained from $\tilde P \to M$
as $\tilde P \times_\kappa \mathrm{Cl}(2k)$, where $\kappa$ is the representation of $\mathrm{Spin}(2k)$ on $\mathrm{Cl}(2k)$ given by
$\kappa(g)w = g w g^*$. Furthermore, $T^*M$ can be regarded as a subbundle of $\mathrm{Cl}(M)$,
obtained from $\tilde P \times_\kappa \mathbb{R}^{2k}$ with the same formula for $\kappa$. The connection on $T^*M$
obtained from that on $\tilde P$ is identical to the usual connection on $T^*M$ defined via
the Levi-Civita formula. Given this, (3.31) is a straightforward derivation identity.


Using the prescription (1.31)-(1.33), we can define the Dirac operator on a
Riemannian manifold of dimension $2k$, with a spin structure:

(3.32)    $D : C^\infty(M, S(\tilde P)) \longrightarrow C^\infty(M, S(\tilde P)).$

We see that Proposition 1.1 applies; $D$ is symmetric. Note also the grading:

(3.33)    $D : C^\infty(M, S_\pm(\tilde P)) \longrightarrow C^\infty(M, S_\mp(\tilde P)).$

In other words, this Dirac operator is of the form (1.3).

On a Riemannian manifold of dimension $2k$ with a spin structure $\tilde P \to M$, let
$F \to M$ be another vector bundle. Then the tensor product $E = S(\tilde P) \otimes F$ is a
$\mathrm{Cl}(M)$-module in a natural fashion. If $F$ has a connection, then $E$ gets a natural
product connection. Then the construction (1.31)-(1.33) yields an operator $D_F$ of
Dirac type on sections of $E$; in fact

(3.34)    $D_F : C^\infty(M, E_\pm) \longrightarrow C^\infty(M, E_\mp), \quad E_\pm = S_\pm(\tilde P) \otimes F.$

If $F$ has a metric connection, then $E$ gets a Clifford connection. The operator $D_F$
is called a twisted Dirac operator. Sometimes it will be convenient to distinguish
notationally the two pieces of $D_F$; we write

(3.35)    $D_F^+ : C^\infty(M, E_+) \longrightarrow C^\infty(M, E_-),$
          $D_F^- : C^\infty(M, E_-) \longrightarrow C^\infty(M, E_+).$

When $\dim M = 2k - 1$ is odd, we use the representation (3.23) to form the
bundle of spinors

$S_+(\tilde P) = \tilde P \times_{D^+_{1/2}} S_+(2k).$

The inclusion $\mathrm{Cl}(2k-1) \hookrightarrow \mathrm{Cl}^0(2k)$ defined by (3.20)-(3.21) makes $S_+(2k)$ a
$\mathrm{Cl}(2k-1)$-module, and analogues of Propositions 3.4 and 3.5 hold. Hence there
arises a Dirac operator, $D : C^\infty(M, S_+(\tilde P)) \to C^\infty(M, S_+(\tilde P))$. Twisted Dirac
operators also arise; however, in place of (3.34), we have $D_F : C^\infty(M, E_+) \to C^\infty(M, E_+)$,
with $E_+ = S_+(\tilde P) \otimes F$.


Exercises 


1. Verify that the map (3.15) is an isomorphism and that the representations (3.17) of
$\mathrm{Spin}(V, g)$ are irreducible when $\dim V = 2k$.
2. Let $\nu$ be as in Exercises 1-4 of §2, with $n = 2k$. Show that

a) the center of $\mathrm{Spin}(V, g)$ consists of $\{1, -1, \nu, -\nu\}$,

b) $\mu(\nu)$ leaves $S_+$ and $S_-$ invariant,

c) $\mu(\nu)$ commutes with the action of $\mathrm{Cl}^0(V, g)$ under $\mu$, hence with the representations
$D^\pm_{1/2}$ of $\mathrm{Spin}(V, g)$,

d) $\mu(\nu)$ acts as a pair of scalars on $S_+$ and $S_-$, respectively. These scalars are the
two square roots of $(-1)^k$.

3. Calculate $\mu(\nu)\cdot 1$ directly, making use of the definition (2.11). Hence match the scalars
in exercise 2d) to $S_+$ and $S_-$. (Hint: $\mu(e_{k+1}\cdots e_{2k})\cdot 1 = (-i)^k e_{k+1} \wedge \cdots \wedge e_{2k}$
in $\Lambda^*_{\mathbb{C}}\mathcal{V}$. Using $e_{j+k} = i e_j$ in $\mathcal{V}$, for $1 \le j \le k$, we have

$\mu(\nu)\cdot 1 = \mu(e_1\cdots e_k)(e_1 \wedge \cdots \wedge e_k),$

and there are $k$ interior products to compute.)

4. Show that $\mathrm{Cl}^{[2]}(V, g)$, with the Lie algebra structure (2.19), is naturally isomorphic to
the Lie algebra of $\mathrm{Spin}(V, g)$. In fact, if $(a_{jk})$ is a real, antisymmetric matrix, in the
Lie algebra of $SO(n)$, which is the same as that of $\mathrm{Spin}(n)$, show that there is the
correspondence

$A = (a_{jk}) \mapsto \frac{1}{4}\sum a_{jk}\, e_j e_k = \kappa(A).$

In particular, show that $\kappa(A_1A_2 - A_2A_1) = \kappa(A_1)\kappa(A_2) - \kappa(A_2)\kappa(A_1)$.

5. If $X$ is a spin manifold and $M \subset X$ is an oriented submanifold of codimension 1,
show that $M$ has a spin structure. Deduce that an oriented hypersurface in $\mathbb{R}^n$ has a
spin structure.


4. Weitzenbock formulas 


Let $E \to M$ be a Hermitian vector bundle with a metric connection $\nabla$. Suppose
$E$ is also a $\mathrm{Cl}(M)$-module and that $\nabla$ is a Clifford connection. If we consider
the Dirac-type operator $D : C^\infty(M, E) \to C^\infty(M, E)$ and the covariant derivative
$\nabla : C^\infty(M, E) \to C^\infty(M, T^* \otimes E)$, then $D^2$ and $\nabla^*\nabla$ are operators on
$C^\infty(M, E)$ with the same principal symbol. It is of interest to examine their difference,
clearly a differential operator of order $\le 1$. In fact, the difference has
order 0. This can be seen in principle from the following considerations. From
Exercise 4 of §1, we have

(4.1)    $D^2(f\varphi) = f D^2\varphi - 2\nabla_{\mathrm{grad}\, f}\varphi - (\Delta f)\varphi,$

when $\varphi \in C^\infty(M, E)$, $f$ a scalar function. Similarly, we compute $\nabla^*\nabla(f\varphi)$.
The derivation property of $\nabla$ implies

(4.2)    $\nabla(f\varphi) = f\nabla\varphi + df \otimes \varphi.$

To apply $\nabla^*$ to this, first a short calculation gives

(4.3)    $\nabla^* f(u \otimes \varphi) = f\nabla^*(u \otimes \varphi) - \langle df, u\rangle\varphi,$

for $u \in C^\infty(M, T^*)$, $\varphi \in C^\infty(M, E)$, and hence

(4.4)    $\nabla^*(f\nabla\varphi) = f\nabla^*\nabla\varphi - \nabla_{\mathrm{grad}\, f}\varphi.$

This gives $\nabla^*$ applied to the first term on the right side of (4.2). To apply $\nabla^*$ to
the other term, we can use the identity (see Appendix C, (1.35))

(4.5)    $\nabla^*(u \otimes \varphi) = -\nabla_U\varphi - (\mathrm{div}\, U)\varphi,$

where $U$ is the vector field corresponding to $u$ via the metric on $M$. Hence

(4.6)    $\nabla^*(df \otimes \varphi) = -\nabla_{\mathrm{grad}\, f}\varphi - (\Delta f)\varphi.$

Then (4.6) and (4.4) applied to (4.2) give

(4.7)    $\nabla^*\nabla(f\varphi) = f\nabla^*\nabla\varphi - 2\nabla_{\mathrm{grad}\, f}\varphi - (\Delta f)\varphi.$

Comparing (4.1) and (4.7), we have

(4.8)    $(D^2 - \nabla^*\nabla)(f\varphi) = f(D^2 - \nabla^*\nabla)\varphi,$

which implies $D^2 - \nabla^*\nabla$ has order zero, hence is given by a bundle map on $E$.
We now derive the Weitzenbock formula for what this difference is.
We now derive the Weitzenbock formula for what this difference is. 


Proposition 4.1. If $E \to M$ is a $\mathrm{Cl}(M)$-module with Clifford connection and
associated Dirac-type operator $D$, then, for $\varphi \in C^\infty(M, E)$,

(4.9)    $D^2\varphi = \nabla^*\nabla\varphi - \sum_{j>k} v_k v_j K(e_k, e_j)\varphi,$

where $\{e_j\}$ is a local orthonormal frame of vector fields, with dual frame field
$\{v_j\}$, and $K$ is the curvature tensor of $(E, \nabla)$.


Proof. Starting with $D\varphi = i\sum_j v_j \nabla_{e_j}\varphi$, we obtain

(4.10)    $D^2\varphi = -\sum_{j,k} v_k \nabla_{e_k}(v_j \nabla_{e_j}\varphi)$
          $\quad = -\sum_{j,k} v_k\bigl[v_j \nabla_{e_k}\nabla_{e_j}\varphi + (\nabla_{e_k} v_j)\nabla_{e_j}\varphi\bigr],$

using the compatibility condition (1.36). We replace $\nabla_{e_k}\nabla_{e_j}$ by the Hessian,
using the identity

(4.11)    $\nabla^2_{e_k, e_j}\varphi = \nabla_{e_k}\nabla_{e_j}\varphi - \nabla_{\nabla_{e_k} e_j}\varphi;$

cf. (2.4) of Appendix C. We obtain

(4.12)    $D^2\varphi = -\sum_{j,k} v_k v_j \nabla^2_{e_k, e_j}\varphi$
          $\quad - \sum_{j,k} v_k\bigl[v_j \nabla_{\nabla_{e_k} e_j}\varphi + (\nabla_{e_k} v_j)\nabla_{e_j}\varphi\bigr].$

Let us look at each of the two double sums on the right. Using $v_j^2 = 1$ and the
anticommutator property $v_k v_j = -v_j v_k$ for $k \ne j$, we see that the first double
sum becomes

(4.13)    $-\sum_j \nabla^2_{e_j, e_j}\varphi - \sum_{j>k} v_k v_j K(e_k, e_j)\varphi,$

since the antisymmetric part of the Hessian is the curvature. This is equal to the
right side of (4.9), in light of the formula for $\nabla^*\nabla$ established in Proposition 2.1
of Appendix C. As for the remaining double sum in (4.12), for any $p \in M$, we
can choose a local orthonormal frame field $\{e_j\}$ such that $\nabla_{e_j} e_k = 0$ at $p$, and
then this term vanishes at $p$. This proves (4.9).


We denote the difference $D^2 - \nabla^*\nabla$ by $\mathcal{K}$, so

(4.14)    $(D^2 - \nabla^*\nabla)\varphi = \mathcal{K}\varphi, \quad \mathcal{K} \in C^\infty(M, \mathrm{End}\, E).$

The formula for $\mathcal{K}$ in (4.9) can also be written as

(4.15)    $\mathcal{K}\varphi = -\frac{1}{2}\sum_{j,k} v_k v_j K(e_k, e_j)\varphi.$

Since a number of formulas that follow will involve multiple summation, we will
use the summation convention.

This general formula for $\mathcal{K}$ simplifies further in some important special cases.
The first simple example of this will be useful for further calculations.


Proposition 4.2. Let $E = \Lambda^* M$, with $\mathrm{Cl}(M)$-module structure and connection
described in §1, so $\mathcal{K} \in C^\infty(M, \mathrm{End}\, \Lambda^*)$. In this case,

(4.16)    $u \in \Lambda^1 M \Longrightarrow \mathcal{K}u = \mathrm{Ric}(u).$


Proof. The curvature of $\Lambda^* M$ is a sum of curvatures of each factor $\Lambda^k M$. In
particular, if $\{e_j, v_j\}$ is a local dual pair of frame fields,

(4.17)    $K(e_i, e_j)v_k = -R^k{}_{\ell ij}\, v_\ell,$

where $R^k{}_{\ell ij}$ are the components of the Riemann tensor, with respect to these
frame fields, and we use the summation convention. In light of (4.15), the desired
identity (4.16) will hold provided

(4.18)    $\frac{1}{2} v_i v_j v_\ell R^k{}_{\ell ij} = \mathrm{Ric}(v_k),$

so it remains to establish this identity. Since, if $(i, j, \ell)$ are distinct, $v_i v_j v_\ell =
v_\ell v_i v_j = v_j v_\ell v_i$, and since by Bianchi's first identity

$R^k{}_{\ell ij} + R^k{}_{ij\ell} + R^k{}_{j\ell i} = 0,$

it follows that in summing the left side of (4.18), the sum over $(i, j, \ell)$ distinct
vanishes. By antisymmetry of $R^k{}_{\ell ij}$, the terms with $i = j$ vanish. Thus the only
contributions arise from $i = \ell \ne j$ and $i \ne \ell = j$. Therefore, the left side of
(4.18) is equal to

(4.19)    $\frac{1}{2}\bigl(-v_j R^k{}_{iij} + v_i R^k{}_{jij}\bigr) = v_i R^k{}_{jij} = \mathrm{Ric}(v_k),$

which completes the proof.


We next derive Lichnerowicz's calculation of $\mathcal{K}$ when $E = S(\tilde P)$, the spinor
bundle of a manifold $M$ with spin structure. First we need an expression for the
curvature of $S(\tilde P)$.


Lemma 4.3. The curvature tensor of the spinor bundle $S(\tilde P)$ is given by

(4.20)    $K(e_i, e_j)\varphi = \frac{1}{4} R^k{}_{\ell ij}\, v_k v_\ell \varphi.$


Proof. This follows from the relation between curvatures on vector bundles and
on principal bundles established in Appendix C, §6, together with the identification
of the Lie algebra of $\mathrm{Spin}(n)$ with $\mathrm{Cl}^{[2]}(n)$ given in Exercise 4 of §3.


Proposition 4.4. For the spin bundle $S(\tilde P)$, $\mathcal{K} \in C^\infty(M, \mathrm{End}\, S(\tilde P))$ is given by

(4.21)    $\mathcal{K}\varphi = \frac{1}{4} S\varphi,$

where $S$ is the scalar curvature of $M$.


Proof. Using (4.20), the general formula (4.15) yields

(4.22)    $\mathcal{K}\varphi = -\frac{1}{8} R^k{}_{\ell ij}\, v_i v_j v_k v_\ell \varphi = \frac{1}{8} v_i v_j v_\ell R^k{}_{\ell ij}\, v_k \varphi,$

the last identity holding by the anticommutation relations; note that only the sum
over $k \ne \ell$ counts. Now, by (4.18), this becomes

(4.23)    $\mathcal{K}\varphi = \frac{1}{4} v_i v_k R^k{}_{jij}\varphi$
          $\quad = \frac{1}{4}\mathrm{Ric}_{jj}\varphi$    (by symmetry)
          $\quad = \frac{1}{4} S\varphi,$

completing the proof.
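The algebra behind (4.22)-(4.23) can be checked by machine on an example. The following is a minimal sketch, not part of the text's argument: it models the Clifford relations $v_j^2 = 1$, $v_i v_j = -v_j v_i$ with bitmask-indexed basis monomials, builds an algebraic curvature tensor as a Kulkarni-Nomizu product of two symmetric integer matrices (an assumption made only to get a tensor with the Riemann symmetries, including the first Bianchi identity), and confirms that $-\frac{1}{8} R_{k\ell ij} v_i v_j v_k v_\ell$ reduces to the scalar $\frac{1}{4}S$ with $S = \sum_{k,j} R_{kjkj}$. All function names are hypothetical.

```python
from fractions import Fraction
from itertools import product

def blade_sign(a, b):
    """Sign produced when the monomials with bitmasks a, b are multiplied and sorted."""
    a >>= 1
    swaps = 0
    while a:
        swaps += bin(a & b).count("1")
        a >>= 1
    return -1 if swaps % 2 else 1

def cl_mul(u, v):
    """Clifford product with v_j^2 = +1; elements are dicts {bitmask: coefficient}."""
    out = {}
    for a, ca in u.items():
        for b, cb in v.items():
            m = a ^ b
            out[m] = out.get(m, 0) + blade_sign(a, b) * ca * cb
    return {m: c for m, c in out.items() if c != 0}

def kulkarni_nomizu(A, B):
    """Return R(k, l, i, j) with the symmetries of a Riemann tensor (A, B symmetric)."""
    def R(k, l, i, j):
        return (A[k][i] * B[l][j] + A[l][j] * B[k][i]
                - A[k][j] * B[l][i] - A[l][i] * B[k][j])
    return R

def lichnerowicz_term(R, n):
    """The Clifford element -(1/8) * sum over i,j,k,l of R(k,l,i,j) v_i v_j v_k v_l."""
    total = {}
    for i, j, k, l in product(range(n), repeat=4):
        coeff = R(k, l, i, j)
        if coeff == 0:
            continue
        mono = {0: Fraction(-coeff, 8)}
        for idx in (i, j, k, l):
            mono = cl_mul(mono, {1 << idx: 1})
        for m, c in mono.items():
            total[m] = total.get(m, 0) + c
    return {m: c for m, c in total.items() if c != 0}

def scalar_curvature(R, n):
    """S = sum of Ric_kk, with Ric_ki = sum over j of R(k, j, i, j)."""
    return sum(R(k, j, k, j) for k in range(n) for j in range(n))

A = [[2, 1, 0, 0], [1, 3, 1, 0], [0, 1, 1, 2], [0, 0, 2, 5]]
B = [[1, 0, 2, 0], [0, 4, 0, 1], [2, 0, 3, 0], [0, 1, 0, 2]]
R = kulkarni_nomizu(A, B)
print(lichnerowicz_term(R, 4), Fraction(scalar_curvature(R, 4), 4))
```

The only surviving component is the scalar one (bitmask 0), which is the content of Proposition 4.4 at the level of linear algebra; the terms with all four indices distinct cancel through the Bianchi identity, as in the proof of (4.18).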


We record the generalization of Proposition 4.4 to the case of twisted Dirac 
operators. We mention that one often sees a different sign before the sum, due to 
a different sign convention for Clifford algebras. 


Proposition 4.5. Let $E \to M$ have a metric connection $\nabla$, with curvature $R^E$.
For the twisted Dirac operator on sections of $F = S(\tilde P) \otimes E$, the section $\mathcal{K}$ of
$\mathrm{End}\, F$ has the form

(4.24)    $\mathcal{K}\varphi = \frac{1}{4} S\varphi - \frac{1}{2}\sum_{i,j} v_i v_j R^E(e_i, e_j)\varphi.$


Proof. Here $R^E(e_i, e_j)$ is shorthand for $1 \otimes R^E(e_i, e_j)$ acting on $S(\tilde P) \otimes E$. This
formula is a consequence of the general formula (4.15) and the argument proving
Proposition 4.4, since the curvature of $S(\tilde P) \otimes E$ is $K \otimes I + 1 \otimes R^E$, $K$ being
the curvature of $S(\tilde P)$, given by (4.20).


These Weitzenbock formulas will be of use in the following sections. Here we
draw some interesting conclusions, due to Bochner and Lichnerowicz.


Proposition 4.6. If $M$ is compact and connected, and the section $\mathcal{K}$ in (4.14)-(4.15)
has the property that $\mathcal{K} \ge 0$ on $M$ and $\mathcal{K} > 0$ at some point, then
$\ker D = 0$.


Proof. This is immediate from

$(D^2\varphi, \varphi) = (\mathcal{K}\varphi, \varphi) + \|\nabla\varphi\|^2_{L^2}.$


Proposition 4.7. If $M$ is a compact Riemannian manifold with positive Ricci tensor,
then $b_1(M) = 0$, that is, the deRham cohomology group $H^1(M, \mathbb{R}) = 0$.


Proof. Via Hodge theory, we want to show that if $u \in \Lambda^1(M)$ and $du = d^*u = 0$,
then $u = 0$. This hypothesis implies $Du = 0$, where $D$ is the Dirac-type operator
dealt with in Proposition 4.2. Consequently we have, for a 1-form $u$ on $M$,

(4.25)    $\|Du\|^2_{L^2} = (\mathrm{Ric}(u), u) + \|\nabla u\|^2_{L^2},$

so the result follows.


Proposition 4.8. If $M$ is a compact, connected Riemannian manifold with a spin
structure whose scalar curvature is $\ge 0$ on $M$ and $> 0$ at some point, then $M$
has no nonzero harmonic spinors, that is, $\ker D = 0$ in $C^\infty(M, S(\tilde P))$.


Proof. In light of (4.21), this is a special case of Proposition 4.6.


Exercises 


1. Let $\Delta$ be the Laplace operator on functions (0-forms) on a compact Riemannian
manifold $M$, $\Delta_k$ the Hodge Laplacian on $k$-forms. If $\mathrm{Spec}(-\Delta)$ consists of $0 = \lambda_0 <
\lambda_1 < \lambda_2 < \cdots$, show that $\lambda_j \in \mathrm{Spec}(-\Delta_1)$ for $j \ge 1$.

2. If $\mathrm{Ric} \ge c_0 I$ on $M$, show that $\lambda_1 \ge c_0$.

3. Recall the deformation tensor of a vector field $u$:

$\mathrm{Def}\, u = \frac{1}{2}\mathcal{L}_u g = \frac{1}{2}(\nabla u + \nabla u^t), \quad \mathrm{Def} : C^\infty(M, T) \to C^\infty(M, S^2T^*).$

Show that

$\mathrm{Def}^* v = -\mathrm{div}\, v,$

where $(\mathrm{div}\, v)^j = v^{jk}{}_{;k}$. Establish the Weitzenbock formula

(4.26)    $2\, \mathrm{div}\, \mathrm{Def}\, u = -\nabla^*\nabla u + \mathrm{grad}\, \mathrm{div}\, u + \mathrm{Ric}(u).$

The operator div on the right is the usual divergence operator on vector fields. (This
formula will appear again in Chap. 17, in the study of the Navier-Stokes equation.)

4. Suppose $M$ is a compact, connected Riemannian manifold, whose Ricci tensor satisfies

(4.27)    $\mathrm{Ric}(x) \le 0$ on $M$, $\quad \mathrm{Ric}(x_0) < 0$, for some $x_0 \in M$.

Show that the operator Def is injective, so there are no nontrivial Killing fields on $M$,
hence no nontrivial one-parameter groups of isometries. (Hint: From (4.26), we have

(4.28)    $2\|\mathrm{Def}\, u\|^2_{L^2} = \|\nabla u\|^2_{L^2} + \|\mathrm{div}\, u\|^2_{L^2} - (\mathrm{Ric}(u), u)_{L^2}.$)

5. As shown in (3.39) of Chap. 2, the equation of a conformal Killing field on an
$n$-dimensional Riemannian manifold $M$ is

(4.29)    $\mathrm{Def}\, X - \frac{1}{n}(\mathrm{div}\, X)g = 0.$

Note that the left side is the trace-free part of $\mathrm{Def}\, X \in C^\infty(M, S^2T^*)$. Denote it by
$D_{TF}X$. Show that

(4.30)    $D_{TF}^* = -\mathrm{div}\big|_{S_0^2T^*}, \quad D_{TF}^* D_{TF} X = -\mathrm{div}\, \mathrm{Def}\, X + \frac{1}{n}(\mathrm{grad}\, \mathrm{div}\, X),$

where $S_0^2T^*$ is the trace-free part of $S^2T^*$. Show that

(4.31)    $\|D_{TF}X\|^2_{L^2} = \frac{1}{2}\|\nabla X\|^2_{L^2} + \Bigl(\frac{1}{2} - \frac{1}{n}\Bigr)\|\mathrm{div}\, X\|^2_{L^2} - \frac{1}{2}(\mathrm{Ric}(X), X)_{L^2}.$

Deduce that if $M$ is compact and satisfies (4.27), then $M$ has no nontrivial
one-parameter group of conformal diffeomorphisms.

6. Show that if $M$ is a compact Riemannian manifold which is Ricci flat (i.e., $\mathrm{Ric} = 0$),
then every conformal Killing field is a Killing field, and the dimension of the space of
Killing fields is given by

(4.32)    $\dim_{\mathbb{R}} \ker \mathrm{Def} = \dim H^1(M, \mathbb{R}).$

(Hint: Combine (4.25) and (4.28).)

7. Suppose $\dim M = 2$ and $M$ is compact and connected. Show that, for $u \in
C^\infty(M, S_0^2T^*)$,

$\|D_{TF}^* u\|^2_{L^2} = \frac{1}{2}\|\nabla u\|^2_{L^2} + \int_M K|u|^2\, dV,$

where $K$ is the Gauss curvature. Deduce that if $K \ge 0$ on $M$, and $K(x_0) > 0$ for some
$x_0 \in M$, then $\mathrm{Ker}\, D_{TF}^* = 0$. Compare with Exercises 6-8 of §10.

8. If $u$ and $v$ are vector fields on a Riemannian manifold $M$, show that

(4.33)    $\mathrm{div}\, \nabla_u v = \nabla_u(\mathrm{div}\, v) + \mathrm{Tr}((\nabla u)(\nabla v)) - \mathrm{Ric}(u, v).$

Compare with formula (3.17) in Chap. 17, on the Euler equation. Relate this identity to
the Weitzenbock formula for $\Delta$ on 1-forms (a special case of Proposition 4.2).


5. Index of Dirac operators 


If $D : C^\infty(M, E_0) \to C^\infty(M, E_1)$ is an elliptic, first-order differential operator
between sections of vector bundles $E_0$ and $E_1$ over a compact manifold $M$, then,
as we have seen, $D : H^{k+1}(M, E_0) \to H^k(M, E_1)$ is Fredholm, for any real $k$.
Furthermore, $\ker D$ is a finite-dimensional subspace of $C^\infty(M, E_0)$, independent
of $k$, and $D^* : H^{-k}(M, E_1) \to H^{-k-1}(M, E_0)$ has the same properties. A
quantity of substantial importance is the index of $D$:

(5.1)    $\mathrm{Index}\, D = \dim \ker D - \dim \ker D^*.$


In this section and the next we derive a formula for this index, due to Atiyah and 
Singer. Later sections will consider a few applications of this formula. 

One basic case for such index theorems is that of twisted Dirac operators. 
Thus, let $F \to M$ be a vector bundle with metric connection, over a compact
Riemannian manifold $M$ with a spin structure. Assume $\dim M = n = 2k$ is
even. The twisted Dirac operator constructed in §3 in particular gives an elliptic
operator

(5.2)    $D_F : C^\infty(M, S_+(\tilde P) \otimes F) \longrightarrow C^\infty(M, S_-(\tilde P) \otimes F).$

The Atiyah-Singer formula for the index of this operator is given as follows.


Theorem 5.1. If $M$ is a compact Riemannian manifold of dimension $n = 2k$ with
spin structure and $D_F$ the twisted Dirac operator (5.2), then

(5.3)    $\mathrm{Index}\, D_F = \langle \hat A(M)\, \mathrm{Ch}(F), [M] \rangle.$


What is meant by the right side of (5.3) is the following. $\hat A(M)$ and $\mathrm{Ch}(F)$
are certain characteristic classes; each is a sum of even-order differential forms
on $M$, computed from the curvatures of $S(\tilde P)$ and $F$, respectively. We will derive
explicit formulas for what these are in the course of the proof of this theorem,
in the next section, so we will not give the formulas here. The pairing with $[M]$
indicated in (5.3) is the integration over $M$ of the form of degree $2k = n$ arising
in the product $\hat A(M)\mathrm{Ch}(F)$.

The choice of notation in $\hat A(M)$ and $\mathrm{Ch}(F)$ indicates an independence of such
particulars as the choice of Riemannian metric on $M$ and of connection on $F$. This
is part of the nature of characteristic classes, at least after integration is performed;
for a discussion of this, see §7 of Appendix C. There is also a simple direct reason
why $\mathrm{Index}\, D_F$ does not depend on such choices. Namely, any two Riemannian
metrics on $M$ can be deformed to each other, and any two connections on $F$ can
be deformed to each other. The invariance of the index of $D_F$ is thus a special
case of the following.


Proposition 5.2. If $D_s$, $0 \le s \le 1$, is a continuous family of elliptic differential
operators $D_s : C^\infty(M, E_0) \to C^\infty(M, E_1)$ of first order, then $\mathrm{Index}\, D_s$ is
independent of $s$.


Proof. We have a norm-continuous family of Fredholm operators $D_s :
H^1(M, E_0) \to L^2(M, E_1)$; the constancy of the index of any continuous family
of Fredholm operators is proved in Appendix A, Proposition 7.4.


The proof of Theorem 5.1 will be via the heat-equation method, involving a
comparison of the spectra of $D^*D$ and $DD^*$, self-adjoint operators on $L^2(M, E_0)$
and $L^2(M, E_1)$, respectively. As we know, since $D^*D = L_0$ and $DD^* = L_1$ are
both elliptic and self-adjoint, they have discrete spectra, with eigenspaces of finite
dimension, contained in $C^\infty(M, E_j)$, say

(5.4)    $\mathrm{Eigen}(L_j, \lambda) = \{u \in C^\infty(M, E_j) : L_j u = \lambda u\}.$

We have the following result:
We have the following result: 


Proposition 5.3. The spectra of $L_0$ and $L_1$ are discrete subsets of $[0, \infty)$ which
coincide, except perhaps at 0. All nonzero eigenvalues have the same finite
multiplicity.

Proof. It is easy to see that for each $\lambda \in [0, \infty)$, $D : \mathrm{Eigen}(L_0, \lambda) \to
\mathrm{Eigen}(L_1, \lambda)$ and $D^* : \mathrm{Eigen}(L_1, \lambda) \to \mathrm{Eigen}(L_0, \lambda)$. For $\lambda \ne 0$, $D$ and $\lambda^{-1}D^*$
are inverses of each other on these spaces.


We know from the spectral theory of Chap. 8 that $\varphi(L_0)$ and $\varphi(L_1)$ are trace
class for any $\varphi \in \mathcal{S}(\mathbb{R})$. We hence have the following.


Proposition 5.4. For any $\varphi \in \mathcal{S}(\mathbb{R})$, with $\varphi(0) = 1$,

(5.5)    $\mathrm{Index}\, D = \mathrm{Tr}\, \varphi(D^*D) - \mathrm{Tr}\, \varphi(DD^*).$

In particular, for any $t > 0$,

(5.6)    $\mathrm{Index}\, D = \mathrm{Tr}\, e^{-tD^*D} - \mathrm{Tr}\, e^{-tDD^*}.$
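The cancellation behind (5.6) already occurs in finite dimensions, where it can be tested directly. The following is a minimal sketch under stated assumptions: `D` is an arbitrary real 2-by-3 matrix standing in for the operator (so the "index" is $3 - 2 = 1$ by rank-nullity), the heat traces are computed from a truncated power series of the matrix exponential, and all helper names are hypothetical.

```python
def matmul(a, b):
    """Product of two matrices given as nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(a):
    return [list(col) for col in zip(*a)]

def heat_trace(L, t, terms=60):
    """Tr exp(-t L) for a square matrix L, via the truncated exponential series."""
    n = len(L)
    term = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    total = float(n)
    for m in range(1, terms):
        # term becomes (-t L)^m / m!
        term = [[-t * x / m for x in row] for row in matmul(term, L)]
        total += sum(term[i][i] for i in range(n))
    return total

def index_via_heat(D, t):
    """Tr exp(-t D^T D) - Tr exp(-t D D^T) for a real matrix D."""
    Dt = transpose(D)
    return heat_trace(matmul(Dt, D), t) - heat_trace(matmul(D, Dt), t)

D = [[1.0, 2.0, 0.0], [0.0, 1.0, 1.0]]  # maps R^3 to R^2, rank 2
for t in (0.1, 0.5, 1.0):
    print(t, index_via_heat(D, t))
```

The nonzero eigenvalues of $D^TD$ and $DD^T$ coincide with multiplicity, exactly as in Proposition 5.3, so their contributions cancel for every $t$ and only the kernels contribute.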


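Now, whenever $D$ is of Dirac type, so $D^*D = L_0$ and $DD^* = L_1$ have scalar
principal symbol, results of Chap. 7 show that

(5.7)    $e^{-tL_j}u(x) = \int_M k_j(t, x, y)\, u(y)\, dV(y),$

with

(5.8)    $k_j(t, x, x) \sim t^{-n/2}\bigl[a_{j0}(x) + a_{j1}(x)t + \cdots + a_{j\ell}(x)t^\ell + \cdots\bigr],$

as $t \searrow 0$, with $a_{j\ell} \in C^\infty(M, \mathrm{End}\, E_j)$, so

(5.9)    $\mathrm{Tr}\, e^{-tL_j} \sim t^{-n/2}\bigl(b_{j0} + b_{j1}t + \cdots + b_{j\ell}t^\ell + \cdots\bigr),$

with

(5.10)    $b_{j\ell} = \int_M \mathrm{Tr}\, a_{j\ell}(x)\, dV(x).$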
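An expansion of the form (5.9) can be seen numerically in the simplest case. The following is a minimal sketch under stated assumptions, not taken from the text: $M$ is the circle of length $2\pi$, $L = -d^2/d\theta^2$ with eigenvalues $m^2$, $m \in \mathbb{Z}$, so $n = 1$ and the leading coefficient is $b_0 = \sqrt{\pi}$ (from the kernel value $(4\pi t)^{-1/2}$ on the diagonal); the function names and the cutoff are hypothetical.

```python
import math

def circle_heat_trace(t, cutoff=400):
    """Tr exp(-t L) for L = -d^2/dtheta^2 on the circle: sum of exp(-t m^2) over integers m."""
    return sum(math.exp(-t * m * m) for m in range(-cutoff, cutoff + 1))

def leading_term(t):
    """t^(-1/2) * b_0 with b_0 = sqrt(pi)."""
    return math.sqrt(math.pi / t)

for t in (0.5, 0.1, 0.01):
    print(t, circle_heat_trace(t), leading_term(t))
```

In this flat example all higher coefficients vanish and the remainder is exponentially small as $t \searrow 0$ (by Poisson summation), so the two columns agree to many digits.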


In light of (5.6), we have the following result:


Proposition 5.5. If $D$ is of Dirac type on $M$, of dimension $n = 2k$, then

(5.11)    $\mathrm{Index}\, D = b_{0k} - b_{1k} = \int_M \mathrm{Tr}\bigl[a_{0k}(x) - a_{1k}(x)\bigr]\, dV(x),$

where $a_{j\ell}$ are the coefficients in (5.8).


We remark that these calculations are valid for $\dim M = n$ odd. In that case,
there is no coefficient of $t^0$ in (5.8) or (5.9), so the identity (5.6) implies $\mathrm{Index}\,
D = 0$ for $\dim M$ odd. In fact, this holds for any elliptic differential operator, not
necessarily of Dirac type. On the other hand, if $\dim M$ is odd, there exist elliptic
pseudodifferential operators on $M$ with nonzero index.

We will establish the Atiyah-Singer formula (5.3) in the next section by showing
that, for a twisted Dirac operator $D_F$, the $2k$-form part of the right side of
the formula (5.3), with $\hat A(M)$ and $\mathrm{Ch}(F)$ given by curvatures in an appropriate
fashion, is equal pointwise on $M$ to the integrand in (5.11). Such an identity is
called a local index formula.


6. Proof of the local index formula 


Let $D_F$ be a twisted Dirac operator on a compact spin manifold, as in (5.2). If
$L_0 = D_F^*D_F$ and $L_1 = D_F D_F^*$, we saw in §5 that, for all $t > 0$,

(6.1)    $\mathrm{Index}\, D_F = \int_M \bigl[\mathrm{Tr}\, k_0(t, x, x) - \mathrm{Tr}\, k_1(t, x, x)\bigr]\, dV(x),$

where $k_j(t, x, y)$ are the Schwartz kernels of the operators $e^{-tL_j}$. In the index formula
stated in (5.3), $\hat A(M)$ and $\mathrm{Ch}(F)$ are to be regarded as differential forms on
$M$, arising in a fashion we will specify later in this section, from curvature forms
given by the spin structure on $M$ and a connection on $F$; the product is the wedge
product of forms. The following is the local index formula, which refines (5.3).


Theorem 6.1. For the twisted Dirac operator $D_F$, we have the pointwise identity

(6.2)    $\lim_{t \to 0}\bigl[\mathrm{Tr}\, k_0(t, x, x) - \mathrm{Tr}\, k_1(t, x, x)\bigr]\, dV = \{\hat A(M) \wedge \mathrm{Ch}(F)\}_n,$

where $\{\beta\}_n$ denotes the component of degree $n = \dim M$ of a differential form
$\beta$, and $dV$ denotes the volume form of the oriented manifold $M$.


We first obtain a formula for the difference in the traces of $k_0(t, x, x)$ and of
$k_1(t, x, x)$, which are elements of $\mathrm{End}((S_\pm)_x \otimes F_x)$. It is convenient to put these
together, and consider

(6.3)    $K = \begin{pmatrix} k_0 & 0 \\ 0 & k_1 \end{pmatrix} \in \mathrm{End}(S \otimes F),$


where $S = S_+ \oplus S_-$, and we have dropped $x$ and $t$. Using the isomorphism (2.14),
$\mu : \mathrm{Cl}(2k) \to \mathrm{End}\, S$, we can write

(6.4)    $\mathrm{End}(S \otimes F) = \mathrm{Cl}(2k) \otimes \mathrm{End}(F).$

We will suppose $\dim M = n = 2k$. In other words, we can think of an element of
$\mathrm{End}(S \otimes F)$ as a combination of elements of the Clifford algebra, whose coefficients
are linear transformations on $F$. Since (6.3) preserves $S_+ \otimes F$ and $S_- \otimes F$,
we have

(6.5)    $K \in \mathrm{Cl}^0(2k) \otimes \mathrm{End}(F).$

For $K$ of the form (6.3), the difference $\mathrm{Tr}\, k_0 - \mathrm{Tr}\, k_1$ is called the “supertrace”
of $K$, written

(6.6)    $\mathrm{Str}\, K = \mathrm{Tr}(\varepsilon K)$, with $\varepsilon = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}.$

The first key step in establishing (6.2) is the following identity, which arose in the
work of F. Berezin [Ber] and V. Patodi [Pt1]. Define the map

(6.7)    $\tau : \mathrm{Cl}(2k) \longrightarrow \mathbb{C}$

to be the evaluation of the coefficient of the “volume element” $\nu = e_1 \cdots e_{2k}$,
introduced in Exercises 1-4 of §2. Similarly define

(6.8)    $\tau_F : \mathrm{Cl}(2k) \otimes \mathrm{End}(F) \longrightarrow \mathrm{End}\, F, \quad \tilde\tau : \mathrm{Cl}(2k) \otimes \mathrm{End}(F) \longrightarrow \mathbb{C}$

to be

(6.9)    $\tau_F = \tau \otimes I, \quad \tilde\tau = \mathrm{Tr} \circ \tau_F,$

where the last trace is $\mathrm{Tr} : \mathrm{End}\, F \to \mathbb{C}$.

Lemma 6.2. The supertrace is given by

(6.10)    $\mathrm{Str}\, K = (-2i)^k\, \tilde\tau(K),$

using the identification (6.4).


Proof. If this is established for the case $F = \mathbb{C}$, the general case follows easily.
We note that, with $\nu = e_1 \cdots e_{2k}$,

$S_\pm = \{x \in S : \mu(i^k\nu)x = \pm x\}.$

Thus, for $K \in \mathrm{Cl}(2k)$,

(6.11)    $\mathrm{Str}\, K = \mathrm{Tr}(i^k \nu K).$

Thus (6.10) is equivalent to

(6.12)    $\mathrm{Tr}\, w = 2^k w_0,$

for $w \in \mathrm{Cl}(2k) = \mathrm{End}\, S$, where $w_0$ is the scalar term in the expansion (2.2) for
$w$. This in turn follows from

(6.13)    $\mathrm{Tr}\, 1 = 2^k$

and

(6.14)    $\mathrm{Tr}\, e_1^{i_1} \cdots e_{2k}^{i_{2k}} = 0$ if $i_1 + \cdots + i_{2k} > 0$, $\quad i_\nu = 0$ or $1$.

To verify these identities, note that 1 acts on $S$ as the identity, so (6.13) holds
by the computation of $\dim S$. As for (6.14), using $S \otimes S' \approx \mathrm{Cl}(2k)$, we see
that the left side of (6.14) is a multiple of the trace of $e_1^{i_1} \cdots e_{2k}^{i_{2k}}$ acting on $\mathrm{Cl}(2k)$ by Clifford
multiplication, which is clearly zero. The proof is complete.
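Lemma 6.2 can be confirmed numerically in low dimensions. The following is a minimal sketch under stated assumptions: the generators are realized as tensor products of Pauli matrices on $\mathbb{C}^{2^k}$ (one concrete choice of matrices with $e_j^2 = 1$ and $e_i e_j = -e_j e_i$, not necessarily the realization used in §2), the grading operator is taken to be $i^k\nu$ as in the proof, and the function names are hypothetical. For each monomial $e_1^{i_1}\cdots e_{2k}^{i_{2k}}$ the code computes $\mathrm{Tr}(i^k\nu\, e_1^{i_1}\cdots e_{2k}^{i_{2k}})$.

```python
from itertools import product

I2 = [[1, 0], [0, 1]]
X = [[0, 1], [1, 0]]
Y = [[0, -1j], [1j, 0]]
Z = [[1, 0], [0, -1]]

def kron(a, b):
    """Kronecker product of two matrices given as nested lists."""
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][l] * b[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

def generators(k):
    """Matrices e_1, ..., e_2k on C^(2^k) with e_j^2 = 1 that pairwise anticommute."""
    gens = []
    for m in range(k):
        for pauli in (X, Y):
            mat = [[1]]
            for factor in [Z] * m + [pauli] + [I2] * (k - m - 1):
                mat = kron(mat, factor)
            gens.append(mat)
    return gens

def supertraces(k):
    """Map each exponent tuple (i_1, ..., i_2k) to Tr(i^k nu e_1^{i_1} ... e_2k^{i_2k})."""
    gens = generators(k)
    dim = 2 ** k
    ident = [[1 if i == j else 0 for j in range(dim)] for i in range(dim)]
    nu = ident
    for g in gens:
        nu = matmul(nu, g)
    eps = [[(1j ** k) * x for x in row] for row in nu]
    table = {}
    for bits in product((0, 1), repeat=2 * k):
        mono = ident
        for g, b in zip(gens, bits):
            if b:
                mono = matmul(mono, g)
        table[bits] = trace(matmul(eps, mono))
    return table

table = supertraces(2)
print(table[(1, 1, 1, 1)], (-2j) ** 2)
```

Only the top monomial $\nu$ has nonzero supertrace, and its value is $(-2i)^k$, which is (6.10) with $F = \mathbb{C}$.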


Thus we want to analyze the $\mathrm{Cl}^{[2k]}(2k) \otimes \mathrm{End}\, F$ component of $K(t, x, x)$, the
value on the diagonal of $K(t, x, y)$, the Schwartz kernel of

$e^{-tD_F^2} = \begin{pmatrix} e^{-tL_0} & 0 \\ 0 & e^{-tL_1} \end{pmatrix}.$

We recall that a construction of $K(t, x, y)$ was made in Chap. 7, §13. It was shown
that, in local coordinates and with a local choice of trivializations of $S(\tilde P)$ and of
$F$, we could write, modulo a negligible error,

(6.15)    $e^{-tD_F^2}u(x) = (2\pi)^{-n/2}\int a(t, x, \xi)\, \hat u(\xi)\, e^{ix\cdot\xi}\, d\xi,$

where the amplitude $a(t, x, \xi)$ has an asymptotic expansion

(6.16)    $a(t, x, \xi) \sim \sum_{j \ge 0} a_j(t, x, \xi).$

The terms $a_j(t, x, \xi)$ were defined recursively in the following manner. If, with
such local coordinates and trivializations,

(6.17)    $D_F^2 = L(x, D_x),$

then, by the Leibniz formula, write

(6.18)    $L(a\, e^{ix\cdot\xi}) = e^{ix\cdot\xi}\Bigl[L_2(x, \xi)a(t, x, \xi) + \sum_{\ell=1}^{2} B_{2-\ell}(x, \xi, D_x)a(t, x, \xi)\Bigr],$

where $B_{2-\ell}(x, \xi, D_x)$ is a differential operator (of order $\ell$) whose coefficients are
polynomials in $\xi$, homogeneous of degree $2 - \ell$ in $\xi$. $L_2(x, \xi)$ is the principal
symbol of $L = D_F^2$.

Thus we want the amplitude $a(t, x, \xi)$ in (6.15) to satisfy formally

(6.19)    $\frac{\partial a}{\partial t} \sim -L_2 a - \sum_{\ell=1}^{2} B_{2-\ell}(x, \xi, D_x)a.$

If $a$ is taken to have the form (6.16), we produce the following transport equations
for $a_j$:

(6.20)    $\frac{\partial a_0}{\partial t} = -L_2(x, \xi)a_0(t, x, \xi)$

and, for $j \ge 1$,

(6.21)    $\frac{\partial a_j}{\partial t} = -L_2(x, \xi)a_j + \Omega_j(t, x, \xi),$

where

(6.22)    $\Omega_j(t, x, \xi) = -\sum_{\ell=1}^{2} B_{2-\ell}(x, \xi, D_x)a_{j-\ell}(t, x, \xi).$

By convention, we set $a_{-1} = 0$. So that (6.15) reduces to Fourier inversion at
$t = 0$, we set

(6.23)    $a_0(0, x, \xi) = 1, \quad a_j(0, x, \xi) = 0$, for $j \ge 1$.

Then we have

(6.24)    $a_0(t, x, \xi) = e^{-tL_2(x, \xi)}.$

The solution to (6.21) is

(6.25)    $a_j(t, x, \xi) = \int_0^t e^{(s-t)L_2(x, \xi)}\, \Omega_j(s, x, \xi)\, ds.$


Now, as shown in Chap. 7, we have

(6.26)    $\mathrm{Tr}\, e^{-tL} \sim \sum_{j \ge 0} \mathrm{Tr} \iint a_j(t, x, \xi)\, d\xi\, dx,$

with

(6.27)    $\int a_j(t, x, \xi)\, d\xi = t^{(-n+j)/2}\, b_j(x).$

Furthermore, the integral (6.27) vanishes for $j$ odd. Thus we have the expansion

(6.28)    $K(t, x, x) \sim t^{-n/2}\bigl[a_0(x) + a_1(x)t + \cdots + a_\ell(x)t^\ell + \cdots\bigr].$

Our goal is to analyze the $\mathrm{Cl}^{[2k]} \otimes \mathrm{End}\, F$ component of $a_k(x)$, with $n = 2k$.
In fact, the way the local index formula (6.2) is stated, the claim is made that
$a_\ell(x)$ has zero component in this space, for $\ell < k$. The next lemma gives a more
precise result. Its proof will also put us in a better position to evaluate the treasured
$\mathrm{Cl}^{[2k]} \otimes \mathrm{End}\, F$ component of $a_k(x)$. Recall the filtration (2.20) of $\mathrm{Cl}^0(2k)$;
complexification gives a similar filtration of $\mathbb{C}l(2k)$.


Lemma 6.3. In the expansion (6.28), we have

(6.29)    $a_j(x) \in \mathrm{Cl}^{[2j]}(2k) \otimes \mathrm{End}\, F, \quad 0 \le j \le k.$


In order to prove this, we examine the expression for $L = D_F^2$ in local coordinates,
with respect to convenient local trivializations of $S(\tilde P)$ and $F$. Fix $x_0 \in M$.
Use geodesic normal coordinates centered at $x_0$; in these coordinates, $x_0 = 0$. Let
$\{e_\alpha\}$ denote an orthonormal frame of tangent vectors, obtained by parallel translation
along geodesics from $x_0$ of an orthonormal basis of $T_{x_0}M$; let $\{v_\alpha\}$ denote
the dual frame. The frame $\{e_\alpha\}$ gives rise to a local trivialization of the spinor
bundle $S(\tilde P)$. Finally, choose an orthonormal frame $\{\varphi_\mu\}$ of $F$, obtained by parallel
translation along geodesics from $x_0$ of an orthonormal basis of $F_{x_0}$. The
connection coefficients for the Levi-Civita connection will be denoted as $\Gamma^k{}_{\ell j}$ for
the coordinate frame, $\Gamma^\alpha{}_{\beta j}$ for the frame $\{e_\alpha\}$; both sets of connection coefficients
vanish at 0, their first derivatives at 0 being given in terms of the Riemann
curvature tensor. Similarly, denote by $\theta_j = (\theta^\mu{}_{\nu j})$ the connection coefficients
for $F$, with respect to the frame $\{\varphi_\mu\}$. Denote by $\Phi_{\alpha\beta}$ the curvature of $F$, with
respect to the frame $\{e_\alpha\}$.

With respect to these choices, we write down a local coordinate expression for
$D_F^2$, using the Weitzenbock formula

$D_F^2 = \nabla^*\nabla + \frac{1}{4}S - \frac{1}{2} v_\alpha v_\beta \Phi_{\alpha\beta},$

together with the identity $\nabla^*\nabla = -\gamma \circ \nabla^2$, proved in Proposition 2.1 of Appendix
C. We obtain

(6.30)    $D_F^2 = -g^{j\ell}\bigl(\partial_j + \tfrac{1}{4}\Gamma^\alpha{}_{\beta j} v_\alpha v_\beta + \theta_j\bigr)\bigl(\partial_\ell + \tfrac{1}{4}\Gamma^\alpha{}_{\beta\ell} v_\alpha v_\beta + \theta_\ell\bigr)$
          $\quad + g^{j\ell}\Gamma^i{}_{j\ell}\bigl(\partial_i + \tfrac{1}{4}\Gamma^\alpha{}_{\beta i} v_\alpha v_\beta + \theta_i\bigr) + \tfrac{1}{4}S - \tfrac{1}{2}\Phi_{\alpha\beta} v_\alpha v_\beta.$

This has scalar second-order part. The coefficients of $\partial_j$ are products of elements
of $\mathrm{Cl}^{[2]}(2k)$ with connection coefficients, which vanish at 0. Terms involving no
derivatives include products of elements of $\mathrm{Cl}^{[2]}(2k)$ with curvatures, which may
not vanish, and products of elements of $\mathrm{Cl}^{[4]}(2k)$ with coefficients that vanish to
second order at 0.

Hence we can say the following about the operators $B_{2-\ell}(x, \xi, D_x)$, which
arise in (6.18) and which enter into the recursive formulas for $a_j(t, x, \xi)$. First,
$B_0(x, \xi, D_x)$, a differential operator of order 2 that is homogeneous of degree 0
in $\xi$ (thus actually independent of $\xi$), can be written as

$B_0(x, \xi, D_x) = \sum_{|\alpha| \le 2} B_{0\alpha}(x, \xi) D_x^\alpha,$

where $B_{00}(x, \xi)$ has coefficients in $\mathrm{Cl}^{[2]}(2k)$, and also coefficients that are
$O(|x|^2)$ in $\mathrm{Cl}^{[4]}(2k)$; $B_{0\alpha}(x, \xi)$ for $|\alpha| = 1$ has some coefficients that are
$O(|x|)$ in $\mathrm{Cl}^{[2]}(2k)$. Each $B_{0\alpha}(x, \xi)$ for $|\alpha| = 2$ is scalar. Note that $B_0(x, \xi, D_x)$
acts on $a_{j-2}(t, x, \xi)$ in the recursive formula (6.21)-(6.22) for $a_j(t, x, \xi)$.

The operator $B_1(x, \xi, D_x)$, a differential operator of order 1 that is homogeneous
of degree 1 in $\xi$, can be written as

$B_1(x, \xi, D_x) = \sum_{|\alpha| \le 1} B_{1\alpha}(x, \xi) D_x^\alpha,$

and among the coefficients are terms that are $O(|x|)$ in $\mathrm{Cl}^{[2]}(2k)$. The operator
$B_1(x, \xi, D_x)$ acts on $a_{j-1}(t, x, \xi)$ in (6.21)-(6.22).

We see that while the coefficients in $\mathrm{Cl}^{[2j]}(2k)$ in $a_j(t, x, \xi)$ give rise to coefficients
in $\mathrm{Cl}^{[2j+2]}(2k)$ in $a_{j+1}(t, x, \xi)$ and in $\mathrm{Cl}^{[2j+4]}(2k)$ in $a_{j+2}(t, x, \xi)$, the
degree of vanishing described above leads exactly to the sort of increase in
“Clifford order” stated in Lemma 6.3, which is consequently proved.

The proof of Lemma 6.3 gives more. Namely, the $\mathrm{Cl}^{[2j]}$-components of
$a_j(x_0)$, for $0 \le j \le k$, are unchanged if we replace $D_F^2$ by the following:

(6.31)    $\tilde L = -\sum_j \Bigl(\frac{\partial}{\partial x_j} - \frac{1}{8}\Omega_{j\ell}\, x_\ell\Bigr)^2 - \frac{1}{2}\Phi_{\alpha\beta} v_\alpha v_\beta,$

where $\Omega_{j\ell}$ denotes the Riemann curvature tensor, acting on sections of $S(\tilde P)$ as

(6.32)    $\Omega_{j\ell} = R_{j\ell\alpha\beta}\, v_\alpha v_\beta.$

In (6.31), summation over $\ell$ is understood. At this point, we can exploit a key observation
of Getzler, that the Schwartz kernel $\tilde K(t, x, y)$ of $e^{-t\tilde L}$ can be evaluated
in closed form at $y = 0$, by exploiting the similarity of (6.31) with the harmonic
oscillator Hamiltonian, whose exponential is given by Mehler's formula, provided
we modify $\tilde L$ in the following fashion.

Namely, for the purpose of picking out the $\mathrm{Cl}^{[2j]}$-components of $a_j(x_0)$, we
might as well let $\tilde L$ act on sections of $\mathrm{Cl}(2k) \otimes F$ rather than $S(\tilde P) \otimes F$, and then
we use the linear isomorphism $\mathrm{Cl}(2k) \approx \Lambda^*\mathbb{R}^n$, and let the products involving
$v_\alpha$ and $v_\beta$ in (6.31) and (6.32) be wedge products, which, after all, for the purpose
of our calculation are the principal parts of the Clifford products.

We can then separate $\tilde L$ into two commuting parts. Let $\tilde L_0$ denote the sum over
$j$ in (6.31), and let $\tilde K_0(t, x, y)$ be the Schwartz kernel of $e^{-t\tilde L_0}$. We can evaluate
$\tilde K_0(t, x, 0)$ using Mehler's formula, established in §6 of Chap. 8 (see particularly
Exercises 6 and 7 at the end of that section), which implies that whenever $(\Omega_{j\ell})$ is
an antisymmetric matrix of imaginary numbers (hence a self-adjoint matrix), then

(6.33)    $\tilde K_0(t, x, 0) = (4\pi t)^{-n/2} \det\Bigl(\frac{\Omega t/4}{\sinh(\Omega t/4)}\Bigr)^{1/2} e^{-(f(\Omega t/8)x,\, x)/4t},$

where $f(s) = 2s \coth 2s$. Now it is straightforward to verify that this formula is
also valid whenever $\Omega$ is a nilpotent element of any commutative ring (assumed
to be an algebra over $\mathbb{C}$), as in the case (6.32), where $\Omega$ is an $\mathrm{End}(T_{x_0}M)$-valued
2-form. Evaluating (6.33) at $x = 0$ gives

(6.34)    $\tilde K_0(t, 0, 0) = (4\pi t)^{-n/2} \det\Bigl(\frac{\Omega t/4}{\sinh(\Omega t/4)}\Bigr)^{1/2}.$

When $\Omega$ is the curvature 2-form of $M$, with its Riemannian metric, this is to
be interpreted in the same way as the characteristic classes discussed in §7 of
Appendix C. The $\hat A$-genus of $M$ is defined to be this determinant, at $t = 1/2\pi i$:

(6.35)    $\hat A(M) = \det\Bigl(\frac{\Omega/8\pi i}{\sinh(\Omega/8\pi i)}\Bigr)^{1/2}.$

The $\mathrm{Cl}^{[2k]}$-component of the $t^0$-coefficient in the expansion of $e^{-t\tilde L}$ is
$(-2i)^{-k}$ times the $2k$-form part of the product of (6.35) with $\mathrm{Tr}\, e^{-\Phi/2\pi i}$, where
$\Phi$ is the $\mathrm{End}\, F$-valued curvature 2-form of the connection on $F$. This is also a
characteristic class; we have the Chern character:

(6.36)    $\mathrm{Ch}(F) = \mathrm{Tr}\, e^{-\Phi/2\pi i}.$

This completes the proof of Theorem 6.1.
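Expanding a determinant such as (6.35) in powers of the curvature requires the Taylor coefficients of the scalar function $z/\sinh z$. The following is a minimal sketch (the function name is hypothetical) that computes them exactly by inverting the series $\sinh(z)/z = \sum_m z^{2m}/(2m+1)!$; how these coefficients are assembled into differential forms depends on the conventions of §7 of Appendix C and is not attempted here.

```python
from fractions import Fraction
from math import factorial

def z_over_sinh_coeffs(order):
    """Coefficients b_0, ..., b_order with z / sinh(z) = sum over m of b_m z^(2m)."""
    # c_m are the coefficients of sinh(z) / z in powers of z^2.
    c = [Fraction(1, factorial(2 * m + 1)) for m in range(order + 1)]
    b = [Fraction(1)]
    for m in range(1, order + 1):
        # The product of the two series is 1, so each higher coefficient vanishes.
        b.append(-sum(c[i] * b[m - i] for i in range(1, m + 1)))
    return b

print(z_over_sinh_coeffs(3))
```

The first coefficients are $1, -\tfrac16, \tfrac{7}{360}, -\tfrac{31}{15120}$, so that $z/\sinh z = 1 - z^2/6 + 7z^4/360 - \cdots$.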


Exercises


1. Write out the first few terms in the expansion of the formula (6.35) for $\hat A(M)$, such as
forms of degree 0, 4, 8.
2. If $M$ is a compact, oriented, four-dimensional manifold, show that

(6.37)    $\langle \hat A(M), [M] \rangle = -\frac{1}{24}\int_M p_1(TM),$

where $p_1$ is the first Pontrjagin class, defined in §7 of Appendix C.

3. If $M = \mathbb{CP}^2$, show that $\langle \hat A(M), [M] \rangle = -1/8$. Deduce that $\mathbb{CP}^2$ has no spin structure.

4. If $M$ is a spin manifold with positive scalar curvature, to which Proposition 4.8 applies,
show that $\langle \hat A(M), [M] \rangle = 0$. What can you deduce about the right side of (5.3) in such
a case? Consider particularly the case where $\dim M = 4$.
5. Let $F_j \to M$ be complex vector bundles. Show that

$\mathrm{Ch}(F_1 \oplus F_2) = \mathrm{Ch}(F_1) + \mathrm{Ch}(F_2),$
$\mathrm{Ch}(F_1 \otimes F_2) = \mathrm{Ch}(F_1) \wedge \mathrm{Ch}(F_2).$

6. If $F \to M$ is a complex line bundle, relate $\mathrm{Ch}(F)$ to the first Chern class $c_1(F)$,
defined in §7 of Appendix C.


7. The Chern—Gauss—Bonnet theorem 


Here we deduce from the Atiyah–Singer formula (5.3) the generalized Gauss–Bonnet
formula expressing as an integrated curvature the Euler characteristic
$\chi(M)$ of a compact, oriented Riemannian manifold $M$, of dimension $n = 2k$.
As we know from Hodge theory, $\chi(M)$ is the index of

(7.1) $d + d^* : \Lambda^{\mathrm{even}} M \longrightarrow \Lambda^{\mathrm{odd}} M.$

This is an operator of Dirac type, but it is not actually a twisted Dirac operator of
the form (3.34), even when $M$ has a spin structure. Rather, a further twist in the
twisting procedure is required. Until near the end of this section, we assume that
$M$ has a spin structure.

With $V = \mathbb{R}^{2k}$, we can identify $\mathbb{C}\Lambda^* V$, both as a linear space and as a Clifford
module, with $Cl(2k)$. Recall the isomorphism (2.14):

(7.2) $\mu : Cl(2k) \longrightarrow \mathrm{End}\, S,$

where $S = S(2k) = S_+(2k) \oplus S_-(2k)$. This can be rewritten as

$Cl(2k) \approx S \otimes S'.$

Now if $Cl(2k)$ acts on the left factor of this tensor product, then there is a twisted
Dirac operator

(7.3) $\begin{pmatrix} 0 & D^-_{S'} \\ D^+_{S'} & 0 \end{pmatrix}$

produced from the grading $S \otimes S' = (S_+ \otimes S') \oplus (S_- \otimes S')$, but this is not the
operator (7.1). Rather, it is the signature operator. To produce (7.1), we use the
identities $\mathbb{C}\Lambda^{\mathrm{even}} V = Cl^0(2k)$ and $\mathbb{C}\Lambda^{\mathrm{odd}} V = Cl^1(2k)$. Recall the isomorphism
(3.15):

(7.4) $\mu : Cl^0(2k) \longrightarrow \mathrm{End}\, S_+ \oplus \mathrm{End}\, S_-.$

We rewrite this as

(7.5) $Cl^0(2k) \approx (S_+ \otimes S'_+) \oplus (S_- \otimes S'_-).$

Similarly, we have an isomorphism

(7.6) $\mu : Cl^1(2k) \longrightarrow \mathrm{Hom}(S_+, S_-) \oplus \mathrm{Hom}(S_-, S_+),$

which we rewrite as

(7.7) $Cl^1(2k) \approx (S_- \otimes S'_+) \oplus (S_+ \otimes S'_-).$

It follows from this that the operator (7.1) is a "twisted" Dirac operator of the
form

(7.8) $\begin{pmatrix} 0 & D^-_{S'_+} \oplus D^+_{S'_-} \\ D^+_{S'_+} \oplus D^-_{S'_-} & 0 \end{pmatrix}.$

In other words, the index $\chi(M)$ of (7.1) is a difference:

$\mathrm{Index}\, D^+_{S'_+} - \mathrm{Index}\, D^+_{S'_-},$

since $\mathrm{Index}\, D^-_{S'_-} = -\mathrm{Index}\, D^+_{S'_-}$. Furthermore, this difference is respected in the
local index formula, an observation that will be useful later when we remove the
hypothesis that $M$ have a spin structure.
The Atiyah–Singer formula (5.3) thus yields

(7.9) $\chi(M) = \langle \hat A(M) [\mathrm{Ch}(S'_+) - \mathrm{Ch}(S'_-)], [M] \rangle.$




The major step from here to the Chern–Gauss–Bonnet theorem is to produce a
$2k$-form on $M$ expressing $\mathrm{Ch}(S'_+) - \mathrm{Ch}(S'_-)$ in purely differential geometric
terms, independent of a spin structure.

If $\pi_\pm$ are the representations of $\mathrm{Spin}(2k)$ on $S_\pm$, $d\pi_\pm$ the derived representations
of $\mathrm{spin}(2k)$, and $\Omega$ the $\mathrm{spin}(2k)$-valued curvature form on $\tilde P$, then

(7.10) $\mathrm{Ch}(S_\pm) = \mathrm{Tr}\, e^{-d\pi_\pm(\Omega)/2\pi i},$

a sum of even-order forms formally related to the characters of $\pi_\pm$,

(7.11) $\chi_\pm(g) = \mathrm{Tr}\, \pi_\pm(g), \quad g \in \mathrm{Spin}(2k).$

Note that $\dim S_+ = \dim S_-$ implies $\chi_+(e) - \chi_-(e) = 0$. It is a fact of
great significance that the difference $\chi_+(g) - \chi_-(g)$ vanishes to order $k$ at
the identity element $e \in \mathrm{Spin}(2k)$. More precisely, we have the following.
Take $X \in \mathrm{spin}(2k) \approx so(2k)$, identified with a real, skew-symmetric matrix,
$X = (X_{ij})$; there is the exponential map $\mathrm{Exp} : \mathrm{spin}(2k) \to \mathrm{Spin}(2k)$. The key
formula is given as follows:


Lemma 7.1. For $X \in so(2k)$,

(7.12) $\lim_{t \to 0} t^{-k} [\chi_+(\mathrm{Exp}\, tX) - \chi_-(\mathrm{Exp}\, tX)] = (-i)^k\, \mathrm{Pf}(X).$


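Here, $\mathrm{Pf} : so(2k) \to \mathbb{R}$ is the Pfaffian, defined as follows. Associate to $X \in so(2k)$
the 2-form

(7.13) $\xi = \xi(X) = \frac{1}{2} \sum X_{ij}\, e_i \wedge e_j,$

$e_1, \dots, e_{2k}$ denoting an oriented orthonormal basis of $\mathbb{R}^{2k}$. Then

(7.14) $k!\, (\mathrm{Pf}\, X)\, e_1 \wedge \cdots \wedge e_{2k} = \xi \wedge \cdots \wedge \xi$ ($k$ factors).

It follows from this definition that if $T : \mathbb{R}^{2k} \to \mathbb{R}^{2k}$ is linear, then $T^* \xi(X) = \xi(T^t X T)$, so

(7.15) $\mathrm{Pf}(T^t X T) = (\det T)\, \mathrm{Pf}(X).$

Now any $X \in so(n)$ can be written as $X = T^t A T$, where $T \in SO(n)$, and $A$ is
a sum of $2 \times 2$, skew-symmetric blocks, of the form

$A_\nu = \begin{pmatrix} 0 & a_\nu \\ -a_\nu & 0 \end{pmatrix}, \quad a_\nu \in \mathbb{R}.$

Thus $\xi(A) = a_1 e_1 \wedge e_2 + \cdots + a_k e_{2k-1} \wedge e_{2k}$, so

(7.16) $\mathrm{Pf}(X) = \mathrm{Pf}(A) = a_1 \cdots a_k.$

It follows that

(7.17) $\mathrm{Pf}(X)^2 = \det X.$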
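
The identities (7.15) and (7.17) are easy to test numerically in the first nontrivial case $k = 2$, where expanding $\xi \wedge \xi$ in (7.14) gives $\mathrm{Pf}(X) = X_{12}X_{34} - X_{13}X_{24} + X_{14}X_{23}$. The sketch below is illustrative only: the helper names and the sample matrices `X` and `T` are assumptions made for the example, not data from the text, and exact integer arithmetic is used so the checks involve no rounding.

```python
from itertools import permutations

def pfaffian4(X):
    """Pfaffian of a 4x4 skew-symmetric matrix, from (7.13)-(7.14) with k = 2."""
    return X[0][1] * X[2][3] - X[0][2] * X[1][3] + X[0][3] * X[1][2]

def det(A):
    """Determinant via the Leibniz permutation expansion (fine for small matrices)."""
    n = len(A)
    total = 0
    for perm in permutations(range(n)):
        # Sign of the permutation from its inversion count.
        inv = sum(1 for i in range(n) for j in range(i + 1, n) if perm[i] > perm[j])
        term = -1 if inv % 2 else 1
        for i in range(n):
            term *= A[i][perm[i]]
        total += term
    return total

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B))) for j in range(len(B[0]))]
            for i in range(len(A))]

def transpose(A):
    return [list(row) for row in zip(*A)]

# Arbitrary sample data: X is skew-symmetric, T is any linear map.
X = [[0, 2, -1, 3],
     [-2, 0, 4, 5],
     [1, -4, 0, -6],
     [-3, -5, 6, 0]]
T = [[1, 2, 0, 1],
     [0, 1, 3, 0],
     [2, 0, 1, 1],
     [1, 1, 0, 2]]

pf_X = pfaffian4(X)
TtXT = matmul(transpose(T), matmul(X, T))  # again skew-symmetric
```

Comparing `pf_X**2` with `det(X)` checks (7.17), and comparing `pfaffian4(TtXT)` with `det(T) * pf_X` checks (7.15).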


We also note that, if one uses Clifford multiplication rather than exterior
multiplication, on $k$ factors of $\xi(X)$, then the result has as its highest-order term
$k!\, (\mathrm{Pf}\, X)\, e_1 \cdots e_{2k}$. In other words, in terms of the map $\tau : Cl(2k) \to \mathbb{C}$ of (7.7),

(7.18) $k!\, (\mathrm{Pf}\, X) = \tau(\xi \cdots \xi),$

with $k$ factors of $\xi$.

To prove Lemma 7.1, note that the representation $\pi = \pi_+ \oplus \pi_-$ of $\mathrm{Spin}(2k)$
on $S = S_+ \oplus S_-$ is the restriction to $\mathrm{Spin}(2k)$ of the representation $\mu$ of $Cl(2k)$
on $S$ characterized by (2.11). Consequently, in view of Exercise 4 in §3,

(7.19) $\mathrm{Tr}\, \pi_+(\mathrm{Exp}\, tX) - \mathrm{Tr}\, \pi_-(\mathrm{Exp}\, tX) = \mathrm{Str}\, \mu\bigl(e^{(t/4) \sum X_{ij} e_i e_j}\bigr),$

where Str stands for the supertrace, as in (6.6). This can be evaluated by Berezin's
formula, (6.10), as $(-2i)^k$ times the coefficient of $\upsilon = e_1 \cdots e_{2k}$ in $e^{(t/4) \sum X_{ij} e_i e_j}$.
Now the lowest power of $t$ in the power-series expansion of this quantity, which
has a multiple of $\upsilon$ as coefficient, is the $k$th power; the corresponding term is

(7.20) $\frac{t^k}{4^k k!} \Bigl(\sum X_{ij} e_i e_j\Bigr)^k = \frac{t^k}{2^k} (\mathrm{Pf}\, X)\, \upsilon + \cdots,$

by (7.18). Thus, by (6.10), the leading term in the expansion in powers of $t$ of
(7.19) is $(-it)^k (\mathrm{Pf}\, X)$, which proves (7.12).

We remark that the formula (7.12) plays a central role in the proof of the index 
formula for (twisted) Dirac operators, in the papers of Bismut [Bi] and of Berline— 
Vergne [BV]. 

In §8 of Appendix C, it is shown that the Pfaffian arises directly for the generalized
Gauss–Bonnet formula for a hypersurface $M \subset \mathbb{R}^{2k+1}$ when one expresses
the degree of the Gauss map $M \to S^{2k}$ as an integral of the Jacobian determinant
of the Gauss map and evaluates this Jacobian determinant using the Weingarten
formula and Gauss' Theorema Egregium.

From (7.12) it follows that

(7.21) $\mathrm{Ch}(S'_+) - \mathrm{Ch}(S'_-) = (2\pi)^{-k}\, \mathrm{Pf}(\Omega).$

This is defined independently of any spin structure on $M$. Since locally any
manifold has spin structures, the local index formula of §6 provides us with the
following conclusion.




Theorem 7.2. If $M$ is a compact, oriented Riemannian manifold of dimension
$n = 2k$, then the Euler characteristic $\chi(M)$ satisfies the identity

(7.22) $\chi(M) = (2\pi)^{-k} \int_M \mathrm{Pf}(\Omega).$

Proof. It remains only to note that in the formula

$(2\pi)^{-k} \langle \hat A(M)\, \mathrm{Pf}(\Omega), [M] \rangle = \chi(M),$

since the factor $\mathrm{Pf}(\Omega)$ is a pure form of degree $2k = n$, only the leading term 1 in
$\hat A(M)$ contributes to this product.


Exercises 


1. Verify that when $\dim M = 2$, the formula (7.22) coincides with the classical Gauss–Bonnet
formula:

(7.23) $\int_M K\, dV = 2\pi \chi(M).$

2. Work out "more explicitly" the formula (7.22) when $\dim M = 4$. Show that

(7.24) $\chi(M) = \frac{1}{32\pi^2} \int_M \bigl(|R|^2 - 4|\mathrm{Ric}|^2 + S^2\bigr)\, dV,$

where $R$ is the Riemann curvature tensor, Ric the Ricci tensor, and $S$ the scalar curvature.
For some applications, see [An].

3. Evaluate (7.19); show that

(7.25) $\mathrm{Str}\, \mu\bigl(e^{(t/4) \sum X_{ij} e_i e_j}\bigr) = (-it)^k \det\Bigl(\frac{\sinh tX/2}{tX/2}\Bigr)^{1/2} \mathrm{Pf}\, X.$

(Hint: Reduce to the case where $X$ is a sum of $2 \times 2$ blocks.)

4. Apply Theorem 6.1 to give a formula for the index of the signature operator $D_H^+$, using
the representation (7.3) of $D_H^+ \oplus D_H^-$ as a twisted Dirac operator. Justify the formula
when $M$ has no spin structure. Show that, if $M$ is a compact, oriented 4-manifold, then

(7.26) $\mathrm{Index}\, D_H^+ = -8 \langle \hat A(M), [M] \rangle.$

(Hint: Take a peek in [Roe].)
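
As a numerical illustration of the two-dimensional case (7.23) of Theorem 7.2, the following sketch integrates $K\, dV$ over a round sphere and over a torus of revolution and divides by $2\pi$. It is a sanity check under stated assumptions rather than a proof: the parametrizations, the radii, the midpoint rule, and all function names are choices made for this example.

```python
import math

def integrate_curvature(k_times_area_density, u_range, v_range, n=400):
    """Midpoint rule for the double integral of K dV over a parameter rectangle."""
    (u0, u1), (v0, v1) = u_range, v_range
    du, dv = (u1 - u0) / n, (v1 - v0) / n
    total = 0.0
    for i in range(n):
        u = u0 + (i + 0.5) * du
        for j in range(n):
            v = v0 + (j + 0.5) * dv
            total += k_times_area_density(u, v)
    return total * du * dv

# Round sphere of radius r: K = 1/r^2, dV = r^2 sin(v) du dv.
def sphere_density(u, v, r=2.0):
    return (1.0 / r**2) * (r**2 * math.sin(v))

# Torus of revolution with radii R > r: K = cos(v) / (r (R + r cos v)),
# dV = r (R + r cos v) du dv.
def torus_density(u, v, R=3.0, r=1.0):
    w = r * (R + r * math.cos(v))
    return (math.cos(v) / w) * w

sphere_total = integrate_curvature(sphere_density, (0, 2 * math.pi), (0, math.pi))
torus_total = integrate_curvature(torus_density, (0, 2 * math.pi), (0, 2 * math.pi))

chi_sphere = sphere_total / (2 * math.pi)  # approximates chi(S^2) = 2
chi_torus = torus_total / (2 * math.pi)    # approximates chi(T^2) = 0
```

The radii drop out of both integrals, as they must, since the right side of (7.23) is a topological invariant.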


8. Spin$^c$ manifolds


Here we consider a structure that arises more frequently than a spin structure,
namely a spin$^c$ structure. Let $M$ be an oriented Riemannian manifold of dimension
$n$, $P \to M$ the principal $SO(n)$-bundle of oriented orthonormal frames. A
spin$^c$ structure on $M$ is a principal bundle $Q \to M$ with structure group

(8.1) $\mathrm{Spin}^c(n) = \mathrm{Spin}(n) \times S^1 / \{(1, 1), (-1, -1)\} = G.$

Note that $\{-1, 1\} \subset \mathrm{Spin}(n)$ is the pre-image of the identity element of $SO(n)$.
For this principal bundle $Q$, we require that there be a bundle map $p : Q \to P$,
commuting with the natural $\mathrm{Spin}(n)$ actions on $Q$ and $P$.

There is a natural injection $\mathrm{Spin}(n) \hookrightarrow \mathrm{Spin}^c(n)$, as a normal subgroup. Note
that taking the quotient $R = Q/\mathrm{Spin}(n)$ produces a principal $S^1$-bundle, over
which $Q$ projects. We display the various principal bundles:


(8.2) $\begin{array}{ccc} Q & \longrightarrow & R \\ \downarrow & & \downarrow \\ P & \longrightarrow & M \end{array}$


There is a topological obstruction to the existence of a spin$^c$ structure on $M$,
though it is weaker than the obstruction to the existence of a spin structure. We
refer to [LM] for these topological considerations; we will give some examples of
spin$^c$ manifolds later in this section.

The standard representation of $S^1$ on $\mathbb{C}$ produces a complex line bundle

(8.3) $L \longrightarrow M.$

Suppose $n = 2k$. Recall the representation $D^+_{1/2} \oplus D^-_{1/2}$ of $\mathrm{Spin}(n)$ on $S(2k)$
from (3.19). If we take the product with the standard representation of $S^1$ on $\mathbb{C}$,
this is trivial on the factor group appearing in (8.1), so we get a representation of
$\mathrm{Spin}^c(n)$ on $S(2k)$, which we continue to denote $D^+_{1/2} \oplus D^-_{1/2}$. This representation
produces a vector bundle over $M$, which we continue to call a spinor bundle:

(8.4) $S(Q) = S_+(Q) \oplus S_-(Q); \quad S_\pm(Q) = Q \times_{D^\pm_{1/2}} S_\pm(2k).$

In case $n$ is odd, we have instead the bundle of spinors constructed from the
representation (3.24) of $\mathrm{Spin}(n)$, via the same sort of procedure.

As in §3, we will be able to define a Dirac operator on $C^\infty(M, S(Q))$ in terms
of a connection on $Q$, which we now construct. The Levi–Civita connection on
$M$ defines an $so(n)$-valued form $\theta_0$ on $P$, which pulls back to an $so(n)$-valued
form $\tilde\theta_0$ on $Q$. Endow the bundle $R = Q/\mathrm{Spin}(n) \to M$ with a connection $\theta_1$,
so $L \to M$ gets a metric connection. Then $\theta_1$ pulls back to an $i\mathbb{R}$-valued form $\tilde\theta_1$
on $Q$, and

(8.5) $\theta = \tilde\theta_0 + \tilde\theta_1$

defines a $\mathrm{spin}^c(n)$-valued form on $Q$, which gives rise to a connection on $Q$. This
leads to a connection on the spinor bundle $S(Q) \to M$, and the analogues of
Propositions 3.4 and 3.5 hold. Thus we produce the Dirac operator

(8.6) $D = i\, m \circ \nabla : C^\infty(M, S) \longrightarrow C^\infty(M, S).$


More generally, if $E \to M$ is a vector bundle with a metric connection, one
gets a Clifford connection on $S(Q) \otimes E$ and hence a twisted Dirac operator

(8.7) $D_E : C^\infty(M, S \otimes E) \longrightarrow C^\infty(M, S \otimes E).$

If $\dim M$ is even, $D_E$ maps sections of $S_\pm \otimes E$ to sections of $S_\mp \otimes E$.

We consider some ways in which spin$^c$ structures arise. First, a spin structure
gives rise to a spin$^c$ structure. Indeed, if the frame bundle $P \to M$ lifts to a
principal $\mathrm{Spin}(n)$-bundle $\tilde P \to M$, then $Q$ can be taken to be the quotient of the
product bundle $\tilde P \times S^1 \to M$ by the natural $\mathbb{Z}_2$-action on the fibers. The canonical
flat connection on $S^1 \times M \to M$ is used to provide $Q \to M$ with a connection,
and then the Dirac operator (8.6) defined by $Q \to M$ coincides with that defined
by $\tilde P \to M$.

Another family of examples of spin$^c$ structures of considerable importance
arises as follows. Suppose $M$ is a manifold of dimension $n = 2k$ with an almost
complex structure, $J : T_xM \to T_xM$, $J^2 = -I$. Endow $M$ with a Riemannian
metric such that $J$ is an isometry. $\mathcal{T}M$, which is $(TM, J)$ regarded as a complex
vector bundle of fiber dimension $k$, then acquires a natural Hermitian metric, as
in (2.9). The associated frame bundle $F \to M$ is a principal $U(k)$ bundle. Note
that

(8.8) $U(k) \approx SU(k) \times S^1 / \Gamma,$

where $\Gamma = \{(I, 1), (-I, -1)\}$. Since $SU(k)$ is simply connected, the inclusion
$U(k) \hookrightarrow SO(n)$ yields a uniquely defined homomorphism

(8.9) $SU(k) \longrightarrow \mathrm{Spin}(n),$

and hence a homomorphism

(8.10) $U(k) \approx SU(k) \times S^1 / \Gamma \longrightarrow \mathrm{Spin}^c(n).$

From the bundle $F \to M$, this gives rise to a principal $\mathrm{Spin}^c(n)$ bundle $Q \to M$.
In this case, the map $U(k) \to \mathrm{Spin}^c(n) \to S^1$ is given by the determinant,
$\det : U(k) \to S^1$. The principal $S^1$-bundle $R \to M$ is obtained by taking the
quotient of the principal $U(k)$-bundle $F$ by the action of $SU(k)$. The associated
line bundle $L \to M$ is seen to be

(8.11) $L = \Lambda^k_{\mathbb{C}} \mathcal{T}.$




Other geometrical structures give rise to spin$^c$ structures; we refer to [LM] for
more on this. We mention the following: any oriented hypersurface in a
spin$^c$ manifold inherits a natural spin$^c$ structure. In this fashion the sphere bundle
$S^*M$ over a Riemannian manifold gets a spin$^c$ structure, as a hypersurface of
$T^*M$, which can be given an almost complex structure.

Though a spin$^c$ structure is more general than a spin structure, it is a very
significant fact that a spin$^c$ structure in turn gives rise to a spin structure, in the
following circumstance. Namely, suppose the principal $S^1$-bundle $R \to M$ lifts
to a double cover

(8.12) $\tilde R \longrightarrow M,$

corresponding to the natural two-to-one surjective homomorphism $\mathrm{sq} : S^1 \to S^1$.
This is equivalent to the hypothesis that the line bundle $L \to M$ possess a "square
root" $\lambda \to M$:

(8.13) $\lambda \otimes \lambda = L.$


In such a case, the quotient of $Q \times \tilde R \to M$ by the natural action of $S^1$ on each
factor gives a lift of $Q$ to a principal $\mathrm{Spin}(n) \times S^1$-bundle

(8.14) $\tilde Q \longrightarrow M.$

Then the quotient

(8.15) $\tilde P = \tilde Q / S^1 \longrightarrow M$

defines a spin structure on $M$. The vector bundles $S(Q)$ and $S(\tilde P)$ are related by

(8.16) $S(Q) = S(\tilde P) \otimes \lambda.$

Furthermore, the connection on $S(Q)$ defined above coincides with the product
connection on $S(\tilde P) \otimes \lambda$ arising from the natural connections on each factor.
Therefore, if $D_E$ and $D^\#_E$ are respectively the twisted Dirac operators associated
with a vector bundle $E \to M$ (given a metric connection) via the spin$^c$ and spin
structures described above, then

(8.17) $D_E = D^\#_{\lambda \otimes E}.$

This holds, we recall, provided $L$ has a square root $\lambda$.

One consequence of this is the following extension of the Weitzenbock formula
(4.24). Namely, if $D_E$ is the twisted Dirac operator on $S(Q) \otimes E$ described above,
then applying (4.24) to the right side of (8.17) gives

(8.18) $D_E^2 = \nabla^* \nabla + K,$

with

(8.19) $K\varphi = \frac{1}{4} S \varphi - \frac{1}{2} \sum_{i,j} v_i v_j\, \omega^\lambda(e_i, e_j) \varphi - \frac{1}{2} \sum_{i,j} v_i v_j\, R^E(e_i, e_j) \varphi,$

or equivalently

(8.20) $K\varphi = \frac{1}{4} S \varphi - \frac{1}{4} \sum_{i,j} v_i v_j\, \omega^L(e_i, e_j) \varphi - \frac{1}{2} \sum_{i,j} v_i v_j\, R^E(e_i, e_j) \varphi,$

where, as before, $\{e_j\}$ is a local orthonormal frame of vector fields on $M$, with
dual frame field $\{v_j\}$. Here $\omega^\lambda$ is the curvature form of the line bundle $\lambda$ and $\omega^L$
that of $L$.

Now locally there is no topological obstruction to the existence of the lift
(8.12). Consequently, the identity (8.20) holds regardless of whether $L$ possesses
a global square root. Therefore, the proof of the local index formula given in §6
extends to this case. Furthermore, we have the pointwise identity of forms:

(8.21) $\mathrm{Ch}(\lambda \otimes E) = e^{c_1(\lambda)}\, \mathrm{Ch}(E), \quad c_1(\lambda) = \frac{1}{2} c_1(L),$

where $c_1$ is the first Chern class, defined in §7 of Appendix C. Therefore, we have
the following extension of Theorem 5.1:

Theorem 8.1. If $M$ is a compact Riemannian manifold of dimension $n = 2k$
with spin$^c$ structure and $D_E : C^\infty(M, S_+ \otimes E) \to C^\infty(M, S_- \otimes E)$ is a twisted
Dirac operator, then

(8.22) $\mathrm{Index}\, D_E = \langle e^{c_1(L)/2}\, \mathrm{Ch}(E)\, \hat A(M), [M] \rangle,$

where $L$ is the line bundle (8.3), and $c_1(L)$ is its first Chern class.


The index formula for twisted Dirac operators on spin$^c$ manifolds furnishes a
tool with which one can evaluate the index of general elliptic pseudodifferential
operators. Indeed, let $P$ be any elliptic pseudodifferential operator (of order $m$),

(8.23) $P : C^\infty(M, E_0) \longrightarrow C^\infty(M, E_1),$

$E_j \to M$ being vector bundles. Then, as seen in Chap. 7, we have the principal
symbol

(8.24) $\sigma_P \in C^\infty(S^*M, \mathrm{Hom}(\tilde E_0, \tilde E_1)),$

$\tilde E_j \to S^*M$ being the pull-backs of $E_j \to M$. The ellipticity of $P$ is equivalent
to $\sigma_P$ being an isomorphism at each point of $S^*M$. Now, we can construct a new
vector bundle $E$ over $BM$, the double of the ball bundle $B^*M$, as follows. We
let $E_j$ also denote the pull-back of $E_j$ to $B^*M$, and, when the two copies of
$B^*M$ are glued together along $S^*M$ to form $BM$, we also glue together $E_0$ and
$E_1$ over $S^*M$, using the isomorphism (8.24). The construction of $E \to BM$ by
this process is known as the "clutching construction." Now $BM$ can be given a
Riemannian metric, and also a spin$^c$ structure, arising from the almost complex
structure on $B^*M$. If $E$ is endowed with a connection, one obtains a twisted Dirac
operator $D_E$ on $BM$. The following result, together with the formula for Index
$D_E$ given by Theorem 8.1, provides the general Atiyah–Singer index formula.


Theorem 8.2. If $P$ is an elliptic pseudodifferential operator, giving rise to a
twisted Dirac operator $D_E$ by the clutching construction described above, then

(8.25) $\mathrm{Index}\, P = \mathrm{Index}\, D_E.$

The proof of this result will not be given here; it involves use of the Bott
periodicity theorem. Related approaches, computing Index $P$ from a knowledge of
the index of twisted signature operators, are discussed in [Pal] and [ABP]. A
refinement of (8.25), involving an identity in $K$-homology, is established in [BDT].


Exercises 


1. Consider the following zero-order pseudodifferential operator on $L^2(S^1)$:

$Q = M_f P + M_g (1 - P),$

where $P$ is the projection

$P \Bigl( \sum_{n=-\infty}^{\infty} c_n e^{in\theta} \Bigr) = \sum_{n=0}^{\infty} c_n e^{in\theta}.$

We assume $f$ and $g$ are smooth, complex-valued functions; $M_f u = fu$. If $f$ and $g$ are
nowhere vanishing on $S^1$, $Q$ is elliptic. A formula for its index is produced in Exercises
1-5 of Chap. 4, §3.

Construct the associated twisted Dirac operator $D_E$, acting on sections of a vector
bundle over the manifold $BS^1 \approx \mathbb{T}^2$. Evaluate the index of $D_E$ using Theorem 8.1,
and verify the identity (8.25) in this case.


9. The Riemann—Roch theorem 


In this section we will show how the index formula (8.22) implies the classical
Riemann–Roch formula on compact Riemann surfaces, and we also discuss
some of the implications of that formula. For implications of generalizations of
the Riemann–Roch formula to higher-dimensional, compact, complex manifolds,
which also follow from (8.22), see [Har] and [Hir].




Let $M$ be a compact two-dimensional manifold, with a complex structure, defined
by $J : T_xM \to T_xM$, $J^2 = -I$. As shown in Chap. 5, §10, this a priori
"almost complex" structure automatically gives rise to holomorphic charts on $M$
in this dimension. We can put a Riemannian metric and an orientation on $M$ such
that $J$ is an isometry on each tangent space, counterclockwise rotation by 90°.
Then $TM$ gets the structure of a complex line bundle, which we denote $\mathcal{T}M$,
with a Hermitian metric. We have the dual line bundle $\mathcal{T}'M$. Note that the Hermitian
metric on $\mathcal{T}M$ yields a Hermitian metric on $\mathcal{T}'M$ and also produces a
conjugate linear bundle isomorphism of $\mathcal{T}M$ with $\mathcal{T}'M$. We also define the complex
line bundle $\bar{\mathcal{T}}M$ to be the tangent bundle $TM$ with complex structure given
by $-J$ and $\bar{\mathcal{T}}'M$ to be its dual.

A function $u \in C^\infty(M)$ is holomorphic if $\partial u/\partial \bar z = 0$ in any local holomorphic
coordinate system and is antiholomorphic if $\partial u/\partial z = 0$. We denote the space
of holomorphic functions on an open set $U \subset M$ by $\mathcal{O}_U$, and antiholomorphic
functions by $\bar{\mathcal{O}}_U$. There are invariantly defined operators

(9.1) $\partial : C^\infty(M) \longrightarrow C^\infty(M, \mathcal{T}'), \quad \bar\partial : C^\infty(M) \longrightarrow C^\infty(M, \bar{\mathcal{T}}'),$

given as follows. If $X$ is a real vector field, namely, a section of $TM$, set

(9.2) $\partial_X u = \frac{1}{2}(Xu - i(JX)u), \quad \bar\partial_X u = \frac{1}{2}(Xu + i(JX)u).$

Note that

(9.3) $\partial_{JX} u = i\, \partial_X u, \quad \bar\partial_{JX} u = -i\, \bar\partial_X u,$

which justifies (9.1).
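
The identities (9.3) follow from (9.2) together with $J^2 = -I$, and they can be checked pointwise with a few lines of arithmetic. The sketch below works at a single point of $\mathbb{R}^2$ with $J$ acting as counterclockwise rotation by 90°. The function names and the sample vector and gradient are assumptions made for illustration.

```python
def J(X):
    """Counterclockwise rotation by 90 degrees on R^2."""
    a, b = X
    return (-b, a)

def directional(X, grad_u):
    """Xu for X = (a, b), given the complex gradient (u_x, u_y) of u at a point."""
    return X[0] * grad_u[0] + X[1] * grad_u[1]

def d_X(X, grad_u):
    # (9.2): d_X u = (1/2)(Xu - i (JX)u)
    return 0.5 * (directional(X, grad_u) - 1j * directional(J(X), grad_u))

def dbar_X(X, grad_u):
    # (9.2): dbar_X u = (1/2)(Xu + i (JX)u)
    return 0.5 * (directional(X, grad_u) + 1j * directional(J(X), grad_u))

# Hypothetical sample values, chosen only for illustration.
X = (1.5, -2.0)
grad_u = (0.3 + 1.1j, -0.7 + 0.4j)

lhs_d, rhs_d = d_X(J(X), grad_u), 1j * d_X(X, grad_u)
lhs_dbar, rhs_dbar = dbar_X(J(X), grad_u), -1j * dbar_X(X, grad_u)
```

Agreement of each pair says that $X \mapsto \partial_X u$ is complex linear with respect to $J$ and $X \mapsto \bar\partial_X u$ is complex linear with respect to $-J$, which is what places $\partial u$ in $\mathcal{T}'$ and $\bar\partial u$ in $\bar{\mathcal{T}}'$.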

In addition to holomorphic functions, we also have the notion of a holomorphic
line bundle over $M$. Given a complex line bundle $L \to M$, let $\{U_j\}$ be a covering
of $M$ by geodesically convex sets. A holomorphic structure on $L$ is a choice of
nowhere-vanishing sections $s_j$ of $L$ over $U_j$ such that $s_j = \sigma_{jk} s_k$ on $U_{jk} =
U_j \cap U_k$, with $\sigma_{jk}$ holomorphic complex-valued functions. Similarly, a choice of
nowhere-vanishing sections $t_j$ of $L$ over $U_j$ such that $t_j = \tau_{jk} t_k$ on $U_{jk}$, $\tau_{jk}$
antiholomorphic, gives $L$ the structure of an antiholomorphic line bundle.

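The bundle $\mathcal{T}M$ has a natural structure of a holomorphic line bundle; in a local
holomorphic coordinate system $\{U_j\}$, let $s_j = \partial/\partial x$. $\mathcal{T}'$ is a holomorphic line
bundle with $s_j = dx$. To see this, note that if $\psi : U \to V$ is a holomorphic map
relating two local coordinate charts on $M$, $\psi = u + iv$, then $(D\psi)(\partial/\partial x)$ is equal
to

$\frac{\partial u}{\partial x} \frac{\partial}{\partial x} + \frac{\partial v}{\partial x} \frac{\partial}{\partial y} = \frac{\partial u}{\partial x} \frac{\partial}{\partial x} + \frac{\partial v}{\partial x} J \frac{\partial}{\partial x} = \Bigl( \frac{\partial u}{\partial x} + i \frac{\partial v}{\partial x} \Bigr) \frac{\partial}{\partial x} = \frac{\partial \psi}{\partial x} \frac{\partial}{\partial x}.$

Here, the first two quantities are regarded as local sections of $TM$, the last two
as local sections of $\mathcal{T}M$. Similarly, $\bar{\mathcal{T}}$ and $\bar{\mathcal{T}}'$ have natural structures as antiholomorphic
line bundles, using the same choices of local sections as above.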

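It is also common to identify $\mathcal{T}M$ and $\bar{\mathcal{T}}M$ with complementary subbundles
of the complexified tangent bundle $T_{\mathbb{C}}M = \mathbb{C} \otimes TM$, a complex vector
bundle whose fibers are two-dimensional complex vector spaces. Namely, the local
section $\partial/\partial x$ of $\mathcal{T}M$ is identified with $(1/2)(\partial/\partial x - i\, \partial/\partial y) = \partial/\partial z$ to
yield $\mathcal{T}M \hookrightarrow T_{\mathbb{C}}M$, and it is identified with $(1/2)(\partial/\partial x + i\, \partial/\partial y) = \partial/\partial \bar z$ to
yield $\bar{\mathcal{T}}M \hookrightarrow T_{\mathbb{C}}M$. More generally, these two maps are given respectively by
$X \mapsto (1/2)(X - iJX)$ and $X \mapsto (1/2)(X + iJX)$. Identifying $\mathcal{T}M$ and $\bar{\mathcal{T}}M$
with their images in $T_{\mathbb{C}}M$, we have

$T_{\mathbb{C}}M = \mathcal{T}M \oplus \bar{\mathcal{T}}M.$

Similarly, we have the complexified cotangent bundle $T^*_{\mathbb{C}}M = \mathbb{C} \otimes T^*M$, and
natural injections $\mathcal{T}'M \hookrightarrow T^*_{\mathbb{C}}M$, $\bar{\mathcal{T}}'M \hookrightarrow T^*_{\mathbb{C}}M$, so that

$T^*_{\mathbb{C}}M = \mathcal{T}'M \oplus \bar{\mathcal{T}}'M.$

In this case, $dx$ is mapped respectively to $(dx + i\, dy)/2 = dz/2$ and to
$(dx - i\, dy)/2 = d\bar z/2$.

We use the following common notation for these line bundles equipped with
these extra structures:

(9.4) $\mathcal{T}' = \kappa, \quad \bar{\mathcal{T}}' = \bar\kappa, \quad \mathcal{T} = \kappa^{-1}, \quad \bar{\mathcal{T}} = \bar\kappa^{-1}.$

We can rewrite (9.1) as

$\partial : C^\infty(M) \longrightarrow C^\infty(M, \kappa), \quad \bar\partial : C^\infty(M) \longrightarrow C^\infty(M, \bar\kappa).$

We note that $\kappa^{-1}$ and $\bar\kappa$ are isomorphic as $C^\infty$-line bundles; $\kappa$ is called the canonical
bundle.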

More generally, if $L \to M$ is any holomorphic line bundle, we have a naturally
defined operator

(9.5) $\bar\partial : C^\infty(M, L) \longrightarrow C^\infty(M, L \otimes \bar\kappa),$

defined as follows. Pick any local (nowhere-vanishing) holomorphic section $S$
of $L$, for example, $S = s_j$ on $U_j$, used in the definition above of holomorphic
structure. Then an arbitrary section $u$ is of the form $u = vS$, $v$ complex-valued,
and we set

(9.6) $\bar\partial u = \frac{\partial v}{\partial \bar z}\, S \otimes d\bar z.$

It is easy to see that this is independent of the choice of holomorphic section
$S$ or of local holomorphic coordinate system. Sometimes, to emphasize the dependence
of (9.5) on $L$, we denote this operator by $\bar\partial_L$. The operator (9.5) is
a first-order, elliptic differential operator, and the Riemann–Roch formula is a
formula for its index.

The kernel of $\bar\partial_L$ in (9.5) consists of holomorphic sections of $L$; namely,
sections $u$ such that, with respect to the defining sections $s_j$ on $U_j$, $u = v_j s_j$
with $v_j$ holomorphic. We denote this space of holomorphic sections by

(9.7) $\mathcal{O}(L) = \ker \bar\partial_L.$

The significance of the Riemann–Roch formula lies largely in its use as a tool for
understanding as much as possible about the spaces (9.7).

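The cokernel of $\bar\partial_L$ in (9.5) can be interpreted as follows. The Hermitian metric
on $\mathcal{T}$ gives rise to a trivialization of $\kappa \otimes \bar\kappa$ and to a duality of $L^2(M, L \otimes \bar\kappa)$ with
$L^2(M, L^{-1} \otimes \kappa)$. With respect to this duality, the adjoint of $\bar\partial_L$ is

(9.8) $-\bar\partial : C^\infty(M, L^{-1} \otimes \kappa) \longrightarrow C^\infty(M, L^{-1} \otimes \kappa \otimes \bar\kappa).$

Consequently,

(9.9) $\mathrm{Index}\, \bar\partial_L = \dim \mathcal{O}(L) - \dim \mathcal{O}(L^{-1} \otimes \kappa).$

The Riemann–Roch theorem will produce a formula for (9.9) in terms of topological
information, specifically, in terms of $c_1(L)$ and $c_1(\kappa)$.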

Recall that $M$ has a natural spin$^c$ structure, arising from its complex structure.
We will produce a twisted Dirac operator on $M$ whose index is the same as that of
$\bar\partial_L$. In fact, when the construction of the spinor bundle made in §8 is specialized
to the case at hand, we get

(9.10) $S_+ = 1, \quad S_- = \bar{\mathcal{T}}' = \bar\kappa,$

where 1 denotes the trivial line bundle over $M$. Furthermore, the line bundle denoted
as $L$ in (8.11) is $\mathcal{T} \approx \kappa^{-1}$. If $L$ is a (holomorphic) line bundle over $M$,
we give $L$ a Hermitian metric and metric connection $\nabla$. Then the twisted Dirac
operator

(9.11) $D_L : C^\infty(M, L) \longrightarrow C^\infty(M, L \otimes \bar\kappa)$

is given by

(9.12) $(D_L u, X) = \frac{1}{2} (\nabla_X u + i \nabla_{JX} u),$

for $X$ a section of $TM$, identified with $\bar{\mathcal{T}}M \approx \bar\kappa^{-1}$, noting that

(9.13) $(D_L u, JX) = -i\, (D_L u, X).$

It is easy to see that $\bar\partial_L$ and $D_L$ are differential operators with the same principal
symbol. Disregarding the question of whether one can pick a connection on
$L$ making these operators equal, we clearly have

(9.14) $\mathrm{Index}\, \bar\partial_L = \mathrm{Index}\, D_L.$


Now applying the index formula (8.22) to the right side of (9.14) gives

(9.15) $\mathrm{Index}\, D_L = \langle e^{c_1(\mathcal{T})/2}\, \mathrm{Ch}(L)\, \hat A(M), [M] \rangle.$

Since $\hat A(M)$ is 1 plus a formal sum of forms of degree 4, 8, ..., we obtain

(9.16) $\mathrm{Index}\, D_L = c_1(L)[M] - \frac{1}{2} c_1(\kappa)[M].$

Putting together (9.9), (9.14), and (9.16) gives the Riemann–Roch formula:

Theorem 9.1. If $L$ is a holomorphic line bundle over a compact Riemann surface
$M$, with canonical bundle $\kappa$, then

(9.17) $\dim \mathcal{O}(L) - \dim \mathcal{O}(L^{-1} \otimes \kappa) = c_1(L)[M] - \frac{1}{2} c_1(\kappa)[M].$


According to the characterization of the Chern classes given in §7 of
Appendix C, if $L$ has a connection with curvature 2-form $\omega_L$, then

(9.18) $c_1(L)[M] = -\frac{1}{2\pi i} \int_M \omega_L.$

In particular, $c_1(\kappa)[M]$ is given by the Gauss–Bonnet formula:

(9.19) $c_1(\kappa)[M] = -\chi(M) = 2g - 2,$

where $\chi(M)$ is the Euler characteristic and $g$ is the genus of $M$.
We begin to draw some conclusions from the Riemann–Roch formula (9.17).
First, for the trivial line bundle 1 we clearly have

(9.20) $\dim \mathcal{O}(1) = 1,$

assuming $M$ is connected, since holomorphic functions on $M$ must be constant.
If we apply (9.17) to $L = \kappa$, using $\kappa^{-1} \otimes \kappa = 1$ and the formula (9.19), we obtain

(9.21) $\dim \mathcal{O}(\kappa) = g.$
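
Spelled out, the computation behind (9.21) takes three lines. The display below only substitutes $L = \kappa$ into (9.17) and then uses (9.19) and (9.20); it introduces nothing beyond those formulas.

```latex
\begin{align*}
\dim \mathcal{O}(\kappa) - \dim \mathcal{O}(\kappa^{-1} \otimes \kappa)
  &= c_1(\kappa)[M] - \tfrac{1}{2}\, c_1(\kappa)[M], \\
\dim \mathcal{O}(\kappa) - 1 &= \tfrac{1}{2}(2g - 2) = g - 1, \\
\dim \mathcal{O}(\kappa) &= g .
\end{align*}
```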




The space $\mathcal{O}(\kappa)$ is called the space of holomorphic 1-forms, or "Abelian differentials."
We claim there is a decomposition

(9.22) $\mathcal{H}_1(M) = \mathcal{O}(\kappa) \oplus \bar{\mathcal{O}}(\bar\kappa),$

of the space $\mathcal{H}_1(M)$ of (complex) harmonic 1-forms on $M$ into a direct sum of
$\mathcal{O}(\kappa)$ and the space $\bar{\mathcal{O}}(\bar\kappa)$ of antiholomorphic sections of $\bar\kappa$. In fact, the Hodge
star operator $* : \Lambda^1 M \to \Lambda^1 M$, extended to be $\mathbb{C}$-linear on $\mathbb{C} \otimes \Lambda^1 M$, acts on
$\mathcal{H}_1(M)$, with $** = -1$, and $\mathcal{O}(\kappa)$ and $\bar{\mathcal{O}}(\bar\kappa)$ are easily seen to be the $i$ and $-i$
eigenspaces of $*$ in $\mathcal{H}_1(M)$. Furthermore, there is a conjugate linear isomorphism

(9.23) $C : \mathcal{O}(\kappa) \longrightarrow \bar{\mathcal{O}}(\bar\kappa)$

given in local holomorphic coordinates by

$C(u(z)\, dz) = \overline{u(z)}\, d\bar z.$

Now (9.22) and (9.23) imply

(9.24) $\dim \mathcal{O}(\kappa) = \frac{1}{2} \dim \mathcal{H}_1(M) = \frac{1}{2} \dim \mathcal{H}^1(M, \mathbb{C}),$

where $\mathcal{H}^1(M, \mathbb{C})$ is a deRham cohomology group, and the last identity is by
Hodge theory. Granted that $\dim \mathcal{H}^1(M, \mathbb{C}) = 2g$, this gives an alternative derivation
of (9.21), not using the Riemann–Roch theorem. Compare the derivation of
(10.28) in Chapter 5.

The Hodge theory used to get the last identity in (9.24) is contained in
Proposition 8.3 of Chap. 5. Actually, in §8 of Chap. 5, $\mathcal{H}_1$ denoted the space of
real harmonic 1-forms, which was shown to be isomorphic to the real deRham
cohomology group $\mathcal{H}^1(M, \mathbb{R})$, which in turn was denoted $\mathcal{H}^1(M)$ there.

Just for fun, we note the following. Suppose that instead of (9.17) one had in
hand the weaker result

(9.25) $\dim \mathcal{O}(L) - \dim \mathcal{O}(L^{-1} \otimes \kappa) = A\, c_1(L)[M] + B\, c_1(\kappa)[M],$

with constants $A$ and $B$ that had not been calculated. Then using the results (9.19)
and (9.21), one can determine $A$ and $B$. Indeed, substituting $L = 1$ into (9.25)
gives $1 - g = B(2g - 2)$, while substituting $L = \kappa$ in (9.25) gives $g - 1 =
(A + B)(2g - 2)$. As long as $g \neq 1$, this forces $A = 1$, $B = -1/2$. The $g = 1$
case would also follow if one knew that (9.25) held with constants independent
of $M$.
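
The elimination just described can be carried out mechanically. The following sketch solves the two linear equations for $A$ and $B$ in exact rational arithmetic for a few sample genera; the function name `solve_constants` and the choice of genera are illustrative assumptions.

```python
from fractions import Fraction

def solve_constants(g):
    """Solve 1 - g = B(2g - 2) and g - 1 = (A + B)(2g - 2) for (A, B), assuming g != 1."""
    if g == 1:
        raise ValueError("both equations reduce to 0 = 0 when g = 1")
    B = Fraction(1 - g, 2 * g - 2)       # from L = 1
    A = Fraction(g - 1, 2 * g - 2) - B   # from L = kappa
    return A, B

results = {g: solve_constants(g) for g in (0, 2, 3, 10)}
```

Every genus other than 1 returns the same pair, in agreement with the coefficients in (9.17); the function raises an error for $g = 1$, where the two equations carry no information.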

Before continuing to develop implications of the Riemann–Roch formula, we
note that, in addition to $\mathcal{O}(L)$, it is also of interest to study $\mathcal{M}(L)$, the space of
meromorphic sections of a holomorphic line bundle. The following is a fundamental
existence result.

Proposition 9.2. If $L \to M$ is a holomorphic line bundle, there exist nontrivial
elements of $\mathcal{M}(L)$.

Proof. The operator (9.5) extends to

(9.26) $\bar\partial : H^{s+1}(M, L) \longrightarrow H^s(M, L \otimes \bar\kappa),$

which is Fredholm. There are elements $v_1, \dots, v_K \in C^\infty(M, L^{-1} \otimes \kappa)$ such that,
for all $s \in \mathbb{R}$, if $f \in H^s(M, L \otimes \bar\kappa)$ and $\langle f, v_j \rangle = 0$ for $j = 1, \dots, K$, then there
exists $u \in H^{s+1}(M, L)$ such that $\bar\partial u = f$. Now, for $s < -1$, there is a finite
linear combination of "delta functions," in $H^s(M, L \otimes \bar\kappa)$, orthogonal to these $v_j$.
Denote such an $f$ by $f = \sum a_j \delta_{p_j}$. Then let $u \in H^{s+1}(M, L)$ satisfy $\bar\partial u = f$.
In particular, $\bar\partial u = 0$ on the complement of a finite set of points. Near each $p \in
\mathrm{supp}\, f$, $u$ looks like the Cauchy kernel, so $u$ is a nontrivial meromorphic section
of $L$.


Such an existence result need not hold for $\mathcal{O}(L)$; in Corollary 9.4 we will see
a condition that guarantees $\mathcal{O}(L) = 0$. Such a result should not be regarded in
a negative light; indeed knowing that $\mathcal{O}(L) = 0$ for some line bundles can give
important information on $\mathcal{O}(L_1)$ for certain other line bundles, as we will see.

Any nontrivial $u \in \mathcal{M}(L)$ will have a finite number of zeros and poles. If $p$ is
a zero of $u$, let $\nu_u(p)$ be the order of the zero; if $p$ is a pole of $u$, let $-\nu_u(p)$ be the
order of the pole. We define the "divisor" of $u \in \mathcal{M}(L)$ to be the formal finite sum

(9.27) $\vartheta(u) = \sum_p \nu_u(p) \cdot p$

over the set of zeros and poles of $u$. It is a simple exercise in complex analysis
that if $u$ is a nontrivial meromorphic function on $M$ (i.e., an element of $\mathcal{M}(1)$),
then $\sum_p \nu_u(p) = 0$. The following is a significant generalization of that.


Proposition 9.3. If $L \to M$ is a holomorphic line bundle and $u \in \mathcal{M}(L)$ is
nontrivial, then

(9.28) $c_1(L)[M] = \sum_p \nu_u(p).$

Proof. The left side of (9.28) is given by (9.18), where $\omega_L$ is the curvature 2-form
associated to any connection on $L$. We will use the formula

(9.29) $-\frac{1}{2\pi i} \int_M \omega_L = \mathrm{Index}\, X,$

for any $X \in C^\infty(M, L)$ with nondegenerate zeros, proved in Appendix C,
Proposition 5.4, as a variant of the Gauss–Bonnet theorem. The section $X$ will be
constructed from $u \in \mathcal{M}(L)$ as follows. Except on the union of small neighborhoods
of the poles of $u$, we take $X = u$. Near the poles of $u$, write $u = vS$, $S$ a
nonvanishing holomorphic section of $L$ defined on a neighborhood of such poles,
$v$ meromorphic. Pick $R > 0$ sufficiently large, and replace $u$ by $(R^2/\bar v) S$, where
$|v| > R$. Smooth out $X$ near the loci $|v| = R$. Then the formula (9.29) for $X$ is
equivalent to the desired formula, (9.28).


The following is an immediate consequence. 


Corollary 9.4. If $L \to M$ is a holomorphic line bundle with $c_1(L)[M] < 0$, then
every nontrivial $u \in \mathcal{M}(L)$ has poles; hence $\mathcal{O}(L) = 0$.
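
The simplest instance of (9.28) is $L = 1$ on the Riemann sphere, where a meromorphic function is a rational function and $c_1(1)[M] = 0$. The sketch below builds the divisor (9.27) of such a function, including the point at infinity, so that one can confirm the orders sum to zero. The function name, the key `"inf"`, and the sample function are assumptions made for illustration.

```python
from collections import Counter

def divisor_of_rational_function(zeros, poles):
    """Divisor of u = prod (z - a) / prod (z - b) on the Riemann sphere.

    `zeros` and `poles` list the finite roots of numerator and denominator,
    repeated according to multiplicity. Returns a dict point -> order nu_u(p),
    with the point at infinity recorded under the key "inf".
    """
    nu = Counter()
    for a in zeros:
        nu[a] += 1
    for b in poles:
        nu[b] -= 1
    # Near infinity u behaves like z^(deg num - deg den), so the order of
    # vanishing there is deg den - deg num.
    nu["inf"] = len(poles) - len(zeros)
    return {p: n for p, n in nu.items() if n != 0}

# Hypothetical example: u(z) = z^2 (z - 1) / (z + 2).
div = divisor_of_rational_function(zeros=[0, 0, 1], poles=[-2])
total_order = sum(div.values())
```

Here the double pole at infinity balances the three finite zeros and the one finite pole.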


Note that if $c_1(L)[M] = 0$ and $\mathcal{O}(L) \neq 0$, by (9.28) we have that any $u \in
\mathcal{O}(L)$ not identically zero is nowhere vanishing. Thus we have

(9.30) $c_1(L)[M] = 0,\ \mathcal{O}(L) \neq 0 \Longrightarrow L$ is a trivial holomorphic line bundle.

To relate Corollary 9.4 to the Riemann–Roch formula (9.17), we note that
since $\dim \mathcal{O}(L^{-1} \otimes \kappa) \geq 0$, (9.17) yields Riemann's inequality:

(9.31) $\dim \mathcal{O}(L) \geq c_1(L)[M] - g + 1.$

In view of the identities

(9.32) $c_1(L_1 \otimes L_2)[M] = c_1(L_1)[M] + c_1(L_2)[M], \quad c_1(L^{-1})[M] = -c_1(L)[M],$

we see that

(9.33) $c_1(L)[M] > 2g - 2 \Longrightarrow \mathcal{O}(L^{-1} \otimes \kappa) = 0.$

Thus we have the following sharpening of Riemann's inequality:

Proposition 9.5. If $M$ has genus $g$ and $c_1(L)[M] > 2g - 2$, then

(9.34) $\dim \mathcal{O}(L) = c_1(L)[M] - g + 1.$
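
The results so far can be collected into a small lookup that reports what is known about $\dim \mathcal{O}(L)$ from the genus $g$ and the integer $d = c_1(L)[M]$ alone. This is a sketch for bookkeeping only; the function names and the convention of returning `None` for an unknown upper bound are assumptions of the example.

```python
def holomorphic_sections_info(g, d):
    """What Corollary 9.4, (9.31), and Proposition 9.5 say about dim O(L).

    g is the genus of M and d = c_1(L)[M]. Returns (lower, upper), where
    upper is None when these three results give no upper bound.
    """
    if d < 0:
        return (0, 0)                    # Corollary 9.4
    if d > 2 * g - 2:
        exact = d - g + 1                # Proposition 9.5, (9.34)
        return (exact, exact)
    return (max(0, d - g + 1), None)     # Riemann's inequality (9.31)

def index_dbar(g, d):
    """Right side of (9.17), using c_1(kappa)[M] = 2g - 2 from (9.19)."""
    return d - (g - 1)
```

For example, `holomorphic_sections_info(2, 5)` gives the exact value from (9.34), while `holomorphic_sections_info(3, 2)` returns only the trivial lower bound, which is the range where further information is needed.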


Generalizing (9.27), we say a divisor on $M$ is a finite formal sum

(9.35) $\vartheta = \sum_p \nu(p) \cdot p,$

$\nu(p)$ taking values in $\mathbb{Z}$. One defines $-\vartheta$ and the sum of two divisors in the obvious
fashion. To any divisor $\vartheta$ we can associate a holomorphic line bundle, denoted $E_\vartheta$;
one calls $E_\vartheta$ a divisor bundle. To construct $E_\vartheta$, it is most convenient to use the
method of transition functions. Cover $M$ with holomorphic coordinate sets $U_j$,
pick $\psi_j \in \mathcal{M}_{U_j}$, having a pole of order exactly $|\nu(p)|$ at $p$, if $\nu(p) < 0$, a zero
of order exactly $\nu(p)$ if $\nu(p) > 0$ (provided $p \in U_j$), and no other poles or zeros.
The transition functions

(9.36) $\varphi_{jk} = \psi_k^{-1} \psi_j$

define a holomorphic line bundle $E_\vartheta$. The collection $\{\psi_j, U_j\}$ defines a
meromorphic section

(9.37) $\psi \in \mathcal{M}(E_\vartheta)$

and

(9.38) $-\vartheta(\psi) = \vartheta.$

Thus Proposition 9.3 implies

(9.39) $c_1(E_\vartheta) = -\sum_p \nu(p) = \langle \vartheta \rangle,$

where the last identity defines $\langle \vartheta \rangle$.

Divisor bundles help one study meromorphic sections of one line bundle in 
terms of holomorphic sections of another. A basic question in Riemann surface 
theory is when can one construct a meromorphic function on M (more generally, 
a meromorphic section of L) with prescribed poles and zeros. A closely related 
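question is the following. Given a divisor $\vartheta$ on $M$, describe the space


(9.40)    $\mathcal{M}(L, \vartheta) = \{u \in \mathcal{M}(L) : \vartheta(u) \ge \vartheta\},$

where $\vartheta_1 \ge \vartheta$ means $\vartheta_1 - \vartheta \ge 0$, that is, all integers $\mu(p)$ in $\vartheta_1 - \vartheta = \sum \mu(p) \cdot p$
are $\ge 0$. When $L = 1$, we simply write $\mathcal{M}(\vartheta)$ for the space (9.40). A straightforward
consequence of the construction of $E_\vartheta$ is the following:

Proposition 9.6. There is a natural isomorphism


(9.41)    $\mathcal{M}(L, \vartheta) \approx \mathcal{O}(L \otimes E_\vartheta).$


Proof. The isomorphism takes $u \in \mathcal{M}(L, \vartheta)$ to $u\psi$, where $\psi$ is described by
(9.36)-(9.37).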


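We can hence draw some conclusions about the dimension of $\mathcal{M}(L, \vartheta)$. From
the identity (9.34) we have


(9.42)    $c_1(L)[M] + \langle\vartheta\rangle > 2g - 2 \Longrightarrow \dim \mathcal{M}(L, \vartheta) = c_1(L)[M] + \langle\vartheta\rangle - g + 1,$


and, in particular,


(9.43)    $\langle\vartheta\rangle > 2g - 2 \Longrightarrow \dim \mathcal{M}(\vartheta) = \langle\vartheta\rangle - g + 1.$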
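
As an illustration of (9.39) and (9.43), here is a minimal Python sketch. It assumes a divisor is stored as a dictionary from point labels to the integers $\nu(p)$; the function names `bracket` and `dim_M` are hypothetical and serve only this example.

```python
def bracket(divisor):
    """Return <theta> = -sum of nu(p), following (9.39)."""
    return -sum(divisor.values())


def dim_M(divisor, genus):
    """Return dim M(theta) when (9.43) applies, and None otherwise.

    (9.43) only covers the range <theta> > 2g - 2; outside that range
    this sketch makes no claim.
    """
    deg = bracket(divisor)
    if deg > 2 * genus - 2:
        return deg - genus + 1
    return None


# theta = -p on a surface of genus 0: <theta> = 1 and dim M(theta) = 2.
print(dim_M({"p": -1}, 0))
# theta = -p - q + r on a surface of genus 1: <theta> = 1 and dim M(theta) = 1.
print(dim_M({"p": -1, "q": -1, "r": 1}, 1))
```

The two sample divisors are the ones used in the proof of Proposition 9.8 and in (9.51).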


Also one has general inequalities, as a consequence of (9.31). 

Now Corollary 9.4 and Proposition 9.5 specify precisely $\dim \mathcal{O}(L)$ provided
either $c_1(L)[M] < 0$ or $c_1(L)[M] > 2g - 2$, but (9.31) gives weaker information
if $0 \le c_1(L)[M] \le 2g - 2$; in fact, for $c_1(L)[M] \le g - 1$, it gives no information
at all. In this range the lower bound (9.31) can be complemented by an upper
bound. For example, (9.30) implies


(9.44)    $c_1(L)[M] = 0 \Longrightarrow \dim \mathcal{O}(L) = 0$ or $1$.


We will show later that both possibilities can occur. We now establish the follow- 
ing generalization of (9.44). 


Proposition 9.7. Let $k = 0, 1, \dots, g - 1$. Then, for a holomorphic line bundle
$L \to M$,

(9.45)    $c_1(L)[M] = g - 1 - k \Longrightarrow 0 \le \dim \mathcal{O}(L) \le g - k$

and

(9.46)    $c_1(L)[M] = g - 1 + k \Longrightarrow k \le \dim \mathcal{O}(L) \le g.$


Proof. First we establish (9.46). The lower estimate follows from (9.31). For
the upper estimate, pick any divisor $\vartheta \le 0$ with $\langle\vartheta\rangle = g - k$. Then
$\dim \mathcal{O}(L) \le \dim \mathcal{M}(L, \vartheta) = \dim \mathcal{O}(L \otimes E_\vartheta)$, which is equal to $g$ since
$c_1(L \otimes E_\vartheta)[M] = 2g - 1$ and Proposition 9.5 applies. The upper estimate in (9.45)
follows by interchanging $L$ and $L^{-1} \otimes \kappa$ in the Riemann-Roch identity.


To illustrate (9.46), we note the following complement to (9.44): 
(9.47)    $c_1(L)[M] = 2g - 2 \Longrightarrow \dim \mathcal{O}(L) = g - 1$ or $g$.


On the other hand, the closer $c_1(L)[M]$ gets to $g - 1$, the greater the uncertainty
in $\dim \mathcal{O}(L)$, except of course when $g = 0$; then Corollary 9.4 and Proposition
9.5 cover all possibilities. It turns out that, for "typical" $L$, the minimum value of
$\dim \mathcal{O}(L)$ in (9.45)-(9.46) is achieved; see [Gu].
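
The case analysis of Corollary 9.4, Proposition 9.5, and Proposition 9.7 can be collected in a few lines of Python. This is only a sketch: the function name `dim_O_bounds` and the convention of returning a pair (lower, upper) are assumptions made for this example, with `d` standing for $c_1(L)[M]$.

```python
def dim_O_bounds(g, d):
    """Return (lower, upper) bounds for dim O(L), where d = c_1(L)[M]."""
    if d < 0:
        return (0, 0)        # Corollary 9.4
    if d > 2 * g - 2:
        n = d - g + 1        # Proposition 9.5
        return (n, n)
    if d <= g - 1:
        k = g - 1 - d        # (9.45)
        return (0, g - k)
    k = d - (g - 1)          # (9.46)
    return (k, g)


print(dim_O_bounds(1, 0))    # the situation of (9.44)
print(dim_O_bounds(3, 4))    # the situation of (9.47), with 2g - 2 = 4
```

When the two entries agree, the dimension is determined; otherwise the pair only records the range permitted by the results above.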

We now use some of the results derived above to obtain strong results on the 
structure of compact Riemann surfaces of genus g = 0 and 1. 


Proposition 9.8. If $M$ is a compact Riemann surface of genus $g = 0$, then $M$ is
holomorphically diffeomorphic to the Riemann sphere $S^2$.


Proof. Pick $p \in M$; with $\vartheta = -p$, so $\langle\vartheta\rangle = 1$, (9.43) implies $\dim \mathcal{M}(\vartheta) = 2$. Of
course, the constants form a one-dimensional subspace of $\mathcal{M}(\vartheta)$; thus we know
that there is a nonconstant $u \in \mathcal{M}(\vartheta)$; $u$ cannot be holomorphic, so it must have
a simple pole at $p$. The proof thus follows from the next result.




Proposition 9.9. If there exists a meromorphic function $u$ on a compact Riemann
surface $M$, regular except at a single point, where it has a simple pole, then $M$ is
holomorphically diffeomorphic to $S^2$.


Proof. By the simple argument mentioned above (9.28), $u$ must have precisely
one zero, a simple zero. By the same reasoning, for any $\lambda \in \mathbb{C}$, $u - \lambda$ must
have precisely one simple zero, so $u : M \to \mathbb{C} \cup \{\infty\} = S^2$ is a holomorphic
diffeomorphism.


Proposition 9.10. If $M$ is a compact Riemann surface of genus $g = 1$, then there
exists a lattice $\Gamma \subset \mathbb{C}$ such that $M$ is holomorphically diffeomorphic to $\mathbb{C}/\Gamma$.


Proof. By (9.21), or alternatively by (9.24), $\dim \mathcal{O}(\kappa) = 1$ in this case. Pick a
nontrivial section $\xi$. By (9.28), $\sum \nu_\xi(p) = 2g - 2 = 0$. Since $\xi$ has no poles, it
also has no zeros, that is, $\kappa$ is holomorphically trivial if $g = 1$. (Compare with
(9.30).)

We use a topological fact. Namely, since $\dim H^1(M, \mathbb{C}) = 2$ if $g = 1$, by
de Rham's theorem there exist closed curves $\gamma_1, \gamma_2$ in $M$ such that, for any closed
curve $\gamma$ in $M$, there are integers $m_1, m_2$ such that


$\int_\gamma v = m_1 \int_{\gamma_1} v + m_2 \int_{\gamma_2} v$


for any closed 1-form $v$ on $M$. Granted this, it follows that if we pick $p_0 \in M$,
the map


(9.48)    $M \ni z \mapsto \int_{p_0}^{z} \xi$


defines a holomorphic map

(9.49)    $\Phi : M \to \mathbb{C}/\Gamma',$


where $\Gamma'$ is the lattice in $\mathbb{C}$ generated by $\zeta_j = \int_{\gamma_j} \xi$, $j = 1, 2$. Since $\xi$ is nowhere
vanishing, the map (9.49) is a covering map. It follows that there is a holomorphic
covering map $\Psi : \mathbb{C} \to M$, and the covering transformations form a group of
translations of $\mathbb{C}$ (a subgroup of $\Gamma'$, call it $\Gamma$). This gives the holomorphic
diffeomorphism $M \approx \mathbb{C}/\Gamma$. We remark that, with a little extra argument, one can verify
that (9.49) is already a diffeomorphism.


Propositions 9.8 and 9.10 are special cases of the uniformization theorem for 
compact Riemann surfaces. The $g \ge 2$ case will be established in Chap. 14 as a
consequence of solving a certain nonlinear PDE. Also in that chapter, an alterna- 
tive proof of Proposition 9.10 will be presented; in that case the PDE becomes 
linear. Also in Chap. 14 we present a linear PDE proof that treats the case g = 0. 




We note that in the treatment of the g = 1 case given above, the Riemann—Roch 
theorem is not essential; the analysis giving (9.22)—(9.24) suffices. 

We return to the study of $\dim \mathcal{O}(L)$, for $L = E_\vartheta$. We illustrate how the first
possibility can occur in (9.44). In fact, pick distinct points $p, q \in M$, and consider
$\vartheta = p - q$. Clearly, $c_1(E_{p-q})[M] = 0$. Now $\mathcal{O}(E_{p-q}) \approx \mathcal{M}(p - q)$, and it follows
from Proposition 9.9 that if there is a nontrivial member of $\mathcal{M}(p - q)$, then $M$
must be the sphere $S^2$. We thus have


(9.50)    $\mathcal{O}(E_{p-q}) = 0$ if $p \ne q \in M$, of genus $g \ge 1$.


On the other hand, if $p, q, r \in M$ are distinct, then $c_1(E_{-p-q+r})[M] = 1$, and
(9.34) applies for $g = 1$; hence


(9.51)    $g = 1 \Longrightarrow \dim \mathcal{M}(-p - q + r) = 1.$


By the discussion above, a nontrivial $u \in \mathcal{M}(-p - q + r)$ cannot have just a
simple pole; it must have poles at $p$ and $q$. This proves the next result:


Proposition 9.11. If $p$, $q$, and $r$ are distinct points in $M$, of genus 1, there is
a meromorphic function on $M$ with simple poles at $p$ and $q$, and a zero at $r$,
unique up to a multiplicative constant. Similarly, if $p = q \ne r \in M$, one has a
meromorphic $u$ with a double pole at $p$, and a zero at $r$.


Given that $M = \mathbb{C}/\Gamma$, these meromorphic functions are the elliptic functions
of Weierstrass, and they can be constructed explicitly. The uniqueness statement
can also be established on elementary grounds. Note that, with $p$, $q$, and $r$ as
in Proposition 9.11, the corresponding elliptic function $u$ vanishes at one other
uniquely determined point $s$ (or perhaps has a double zero at $r$, so $s = r$). In other
words, if we set $\vartheta = -p - q + r + s$, for $M$ of genus 1, the line bundle $E_\vartheta$ is
trivial for a unique $s \in M$, given $p, q, r \in M$, $r$ different from $p$ or $q$. Actually,
this last qualification can be dispensed with; $r = p$ forces $s = q$. It is a basic
general question in Riemann surface theory to specify conditions on a divisor $\vartheta$
(in addition to $\langle\vartheta\rangle = 0$) necessary and sufficient for $E_\vartheta$ to be a trivial holomorphic
line bundle over $M$. The question of whether $E_\vartheta$ is trivial is equivalent to the
question of whether there exists a nontrivial meromorphic function on $M$, with
poles at $p$ of order exactly $|\nu(p)|$, where $\nu(p) < 0$, in the representation (9.35) for
$\vartheta$, and zeros of order exactly $\nu(p)$, where $\nu(p) > 0$. This question is answered by
a theorem of Abel; see [Gu] for a discussion. The answer is essentially equivalent
to a classification of holomorphic line bundles over $M$.


Exercises 


1. Show that the conjugate linear map $C$ in (9.23) is indeed well defined, independently
of a choice of local holomorphic coordinates.

2. Show that if $M$ is a compact Riemann surface, then the complex line bundle $\kappa$ has
a square root, i.e., a line bundle $\lambda$ such that $\kappa \approx \lambda \otimes \lambda$. Show that $\lambda$ can even be
taken to be a holomorphic square root. Thus $M$ actually has a spin structure. (Note
also Exercise 5 of §3.)

3. Deduce the index formula (9.15), which leads to the Riemann-Roch formula, directly
from Theorem 5.1, for twisted Dirac operators on spin manifolds.

4. Is it possible to choose a connection on $L$ such that the operators $\bar\partial_L$ and $D_L$ in (9.14)
are actually equal?

5. Sections of the line bundle $\kappa \otimes \kappa$ are called quadratic differentials. Compute the
dimension of $\mathcal{O}(\kappa \otimes \kappa)$. Given a divisor $\vartheta \le 0$, compute $\dim \mathcal{M}(\kappa \otimes \kappa, \vartheta)$.

6. Show that (9.41)-(9.42) provide an alternative proof of the existence result,
Proposition 9.2.

7. Deduce from Proposition 9.2 that every holomorphic line bundle $L$ over a Riemann
surface is isomorphic to a divisor bundle $E_\vartheta$.


A nonconstant meromorphic function $f : M \to \mathbb{C} \cup \{\infty\}$ can be regarded as a
holomorphic map $f : M \to S^2$, which is onto. It is called a branched covering of
$S^2$ by the Riemann surface $M$. A branch point of $M$ is a point $p \in M$ such that
$df(p) = 0$. The order $o(p)$ is the order to which $df$ vanishes at $p$.

8. If $f : M \to S^2$ is a holomorphic map with branch points $p_j$, show that

(9.52)    $\sum_j o(p_j) = 2 \deg(f) + 2g - 2.$

(Hint: Reduce to the case where all poles of $f$ are simple, so (counting multiplicity)

    # poles of $f'$ = 2 × # poles of $f$,

while the left side of (9.52) is equal to # zeros of $f'$. Think of $f'$ as a meromorphic
section of $\kappa$.)

9. Give another derivation of (9.52) by triangulating $S^2$ so that the points $q_j = f(p_j)$ are
among the vertices, pulling this triangulation back to $M$, and comparing the numbers
of vertices, edges, and faces.

The formula (9.52) is called Hurwitz' formula.

10. Let $X$ be a "real" vector field on a compact Riemann surface $M$. Assume $M$ is given a
Riemannian metric compatible with its complex structure, so that $J : T_xM \to T_xM$
is an isometry. Picture $X$ as a section of the complex line bundle $T \approx \kappa^{-1}$. Show
that $X$ generates a group of conformal diffeomorphisms of $M$ if and only if it is a
holomorphic section of $\kappa^{-1}$. If $g$ is the genus of $M$, show that

    $g \ge 2 \Longrightarrow \mathcal{O}(\kappa^{-1}) = 0,$
    $g = 1 \Longrightarrow \dim_{\mathbb{C}} \mathcal{O}(\kappa^{-1}) = 1,$
    $g = 0 \Longrightarrow \dim_{\mathbb{C}} \mathcal{O}(\kappa^{-1}) = 3.$

Deduce the dimension of Lie groups of conformal diffeomorphisms in these cases.
Compare the conclusion in case $g \ge 2$ with that of Exercise 5 of §4, given (see
Chap. 14, §2) that one could choose a Riemannian metric of curvature $-1$. Compare
the $g = 1$ case with Exercise 6 of §4.

11. Considering $\mathcal{M}(\kappa, p) = \{u \in \mathcal{O}(\kappa) : u(p) = 0\} \approx \mathcal{O}(\kappa \otimes E_p)$, show that

    $g \ge 1 \Longrightarrow \dim \mathcal{M}(\kappa, p) = g - 1.$

Deduce that, for each $p \in M$, there exists $u \in \mathcal{O}(\kappa)$ such that $u(p) \ne 0$, provided
$g \ge 1$. Hint. Use (9.17) to get $\dim \mathcal{O}(\kappa \otimes E_p) - \dim \mathcal{O}(E_p^{-1}) = g - 2$. Then show
that $\dim \mathcal{O}(E_p^{-1}) = \dim \mathcal{M}(-p) = 1$ if $g \ge 1$. (Cf. Proposition 9.9.)

12. Consider $\bar\partial_\kappa : H^s(M, \kappa) \to H^{s-1}(M, \kappa \otimes \bar\kappa) \approx H^{s-1}(M)$. Show that the range
of $\bar\partial_\kappa$ has codimension 1. Hint. As in (9.8), the adjoint is $-\partial : H^{1-s}(M) \to H^{-s}(M, \kappa)$.

13. Let $u_j$ be meromorphic 1-forms on neighborhoods $O_j$ of $p_j$ ($1 \le j \le K$), with poles
at $p_j$. Use Exercise 12 to show there exists $u \in \mathcal{M}(\kappa)$, holomorphic on $M \setminus \{p_j\}$,
such that $u - u_j|_{O_j}$ is pole free for each $j$, if and only if $\sum_j \operatorname{Res}_{p_j} u_j = 0$.

14. Let $E \to M$ be a holomorphic vector bundle over a compact Riemann surface, of
rank $k$. That is, each fiber $E_x$ has complex dimension $k$. Modify the proof of Theorem
9.1 to show that

    $\dim \mathcal{O}(E) - \dim \mathcal{O}(E' \otimes \kappa) = c_1(E)[M] - \frac{1}{2} k\, c_1(\kappa)[M].$

Here $E'$ is the dual bundle of $E$. (Hint. Obtain an analogue of (9.15) and use
$\operatorname{Ch}(E) = \operatorname{Tr} e^{-\Theta/2\pi i}$ as in (6.36), where $\Theta$ is the $\operatorname{End}(E)$-valued curvature form of a
connection on $E$, to get

    $e^{-c_1(\kappa)/2} \operatorname{Ch}(E) = c_1(E) - \frac{1}{2} k\, c_1(\kappa)$

for the component of degree 2.)

15. Use Theorem 8.1 to formulate a version of the Riemann-Roch theorem for a compact,
complex manifold of higher dimension.

10. Direct attack in 2-D 


Here we produce a direct analysis of the index formula for a first-order, elliptic 
operator 


(10.1)    $D : C^\infty(M, E_0) \to C^\infty(M, E_1)$


of Dirac type when $\dim M = 2$. In view of (5.11), if $k_j(t, x, y)$ are the integral
kernels of $e^{-tD^*D}$ and $e^{-tDD^*}$, $j = 0, 1$, then


(10.2)    $k_j(t, x, x) \sim a_{j0}(x) t^{-1} + a_{j1}(x) + a_{j2}(x) t + \cdots,$

as $t \searrow 0$, and

(10.3)    $\operatorname{Index} D = \int_M \operatorname{Tr}\,[a_{01}(x) - a_{11}(x)]\, dV(x).$


As shown in Chap. 7, §14, we can produce explicit formulas for $a_{j1}(x)$ via
calculations using the Weyl calculus.


Thus, pick local frame fields for $E_0$ and $E_1$ so that, in a local coordinate chart,
$D = A(X, D)$, with

(10.4)    $A(x, \xi) = \sum_j A_j(x) \xi_j + C(x),$

a $K \times K$ matrix-valued symbol. Assume that

(10.5)    $D^*D = g(X, D) + \ell_0(X, D) + B_0(x), \qquad DD^* = g(X, D) + \ell_1(X, D) + B_1(x),$

where $g(x, \xi)$ defines a metric tensor, while $\ell_\nu(x, \xi)$ and $B_\nu(x)$ ($\nu = 0, 1$) are $K \times K$
matrix-valued, and

(10.6)    $\ell_\nu(x, \xi) = \sum_j \ell_\nu^j(x) \xi_j.$
j 


By (14.86) of Chap. 7, we have the following: 


Proposition 10.1. If $D$ is an operator of Dirac type satisfying the hypotheses
above and $\dim M = 2$, then Index $D$ is equal to


(10.7) z/ Tr ye Gy = e (x)?] + Tr [By(x) — Bo(x)] > dV. 
M j 


Of course, the individual terms in the integrand in (10.7) are not generally 
globally well defined on M; only the total is. We want to express these terms 
directly in terms of the symbol of $D$. Assuming the adjoint is computed using
$L^2(U, dx)$, we have $D^*D = L_0(X, D)$ and $DD^* = L_1(X, D)$, with

(10.8)    $L_0(x, \xi) = A(x, \xi)^* A(x, \xi) + \frac{i}{2}\{A^*, A\},$
          $L_1(x, \xi) = A(x, \xi) A(x, \xi)^* + \frac{i}{2}\{A, A^*\}.$

Hence

(10.9)    $\ell_0(x, \xi) = A_1(x, \xi)^* C(x) + C(x)^* A_1(x, \xi) + \frac{i}{2}\{A_1^*, A_1\},$
          $\ell_1(x, \xi) = A_1(x, \xi) C(x)^* + C(x) A_1(x, \xi)^* + \frac{i}{2}\{A_1, A_1^*\},$

where $A_1(x, \xi) = \sum_j A_j(x) \xi_j$, and

(10.10)   $B_0(x) = C(x)^* C(x) + \frac{i}{2}\{C^*, A_1\} + \frac{i}{2}\{A_1^*, C\},$
          $B_1(x) = C(x) C(x)^* + \frac{i}{2}\{C, A_1^*\} + \frac{i}{2}\{A_1, C^*\}.$




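Suppose that, for a given point $x_0 \in M$, we arrange $C(x_0) = 0$. Then

(10.11)   $\ell_0(x_0, \xi) = \frac{i}{2}\{A_1^*, A_1\} = \frac{i}{2} \sum_j \Big( \frac{\partial A_1^*}{\partial x_j} \frac{\partial A_1}{\partial \xi_j} - \frac{\partial A_1^*}{\partial \xi_j} \frac{\partial A_1}{\partial x_j} \Big),$
          $\ell_1(x_0, \xi) = \frac{i}{2}\{A_1, A_1^*\} = \frac{i}{2} \sum_j \Big( \frac{\partial A_1}{\partial x_j} \frac{\partial A_1^*}{\partial \xi_j} - \frac{\partial A_1}{\partial \xi_j} \frac{\partial A_1^*}{\partial x_j} \Big),$

and

(10.12)   $B_0(x_0) = \frac{i}{2}\{C^*, A_1\} + \frac{i}{2}\{A_1^*, C\} = \frac{i}{2} \sum_j \Big( \frac{\partial C^*}{\partial x_j} \frac{\partial A_1}{\partial \xi_j} - \frac{\partial A_1^*}{\partial \xi_j} \frac{\partial C}{\partial x_j} \Big),$
          $B_1(x_0) = \frac{i}{2}\{C, A_1^*\} + \frac{i}{2}\{A_1, C^*\} = \frac{i}{2} \sum_j \Big( \frac{\partial C}{\partial x_j} \frac{\partial A_1^*}{\partial \xi_j} - \frac{\partial A_1}{\partial \xi_j} \frac{\partial C^*}{\partial x_j} \Big).$

Note that if $A_1(x, \xi)$ is scalar, then $\ell_0(x_0, \xi) = -\ell_1(x_0, \xi)$ (granted that
$C(x_0) = 0$). Hence their contributions to the integrand in (10.7) cancel. Also, if
$A_1(x, \xi)$ is scalar, then $B_1(x_0) = -B_0(x_0)$. Thus, at $x_0$, the integrand in (10.7)
is equal to

(10.13)   $2 \operatorname{Tr} B_1(x_0) = i \operatorname{Tr} \sum_j \Big( A_j^* \frac{\partial C}{\partial x_j} - A_j \frac{\partial C^*}{\partial x_j} \Big)$

in this case. This situation arises for elliptic differential operators on sections of
complex line bundles. In such a case, $C(x)$ is also scalar, and we can rewrite
(10.13) as

(10.14)   $-2 \operatorname{Im} \sum_j \bar{A}_j \frac{\partial C}{\partial x_j}.$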


Let's take a look at the operator $D_L : C^\infty(M, L) \to C^\infty(M, L \otimes \bar\kappa)$, where
$M$ is a Riemann surface, $L \to M$ is a complex line bundle, with a Hermitian
metric and a metric connection $\nabla$, and, for a vector field $X$,


(10.15)   $(D_L u, X) = \nabla_X u + i \nabla_{JX} u.$


This is the same as (9.11)-(9.12), up to a factor of 2. Here $J$ is the complex
structure on $TM$. We can assume $M$ has a Riemannian metric with respect to
which $J$ is rotation by 90°. Pick $x_0 \in M$. Use a geodesic normal coordinate
system centered at $x_0$, so the metric tensor $g_{jk}$ satisfies


(10.16)   $\nabla_x g_{jk}(x_0) = 0.$




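Let $X(x_0) = \partial/\partial x_1$, and define $X$ by parallel transport radially from $x_0$ (along
geodesics). Then


(10.17)   $X(x) = a_1^1(x) \frac{\partial}{\partial x_1} + a_1^2(x) \frac{\partial}{\partial x_2},$

with

(10.18)   $a_1^1(x_0) = 1, \quad a_1^2(x_0) = 0, \quad \nabla a_1^j(x_0) = 0.$

Furthermore,

(10.19)   $JX(x) = a_2^1(x) \frac{\partial}{\partial x_1} + a_2^2(x) \frac{\partial}{\partial x_2},$

with

(10.20)   $a_2^1(x_0) = 0, \quad a_2^2(x_0) = 1, \quad \nabla a_2^j(x_0) = 0.$


Next, let $\varphi$ be a local section of $L$ such that $\varphi(x_0)$ has norm 1, and $\varphi(x)$ is
obtained from $\varphi(x_0)$ by radial parallel translation. Thus


(10.21)   $u = v\varphi \Longrightarrow \nabla_{\partial_j} u = (\partial_j v + i\theta_j v)\varphi,$

where the connection coefficients satisfy

(10.22)   $\theta_j(x_0) = 0.$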


In such a coordinate system, and with respect to such choices, the operator $D_L$
takes the form


1 Ov 
(10.23) D,(vy) = plas — A,0;v] 9 29, 
where

(10.24)   $A_j = i(a_1^j + i a_2^j)$


and where $\vartheta \in C^\infty(U, \bar\kappa)$ satisfies

    $(X, \vartheta) = 1, \quad (JX, \vartheta) = -i.$


Then $D_L^* : C^\infty(M, L \otimes \bar\kappa) \to C^\infty(M, L)$ is given by


1 —_— Oo = aati 
(10.25) Dz (we @0) == dog? |Aya— + (jA; + A,8,)| (G'/?w)o. 
| 




Now we want to take adjoints using $L^2(U, dx)$ rather than $L^2(U, \sqrt{g}\, dx)$, so we
conjugate by $g^{1/4}$, and replace $D_L$ by


S 1 O, _ 
(10.26) by==>- lo *Aia(o W/4y) — A;0;0. 


Thus we are in the situation of considering an operator of the form (10.4), with
$A_j$ given by (10.24) and


7 idA; ,, 1.109, 
(10.27) cw) =D|5 On, Aj; — 39 peas) 


Thus $C(x_0) = 0$, by (10.18)-(10.22), while


L 1 
(10.28) OnC(a9) = S- [-Ay(0%6%) ane 5 OvOyA; = 7A(Oi2s0) ; 


Z: 
Now 0;6;(0o) is given by the curvature of V on L: 


00; 1 


Meanwhile, as shown in §3 of Appendix C, 0,0; A, can be expressed in terms of 
the Riemannian curvature: 


1 


1 
(10.30) O;Ona*, (ao) = g fei — a Rermj, 


6 


and of course so can 0,0; 9(%0). Consequently, at xo, the formula (10.14) for the 
integrand in (10.7) becomes 


2 1 
(10.31) —5Fi2 + 55(20).- 


Note that $S/2 = K$, the Gauss curvature. Thus the formula (10.7) becomes


1 2 
Index D; = — [ (FF. a K) dV 
An 1 

M 


1 1 
7 aq f i [Kav 
M 


M 


(10.32) 


where $\omega_L$ is the curvature form of $L$. We have the identities


(10.33) 


Oni 


1 1 
wr=a(L)IM), = f Kav =5x(M), 
M M 



the latter being the Gauss—Bonnet theorem. 

Now, if $L \to M$ is a holomorphic line bundle, then $(1/2)D_L$ has the same
principal symbol, hence the same index, as

(10.34)   $\bar\partial_L : C^\infty(M, L) \to C^\infty(M, L \otimes \bar\kappa).$

Hence we obtain the Riemann-Roch formula:

(10.35)   $\operatorname{Index} \bar\partial_L = c_1(L)[M] + \frac{1}{2}\chi(M),$

in agreement with (9.17).
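
For a quick numerical reading of (10.35), the following Python sketch evaluates the right side using $\chi(M) = 2 - 2g$. The function names are hypothetical, and exact rational arithmetic is used only to make the factor $1/2$ explicit.

```python
from fractions import Fraction


def euler_characteristic(g):
    """chi(M) = 2 - 2g for a compact oriented surface of genus g."""
    return 2 - 2 * g


def index_dbar(c1, g):
    """Right side of (10.35): c_1(L)[M] + chi(M)/2."""
    return c1 + Fraction(euler_characteristic(g), 2)


# Trivial bundle on a surface of genus g: the index is 1 - g.
print(index_dbar(0, 3))
# L = kappa^{-1}, with c_1 = 2 - 2g: the index is 3 - 3g.
print(index_dbar(2 - 2 * 2, 2))
```

The second sample agrees with the value $-(3g - 3)$ quoted in the exercises of this section for the operator (10.46).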

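We finish with a further comment on the Gauss-Bonnet formula; $\chi(M)$ is the
index of


(10.36)   $d + \delta : \Lambda^0 M \oplus \Lambda^2 M \to \Lambda^1 M$


if $\dim M = 2$. If $M$ is oriented, both $\Lambda^1 M$ and $(\Lambda^0 \oplus \Lambda^2)M$ get structures of
complex line bundles via the Hodge $*$ operator; use


(10.37)   $J = *$ on $\Lambda^1$, $\quad J = -* : \Lambda^0 \to \Lambda^2$, $\quad J = * : \Lambda^2 \to \Lambda^0$.


It follows easily that $(d + \delta)J = J(d + \delta)$, so we get a $\mathbb{C}$-linear differential
operator


(10.38)   $\vartheta : \Lambda_e M \to \Lambda_o M,$

where $\Lambda_e = \Lambda^0 \oplus \Lambda^2$, $\Lambda_o = \Lambda^1$, regarded as complex line bundles, so

    $\operatorname{Index} \vartheta = \frac{1}{2} \operatorname{Index}(d + \delta).$


Ker $\vartheta$ is a one-dimensional complex vector space:

    $\operatorname{Ker} \vartheta = \operatorname{span}(1) = \operatorname{span}(*1).$


The cokernel of $d + \delta$ in (10.36) consists of the space $\mathcal{H}_1$ of (real) harmonic
1-forms on $M$. This is invariant under $*$, so it becomes a complex vector space:


(10.39)   $\dim_{\mathbb{C}} \mathcal{H}_1 = \frac{1}{2} \dim_{\mathbb{R}} \mathcal{H}_1 = g.$

Thus

(10.40)   $\operatorname{Index} \vartheta = \frac{1}{2}(2 - 2g) = 1 - g.$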



When one applies an analysis parallel to that above, leading to (10.32), one gets


(10.41)   $\operatorname{Index} \vartheta = \frac{1}{4\pi} \int_M K\, dV.$


Putting together (10.40) and (10.41), we again obtain the Gauss—Bonnet formula, 
for a compact, oriented surface. 


Exercises 
1. Use (10.36)-(10.39) to give another proof of (9.24), that is,

    $\dim \mathcal{O}(\kappa) = \frac{1}{2} \dim H^1(M, \mathbb{C}) = g.$


In Exercises 2-4, suppose $E_j \to M$ are complex line bundles over $M$, a compact
manifold of dimension 2, and suppose


    $D : C^\infty(M, E_0) \to C^\infty(M, E_1)$


is a first-order, elliptic differential operator.

2. Show that the symbol of $D$ induces an $\mathbb{R}$-linear isomorphism


(10.42)   $\sigma_D(x) : T_x^* \to \mathcal{L}(E_{0x}, E_{1x}).$


Hence $M$ has a complex structure, making this $\mathbb{C}$-linear. This gives $M$ an orientation;
reversing the orientation makes (10.42) conjugate linear.

3. If $M$ is oriented so that (10.42) is conjugate linear, show that $D$ has a principal symbol
homotopic to that of $D_L$, given by (10.15), with $L = E_0$, $L \otimes \bar\kappa \approx E_1$. Deduce that


(10.43)   $\operatorname{Index} D = \frac{1}{2} c_1(E_0)[M] + \frac{1}{2} c_1(E_1)[M].$

4. What happens to the formula for Index $D^*$?
In Exercises 5-8, $S_0^2T^*$ denotes the bundle of symmetric second-order tensors with
trace zero on a Riemannian manifold $M$, and $S_0^{1,1}$ denotes the bundle of symmetric
tensors of type (1, 1) with trace 0. The metric tensor provides an isomorphism of these
two bundles.


5. If $M$ is a compact, oriented 2-fold, with associated complex structure $J : T_xM \to T_xM$,
show that a complex structure is defined on $S_0^{1,1} \subset \operatorname{Hom} T_x$ by


(10.44)   $\mathcal{J}(A) = \frac{1}{2}[J, A] = JA.$


Thus $S_0^{1,1}$ and $S_0^2T^*$ become complex line bundles.

6. Recall the first-order operator considered in (4.29)-(4.31):


(10.45)   $D_{TF} : C^\infty(M, T) \to C^\infty(M, S_0^2T^*), \quad D_{TF}X = \operatorname{Def} X - \frac{1}{n}(\operatorname{div} X)g,$


in case $n = \dim M = 2$. If $T$ and $S_0^2T^*$ are regarded as complex line bundles, show
that $D_{TF}$ is $\mathbb{C}$-linear.

7. Recall that $\ker D_{TF}$ consists of vector fields that generate conformal diffeomorphisms
of $M$, hence of holomorphic sections of $T \approx \kappa^{-1}$. Show that there is an isomorphism
$S_0^2T^* \approx \kappa^{-1} \otimes \bar\kappa$ transforming (10.45) to


(10.46)   $\bar\partial : C^\infty(M, \kappa^{-1}) \to C^\infty(M, \kappa^{-1} \otimes \bar\kappa).$


Note that Index $\bar\partial = -(3g - 3)$ in this case, if $g$ is the genus of $M$.

8. In view of (4.30), the orthogonal complement of the range of $D_{TF}$ is the finite
dimensional space


(10.47)   $V = \{u \in C^\infty(M, S_0^2T^*) : \operatorname{div} u = 0\}.$


Comparing (10.45) and (10.46), show that $V \approx \mathcal{O}(\kappa \otimes \kappa)$. If $M$ has genus $g \ge 2$, $\bar\partial$ in
(10.46) is injective (by Exercise 12, §9). Deduce that


(10.48)   $\dim_{\mathbb{R}} V = 6g - 6$, if $g \ge 2$.


Compare Exercise 5 of §9. For $g = 0$, compare Exercise 7 of §4.
For connections with the dimension of Teichmüller space, see [Tro].


11. Index of operators of harmonic oscillator type 


In this section we study elliptic operators of harmonic oscillator type, introduced
in §15 of Chap. 7. We recall that a symbol $p(x, \xi)$ belongs to $S_1^m(\mathbb{R}^n)$ if it is
smooth in $(x, \xi) \in \mathbb{R}^n \times \mathbb{R}^n$ and satisfies estimates


(11.1)    $|D_x^\beta D_\xi^\alpha p(x, \xi)| \le C_{\alpha\beta}(1 + |x| + |\xi|)^{m - |\alpha| - |\beta|}.$


The associated operator $P = p(X, D) \in OPS_1^m(\mathbb{R}^n)$ is defined using the Weyl
calculus. The operator is elliptic provided that, for $|x|^2 + |\xi|^2$ large enough,


(11.2)    $|p(x, \xi)^{-1}| \le C(1 + |x| + |\xi|)^{-m}.$

In such a case, $P$ has a parametrix $Q \in OPS_1^{-m}(\mathbb{R}^n)$, such that $PQ - I$ and $QP - I$
belong to $OPS_1^{-\infty}(\mathbb{R}^n)$. The class $S^m(\mathbb{R}^n)$ of classical symbols is defined to
consist of elements $p(x, \xi) \in S_1^m(\mathbb{R}^n)$ such that


(11.3)    $p(x, \xi) \sim \sum_{j \ge 0} p_j(x, \xi),$


where $p_j(x, \xi) \in S_1^{m-2j}(\mathbb{R}^n)$ is homogeneous of degree $m - 2j$ in $(x, \xi)$ for
$|x|^2 + |\xi|^2 \ge 1$. If such a symbol satisfies the ellipticity condition (11.2), then
$P = p(X, D)$ has parametrix $Q \in OPS^{-m}(\mathbb{R}^n)$. A paradigm example of such
an operator is the harmonic oscillator



(11.4)    $H = -\Delta + |x|^2,$


which is elliptic in $OPS^2(\mathbb{R}^n)$, with symbol $|x|^2 + |\xi|^2$. It is a positive definite
operator, and, as shown in Chap. 7,


(11.5)    $H^s \in OPS^{2s}(\mathbb{R}^n), \quad \forall\, s \in \mathbb{R}.$

There are Sobolev-type spaces $\mathcal{H}^s(\mathbb{R}^n)$, $s \in \mathbb{R}$, such that, for $s = k \in \mathbb{Z}^+$,

(11.6)    $\mathcal{H}^k(\mathbb{R}^n) = \{u \in L^2(\mathbb{R}^n) : x^\alpha D^\beta u \in L^2(\mathbb{R}^n),\ \forall\, |\alpha| + |\beta| \le k\}.$

As shown in Chap. 7, if $P \in OPS^m(\mathbb{R}^n)$, then, for all $s \in \mathbb{R}$,

(11.7)    $P : \mathcal{H}^s(\mathbb{R}^n) \to \mathcal{H}^{s-m}(\mathbb{R}^n),$


and if $P$ is elliptic, this map is Fredholm. We want to study its index. For simplicity,
we stick to operators with symbols of classical type.

If $P = p(X, D)$ is an elliptic operator ($k \times k$ matrix valued), with symbol
expansion of the form (11.3), we call $p_0(x, \xi)$ the principal symbol. Recall we assume
$p_0(x, \xi)$ is homogeneous of order $m$ for $|x|^2 + |\xi|^2 \ge 1$. We then have the symbol
map


(11.8)    $\sigma_P : S^{2n-1} \to G\ell(k, \mathbb{C}), \qquad \sigma_P(x, \xi) = p_0(x, \xi), \quad |x|^2 + |\xi|^2 = 1.$


Note that $P \in OPS^m(\mathbb{R}^n)$ and $PH^\sigma \in OPS^{m+2\sigma}(\mathbb{R}^n)$ have the same symbol
map, and they have the same index, one on $\mathcal{H}^s(\mathbb{R}^n) \to \mathcal{H}^{s-m}(\mathbb{R}^n)$ and the other
on $\mathcal{H}^s(\mathbb{R}^n) \to \mathcal{H}^{s-m-2\sigma}(\mathbb{R}^n)$. Basic Fredholm theory gives the following.


Proposition 11.1. Given elliptic $k \times k$ systems $P_j \in OPS^{m_j}(\mathbb{R}^n)$, if $\sigma_{P_1}$ and
$\sigma_{P_2}$ are homotopic maps from $S^{2n-1}$ to $G\ell(k, \mathbb{C})$, then Index $P_1$ = Index $P_2$.


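Let us take $n = 1$ and $k = 1$ and look for specific index formulas. In this case,
given elliptic scalar $P \in OPS^m(\mathbb{R})$, we have


(11.9)    $\sigma_P : S^1 \to G\ell(1, \mathbb{C}) = \mathbb{C} \setminus 0.$

Such a map is specified up to homotopy by the winding number


(11.10)   $\operatorname{ind} \sigma_P = \frac{1}{2\pi i} \int_{S^1} \frac{\sigma_P'(\zeta)}{\sigma_P(\zeta)}\, d\zeta,$


where $\zeta = x + i\xi$. If $P_1$ and $P_2$ are two such elliptic operators, we have


(11.11)   $\operatorname{Index} P_1P_2 = \operatorname{Index} P_1 + \operatorname{Index} P_2,$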


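and

(11.12)   $\operatorname{ind} \sigma_{P_1P_2} = \operatorname{ind} \sigma_{P_1} + \operatorname{ind} \sigma_{P_2}.$

Let us consider the operator


(11.13)   $D_1 = \frac{\partial}{\partial x_1} + x_1,$


acting on functions of $x_1 \in \mathbb{R}$. Its symbol is $x_1 + i\xi_1$, so

(11.14)   $\operatorname{ind} \sigma_{D_1} = 1.$


Note that $D_1^* = -\partial_1 + x_1$, and


(11.15)   $D_1^*D_1 = -\partial_1^2 + x_1^2 - 1, \qquad D_1D_1^* = -\partial_1^2 + x_1^2 + 1.$

We have

(11.16)   $\operatorname{Ker} D_1 = \operatorname{Span}\{e^{-x_1^2/2}\}$, and $D_1D_1^* \ge 2I \Longrightarrow \operatorname{Ker} D_1^* = 0,$

hence

(11.17)   $\operatorname{Index} D_1 = 1.$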
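
The kernel statement in (11.16) can be checked numerically. The sketch below approximates $D_1 = \partial_1 + x_1$ and $D_1^* = -\partial_1 + x_1$ by central differences; the step size `h` and the sample points are arbitrary choices made for this example, not part of the text.

```python
import math


def D1(u, x, h=1e-5):
    """Apply D_1 = d/dx + x to u at the point x (central difference)."""
    return (u(x + h) - u(x - h)) / (2 * h) + x * u(x)


def D1_adjoint(u, x, h=1e-5):
    """Apply D_1^* = -d/dx + x to u at the point x (central difference)."""
    return -(u(x + h) - u(x - h)) / (2 * h) + x * u(x)


def gauss(x):
    """The Gaussian exp(-x^2/2) spanning Ker D_1 in (11.16)."""
    return math.exp(-x * x / 2)


points = (-2.0, -0.5, 0.0, 1.0, 3.0)
residuals = [abs(D1(gauss, x)) for x in points]
print(max(residuals))             # small: the Gaussian is annihilated by D_1
print(D1_adjoint(gauss, 1.0))     # close to 2 * exp(-1/2), so not annihilated by D_1^*
```

This only tests one function at a few points; the statement $\operatorname{Ker} D_1^* = 0$ itself rests on the operator inequality in (11.16).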


Putting together (11.9)-(11.17) and Proposition 11.1, we have the following.

Proposition 11.2. If $P \in OPS^m(\mathbb{R})$ is a scalar elliptic operator, then

(11.18)   $\operatorname{Index} P = \operatorname{ind} \sigma_P.$
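
To make the winding number in (11.10) concrete, here is a short Python sketch that approximates it by accumulating phase increments of the symbol around the unit circle $\zeta = x + i\xi$. The function name `winding_number` and the sample count are assumptions for this example; the method presumes the symbol does not vanish on the circle and that consecutive samples differ in phase by less than $\pi$.

```python
import cmath
import math


def winding_number(symbol, samples=2000):
    """Approximate the winding number of symbol(x, xi) around 0 on the unit circle."""
    total = 0.0
    prev = symbol(1.0, 0.0)
    for n in range(1, samples + 1):
        t = 2 * math.pi * n / samples
        cur = symbol(math.cos(t), math.sin(t))
        total += cmath.phase(cur / prev)   # phase increment between samples
        prev = cur
    return round(total / (2 * math.pi))


sigma_D1 = lambda x, xi: complex(x, xi)        # symbol of D_1, as in (11.14)
sigma_D1_adj = lambda x, xi: complex(x, -xi)   # symbol of D_1^*

print(winding_number(sigma_D1))
print(winding_number(sigma_D1_adj))
print(winding_number(lambda x, xi: sigma_D1(x, xi) ** 2))
```

The last line illustrates the additivity (11.12): the product symbol winds twice as often as the symbol of $D_1$.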

We next keep $n = 1$ and let $P \in OPS^m(\mathbb{R})$ be an elliptic $k \times k$ system, so

(11.19)   $\sigma_P : S^1 \to G\ell(k, \mathbb{C}).$


We want to classify these maps, up to homotopy. To do this, we bring in the
following topological fact about


(11.20)   $S\ell(k, \mathbb{C}) = \{A \in G\ell(k, \mathbb{C}) : \det A = 1\},$

namely

(11.21)   $S\ell(k, \mathbb{C})$ is simply connected.


Using this fact, we prove the following. 




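Proposition 11.3. Given a symbol map (11.19), define $\tilde\sigma_P : S^1 \to G\ell(k, \mathbb{C})$ by


(11.22)   $\tilde\sigma_P(x, \xi) = \begin{pmatrix} \det \sigma_P(x, \xi) & 0 \\ 0 & I \end{pmatrix},$


where $I$ denotes the $(k - 1) \times (k - 1)$ identity matrix. Then $\sigma_P$ and $\tilde\sigma_P$ are
homotopic.


Proof. Given (11.19) and (11.22), we set

(11.23)   $\gamma_1 = \tilde\sigma_P \sigma_P^{-1} : S^1 \to S\ell(k, \mathbb{C}).$

Using (11.21), we can deform $\gamma_1$ to $\gamma_0 = I$, through $\gamma_\tau : S^1 \to S\ell(k, \mathbb{C})$,
$0 \le \tau \le 1$. A homotopy from $\sigma_P$ to $\tilde\sigma_P$ is then given by
$\sigma_\tau(x, \xi) = \gamma_\tau(x, \xi)\sigma_P(x, \xi)$, $0 \le \tau \le 1$.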


We have a scalar operator $\tilde P \in OPS^{km}(\mathbb{R})$, defined uniquely mod
$OPS^{km-2}(\mathbb{R})$ by the condition


(11.24)   $\sigma_{\tilde P} = \det \sigma_P.$

Then

(11.25)   $\operatorname{Index} P = \operatorname{Index} \begin{pmatrix} \tilde P & \\ & I \end{pmatrix},$


which by Proposition 11.2 is given by $\operatorname{ind} \det \sigma_P$. We have proved the following.

Proposition 11.4. If $P \in OPS^m(\mathbb{R})$ is an elliptic $k \times k$ system,

(11.26)   $\operatorname{Index} P = \operatorname{ind} \det \sigma_P.$

Returning to (11.21), we note that it is equivalent to the result

(11.27)   $SU(k)$ is simply connected.

To see this, we use the polar decomposition

(11.28)   $A \in G\ell(k, \mathbb{C}) \Longrightarrow A = U(A)\Pi(A),$


where

    $\Pi(A) = (A^*A)^{1/2}$ is positive definite,

    $U(A) = A(A^*A)^{-1/2} \in U(k).$

With this, we can define a 1-parameter family of maps

(11.29)   $\vartheta_\tau : G\ell(k, \mathbb{C}) \to G\ell(k, \mathbb{C}), \quad \tau \in [0, 1],$

by

(11.30)   $\vartheta_\tau(A) = U(A)\Pi(A)^\tau.$

We have

(11.31)   $\vartheta_0(A) = U(A), \quad \vartheta_1(A) = A.$


This makes $U(k)$ a deformation retract of $G\ell(k, \mathbb{C})$. As a consequence, each
continuous map $\sigma : S^{2n-1} \to G\ell(k, \mathbb{C})$ is homotopic to the map
$\vartheta_0 \circ \sigma : S^{2n-1} \to U(k)$. Note that


(11.32)   $\det \Pi(A) = |\det A|, \quad \det U(A) = \frac{\det A}{|\det A|},$

so

(11.33)   $\vartheta_0 : S\ell(k, \mathbb{C}) \to SU(k),$


and $\vartheta_\tau$ makes $SU(k)$ a deformation retract of $S\ell(k, \mathbb{C})$. This establishes the
equivalence of (11.21) and (11.27). In case $k = 2$, we have


(11.34)   $SU(2) = \Big\{ \begin{pmatrix} a & -\bar{b} \\ b & \bar{a} \end{pmatrix} : a, b \in \mathbb{C},\ |a|^2 + |b|^2 = 1 \Big\} \approx S^3,$

which is clearly simply connected. For $k > 2$, (11.27) is a special case of (11.56)
below.
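
The polar decomposition (11.28) and the determinant identities (11.32) can be checked numerically for $k = 2$. The sketch below is an illustration under stated assumptions: matrices are nested Python lists, the helper names are hypothetical, and the square root $(A^*A)^{1/2}$ is computed with the closed formula $(M + sI)/t$, $s = \sqrt{\det M}$, $t = \sqrt{\operatorname{tr} M + 2s}$, which is valid for positive definite $2 \times 2$ matrices $M$ by the Cayley-Hamilton identity; this formula is a convenience for the example and does not come from the text.

```python
import math


def matmul(X, Y):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def adjoint(X):
    """Conjugate transpose of a 2x2 matrix."""
    return [[X[j][i].conjugate() for j in range(2)] for i in range(2)]


def det(X):
    return X[0][0] * X[1][1] - X[0][1] * X[1][0]


def polar(A):
    """Return (U, Pi) with A = U Pi, Pi = (A^* A)^{1/2}, for invertible 2x2 A."""
    M = matmul(adjoint(A), A)
    s = math.sqrt(det(M).real)                       # sqrt(det M) = |det A|
    t = math.sqrt((M[0][0] + M[1][1]).real + 2 * s)  # sqrt(tr M + 2 s)
    Pi = [[(M[i][j] + (s if i == j else 0)) / t for j in range(2)]
          for i in range(2)]
    d = det(Pi)
    Pi_inv = [[Pi[1][1] / d, -Pi[0][1] / d], [-Pi[1][0] / d, Pi[0][0] / d]]
    return matmul(A, Pi_inv), Pi


A = [[1 + 2j, 3 + 0j], [0.5j, 2 - 1j]]
U, Pi = polar(A)
print(abs(det(Pi) - abs(det(A))))            # det Pi(A) = |det A|, up to rounding
print(abs(det(U) - det(A) / abs(det(A))))    # det U(A) = det A / |det A|
```

The sample matrix `A` is arbitrary; any invertible $2 \times 2$ complex matrix could be used in its place.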

Let us now take $n \ge 2$ and consider a $k \times k$ elliptic system $P \in OPS^m(\mathbb{R}^n)$,
giving a symbol map (11.8). Making use of the deformation retract (11.29)-
(11.31), we see that $\sigma_P$ is homotopic to a symbol map


(11.35)   $\sigma_{P^\#} : S^{2n-1} \to U(k),$


for an operator $P^\# \in OPS^m(\mathbb{R}^n)$, uniquely defined mod $OPS^{m-2}(\mathbb{R}^n)$, and
Index $P$ = Index $P^\#$. For $k = 1$, we have the following topological result.


Lemma 11.5. If $n \ge 2$, every continuous map $\sigma : S^{2n-1} \to U(1) = S^1$ is
homotopic to a constant map.


Proof. Indeed, since $S^{2n-1}$ is simply connected for $n \ge 2$, $\sigma$ lifts to a continuous
map $\tilde\sigma : S^{2n-1} \to \mathbb{R}$, which is clearly homotopic to a constant map.


In light of this, if we have (11.35) and set (as in (11.22))


(11.36)   $\tilde\sigma(x, \xi) = \begin{pmatrix} \det \sigma_{P^\#}(x, \xi) & 0 \\ 0 & I \end{pmatrix}, \quad \tilde\sigma : S^{2n-1} \to U(k),$


then, for $n \ge 2$, $\tilde\sigma$ is homotopic to a constant. Hence $\sigma_{P^\#}$ and

(11.37)   $\sigma^\flat : S^{2n-1} \to SU(k), \quad \sigma^\flat(x, \xi) = \tilde\sigma(x, \xi)^* \sigma_{P^\#}(x, \xi),$


are homotopic. Given $\mu \in \mathbb{R}$, this is the symbol map of an operator
$P^\flat \in OPS^\mu(\mathbb{R}^n)$, uniquely determined up to a lower order operator. We have the
following result.


Proposition 11.6. For $n \ge 2$, if $P \in OPS^m(\mathbb{R}^n)$ is an elliptic $k \times k$ system,
there exists for each $\mu \in \mathbb{R}$ an elliptic $k \times k$ system $P^\flat \in OPS^\mu(\mathbb{R}^n)$ whose
symbol map


(11.38)   $\sigma_{P^\flat} : S^{2n-1} \to SU(k)$

is homotopic to $\sigma_P$, as maps $S^{2n-1} \to G\ell(k, \mathbb{C})$. Hence

(11.39)   $\operatorname{Index} P = \operatorname{Index} P^\flat.$
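Let us now specialize to $n = 2$. By Lemma 11.5, every scalar elliptic
$P \in OPS^m(\mathbb{R}^2)$ must have index 0. We construct an elliptic $2 \times 2$ system with nonzero
index as follows. With $D_1$ as in (11.13), set


(11.40)   $D_2 = \begin{pmatrix} \partial_1 + x_1 & \partial_2 - x_2 \\ \partial_2 + x_2 & -\partial_1 + x_1 \end{pmatrix} = \begin{pmatrix} D_1 & -L_2^* \\ L_2 & D_1^* \end{pmatrix},$

where

(11.41)   $L_2 = \partial_2 + x_2, \quad L_2^* = -\partial_2 + x_2.$

Note that


(11.42)   $\sigma_{D_2} = \begin{pmatrix} x_1 + i\xi_1 & -x_2 + i\xi_2 \\ x_2 + i\xi_2 & x_1 - i\xi_1 \end{pmatrix}$, so $\sigma_{D_2} : S^3 \to SU(2) \approx S^3$


is essentially the identity map. A computation gives


(11.43)   $D_2^*D_2 = \begin{pmatrix} D_1^*D_1 + L_2^*L_2 & \\ & D_1D_1^* + L_2L_2^* \end{pmatrix},$

and

(11.44)   $D_2D_2^* = \begin{pmatrix} D_1D_1^* + L_2^*L_2 & \\ & D_1^*D_1 + L_2L_2^* \end{pmatrix}.$

We recall the formulas for $D_1^*D_1$ and $D_1D_1^*$ in (11.15). Similarly,

(11.45)   $L_2^*L_2 = -\partial_2^2 + x_2^2 - 1, \quad L_2L_2^* = -\partial_2^2 + x_2^2 + 1.$


Hence $\operatorname{Ker} L_2 = \operatorname{Span}\{e^{-x_2^2/2}\}$ and $L_2L_2^* \ge 2I$, and we have for the four
diagonal elements of (11.43)-(11.44) that


(11.46)   $\dim \operatorname{Ker}(D_1^*D_1 + L_2^*L_2) = 1,$
          $D_1D_1^* + L_2L_2^* \ge 4I,$
          $D_1D_1^* + L_2^*L_2 \ge 2I,$
          $D_1^*D_1 + L_2L_2^* \ge 2I.$

Hence

(11.47)   $\dim \operatorname{Ker} D_2 = 1, \quad \dim \operatorname{Ker} D_2^* = 0,$

so

(11.48)   $\operatorname{Index} D_2 = 1.$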
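
As a small check on (11.42), the following Python sketch builds the symbol matrix of $D_2$ at a point of $S^3$ and verifies that it is unitary with determinant 1, with $a = x_1 + i\xi_1$ and $b = x_2 + i\xi_2$ playing the roles of the parameters in (11.34). The function names are hypothetical.

```python
def sigma_D2(x1, x2, xi1, xi2):
    """Symbol matrix of D_2 as in (11.42), written via a = x1 + i xi1, b = x2 + i xi2."""
    a = complex(x1, xi1)
    b = complex(x2, xi2)
    return [[a, -b.conjugate()], [b, a.conjugate()]]


def det2(X):
    return X[0][0] * X[1][1] - X[0][1] * X[1][0]


def gram(X):
    """Return X^* X for a 2x2 matrix X; the identity when X is unitary."""
    return [[sum(X[k][i].conjugate() * X[k][j] for k in range(2))
             for j in range(2)] for i in range(2)]


# A sample point with x1^2 + x2^2 + xi1^2 + xi2^2 = 1.
S = sigma_D2(0.5, 0.5, 0.5, 0.5)
print(det2(S))
print(gram(S))
```

At a general point $(x, \xi)$ the same computation gives determinant $|x|^2 + |\xi|^2$, which is why the restriction to the unit sphere lands in $SU(2)$.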


Now consider an arbitrary $2 \times 2$ elliptic system $P \in OPS^m(\mathbb{R}^2)$. As in (11.38),
we have an adjusted operator $P^\flat$, with the same index as $P$, and


(11.49)   $\sigma_{P^\flat} : S^3 \to SU(2) \approx S^3.$


The homotopy class of this map is an element of $\pi_3(S^3)$. Results on this homotopy
group, which we will discuss in more detail below, imply the following.


Proposition 11.7. Let $P \in OPS^m(\mathbb{R}^2)$ be a $2 \times 2$ elliptic system. For $\sigma_{P^\flat}$ as in
(11.49), there is a unique integer $\ell$ such that either

          $\ell > 0$ and $\sigma_{P^\flat}$ is homotopic to $\sigma_{D_2^\ell}$,
(11.50)   $\ell = 0$ and $\sigma_{P^\flat}$ is homotopic to a constant map,
          $\ell < 0$ and $\sigma_{P^\flat}$ is homotopic to $\sigma_{(D_2^*)^{|\ell|}}$.

We denote this $\ell$ by

(11.51)   $\ell = \operatorname{ind} \sigma_{P^\flat}.$

Then

(11.52)   $\operatorname{Index} P = \operatorname{Index} P^\flat = \operatorname{ind} \sigma_{P^\flat}.$


To see that (11.52) follows from (11.50), note that in the first case Index $P$ =
Index $D_2^\ell = \ell$ and in the third case Index $P$ = Index $(D_2^*)^{|\ell|} = -|\ell| = \ell$.

We now discuss some homotopy theory behind (11.50). It is convenient to
place this in a more general setting. If $M$ is a smooth, connected manifold and
$j \in \mathbb{N}$, $\pi_j(M)$ denotes the set of homotopy classes of continuous maps $\varphi : S^j \to M$
(which is equivalent to the set of homotopy classes of smooth maps). This can be
given a group structure as follows. Fix $p_0 \in S^j$, $q_0 \in M$. Given maps
$\varphi, \psi : S^j \to M$, one can produce maps homotopic to these that take $p_0$ to $q_0$, so assume
$\varphi$ and $\psi$ have this property. Now take $S^j$ and collapse its "equator," which is
homeomorphic to $S^{j-1}$, to a point. You obtain two copies of $S^j$, joined at a point,
which we identify with $p_0$. Then map the top sphere to $M$ by $\psi$ and the bottom
sphere to $M$ by $\varphi$, and compose with the collapse map, to get a map $\alpha : S^j \to M$,
whose homotopy class $[\alpha] = [\varphi] \cdot [\psi]$.

In case $G$ is a connected Lie group, there is another way to define a product
on $\pi_j(G)$. Namely, if $\varphi, \psi : S^j \to G$, consider the map $\varphi \cdot \psi : S^j \to G$ given
by $(\varphi \cdot \psi)(x) = \varphi(x)\psi(x)$, using the product on $G$. If $\varphi$ and $\tilde\varphi$ are homotopic
(write $\varphi \sim \tilde\varphi$) and also $\psi \sim \tilde\psi$, we have $\varphi \cdot \psi \sim \tilde\varphi \cdot \tilde\psi$, so this gives a product
on $\pi_j(G)$. It is a basic fact that this product on $\pi_j(G)$ agrees with the previously
defined one; cf. [Spa], Chap. 1.

What makes (11.50) work is the $j = 3$ case of the following fundamental result
of H. Hopf.


Proposition 11.8. For each $j \in \mathbb{N}$,

(11.53)   $\pi_j(S^j) \approx \mathbb{Z},$

and (the homotopy class of) the identity map $\operatorname{Id} : S^j \to S^j$ is a generator.


In fact, if $\varphi, \psi : S^j \to S^j$ are smooth, they have degrees, defined in Chap. 1,
§19, and the Hopf theorem says they are homotopic if and only if they have the
same degree. Cf. [Spa], p. 398.

Under the identification (11.34) of $SU(2)$ with $(a, b) \in S^3$, $\sigma_{D_2} : S^3 \to S^3$
is the identity map, and $\sigma_{D_2^\ell} \in \pi_3(S^3)$ is an $\ell$-fold product, hence corresponds
to $\ell \in \mathbb{Z}$ under this isomorphism, while $\sigma_{D_2^*} = -1 \in \pi_3(S^3)$, and
$\sigma_{(D_2^*)^{|\ell|}} = -|\ell| \in \pi_3(S^3)$.

Let us next consider a $k \times k$ elliptic system $P \in OPS^m(\mathbb{R}^2)$, giving rise to
$P^\#$ as in (11.35) and $P^\flat$ as in (11.38), all having the same index. The following
result is useful.


Proposition 11.9. For each k © N, the natural inclusion SU(k) = U(k) 
induces an isomorphism 


(11.54) m3(SU(k)) => 1j(U((k)), if 7 >1. 




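Furthermore, the inclusions $U(k) \hookrightarrow U(k+\ell)$ and $SU(k) \hookrightarrow SU(k+\ell)$, given
by


(11.55) $A \mapsto \begin{pmatrix} A & \\ & I \end{pmatrix}$,

where $I$ denotes the $\ell \times \ell$ identity matrix, induce isomorphisms

(11.56) $\pi_j(U(k)) \xrightarrow{\ \approx\ } \pi_j(U(k+\ell)), \quad \pi_j(SU(k)) \xrightarrow{\ \approx\ } \pi_j(SU(k+\ell))$, if $j \le 2k-1$.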

We mention that a proof of (11.54) requires just a few arguments beyond the
proof of Proposition 11.6. The proof of (11.56), with $\ell = 1$, which then proceeds
inductively, follows by applying the "homotopy exact sequence" to


(11.57) $U(k+1)/U(k) = S^{2k+1}, \quad SU(k+1)/SU(k) = S^{2k+1}$.


See (11.83) below. According to Proposition 11.9, when $j = 3$, (11.56) holds for
$k \ge 2$. Taking (11.53) into account, we have


(11.58) $\pi_3(SU(k)) \approx \pi_3(U(k)) \approx \mathbb{Z}, \quad \forall\, k \ge 2$.
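We can now augment Proposition 11.7 as follows.

Proposition 11.10. Let $P \in OPS^m(\mathbb{R}^2)$ be a $k \times k$ elliptic system, $k \ge 2$. For
$\sigma_{\tilde P} : S^3 \to SU(k)$ as in (11.49), there is a unique integer $\ell$ such that, with $I$
denoting the $(k-2) \times (k-2)$ identity matrix, either

(11.59)
$\ell > 0$ and $\sigma_{\tilde P}$ is homotopic to $\begin{pmatrix} \sigma_{D_2^\ell} & \\ & I \end{pmatrix}$,
$\ell = 0$ and $\sigma_{\tilde P}$ is homotopic to a constant map,
$\ell < 0$ and $\sigma_{\tilde P}$ is homotopic to $\begin{pmatrix} \sigma_{(D_2^*)^{|\ell|}} & \\ & I \end{pmatrix}$.


We denote this $\ell$ by

(11.60) $\ell = \operatorname{ind} \sigma_{\tilde P}$.

Then

(11.61) Index $P$ = Index $\tilde P$ = $\operatorname{ind} \sigma_{\tilde P}$.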


We now turn to higher dimensions. Our next task is to construct, for each $n$, an
elliptic system $D_n \in OPS^1(\mathbb{R}^n)$ (actually a system of differential operators) of
index 1. The construction is inductive. Assume we have such an elliptic system
$D_{n-1}$, with the properties


(11.62) $\dim \operatorname{Ker} D_{n-1} = 1$,
and
(11.63) $D_{n-1} D_{n-1}^* \ge 2I$.


By (11.15)-(11.16) we have this for $n - 1 = 1$, and by (11.43)-(11.47) we have
this for $n - 1 = 2$. We then set


(11.64) $D_n = \begin{pmatrix} D_{n-1} & \partial_n - x_n \\ \partial_n + x_n & D_{n-1}^* \end{pmatrix} = \begin{pmatrix} D_{n-1} & -L_n^* \\ L_n & D_{n-1}^* \end{pmatrix}$,

where
(11.65) $L_n = \partial_n + x_n, \quad L_n^* = -\partial_n + x_n$.


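Parallel to (11.43)-(11.44), a computation gives


(11.66) $D_n^* D_n = \begin{pmatrix} D_{n-1}^* D_{n-1} + L_n^* L_n & \\ & D_{n-1} D_{n-1}^* + L_n L_n^* \end{pmatrix}$,
and
(11.67) $D_n D_n^* = \begin{pmatrix} D_{n-1} D_{n-1}^* + L_n^* L_n & \\ & D_{n-1}^* D_{n-1} + L_n L_n^* \end{pmatrix}$.


Parallel to (11.45), we have
(11.68) $L_n^* L_n = -\partial_n^2 + x_n^2 - 1, \quad L_n L_n^* = -\partial_n^2 + x_n^2 + 1$.


We see that $L_n$ annihilates $e^{-x_n^2/2}$ and $L_n L_n^* \ge 2I$. Hence, parallel to (11.46), we
have

(11.69)
$\dim \operatorname{Ker}(D_{n-1}^* D_{n-1} + L_n^* L_n) = 1$,
$D_{n-1} D_{n-1}^* + L_n L_n^* \ge 4I$,
$D_{n-1} D_{n-1}^* + L_n^* L_n \ge 2I$,
$D_{n-1}^* D_{n-1} + L_n L_n^* \ge 2I$.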


Consequently, we have


(11.70) $\dim \operatorname{Ker} D_n = 1$,

and

(11.71) $D_n D_n^* \ge 2I$,
hence

(11.72) Index $D_n = 1$.


This completes the inductive construction. Note that the matrix doubles in size at
each iteration, so $D_n$ is a $2^{n-1} \times 2^{n-1}$ matrix of differential operators.

We can extend Proposition 11.10, using the following fundamental result of
R. Bott. Cf. [Mil], §23.

Proposition 11.11. For $n \in \mathbb{N}$,
(11.73) $\pi_{2n-1}(U(k)) \approx \pi_{2n-1}(SU(k)) \approx \mathbb{Z}$, if $k \ge n$.


Note that (11.58) is the case $n = 2$ of this result. Given this proposition, the
calculation (11.72) implies the following.


Proposition 11.12. For $n \in \mathbb{N}$, the map
(11.74) $\sigma_{D_n} : S^{2n-1} \to U(2^{n-1})$
defines a generator of $\pi_{2n-1}(U(2^{n-1}))$.


Note: The calculation (11.66) implies $\sigma_{D_n^* D_n}(x,\xi) = \sigma_{D_n}(x,\xi)^* \sigma_{D_n}(x,\xi) = I$,
for $|x|^2 + |\xi|^2 = 1$.
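The unitarity of the symbol on the unit sphere can be checked numerically. The following is a minimal sketch, not part of the text's argument. It assumes that the symbol of $\partial_j$ is $i\xi_j$ and that the base case is $D_1 = \partial_1 + x_1$ (an assumption here, since (11.15)-(11.16) lie outside this passage); the function names are hypothetical.

```python
import random

def sigma(x, xi):
    """Symbol of D_n at (x, xi), built by the recursion (11.64).

    Assumptions (not taken from the text): the symbol of d/dx_j is
    i*xi_j, and the base case is D_1 = d/dx_1 + x_1.
    """
    s = [[complex(x[0], xi[0])]]
    for m in range(1, len(x)):
        l = complex(x[m], xi[m])  # symbol of L = d/dx_m + x_m
        size = len(s)
        # Conjugate transpose of the previous symbol matrix.
        s_star = [[s[j][i].conjugate() for j in range(size)] for i in range(size)]
        # Top block row: (sigma, -conj(l) * I); bottom block row: (l * I, sigma^*).
        top = [s[i] + [(-l.conjugate() if i == j else 0j) for j in range(size)]
               for i in range(size)]
        bottom = [[(l if i == j else 0j) for j in range(size)] + s_star[i]
                  for i in range(size)]
        s = top + bottom
    return s

def gram(s):
    """Return the matrix product s^* s."""
    size = len(s)
    return [[sum(s[k][i].conjugate() * s[k][j] for k in range(size))
             for j in range(size)] for i in range(size)]

def max_deviation_from_identity(n, seed=0):
    """Largest entry of |sigma^* sigma - I| at a random point of S^{2n-1}."""
    rng = random.Random(seed)
    v = [rng.gauss(0.0, 1.0) for _ in range(2 * n)]
    r = sum(c * c for c in v) ** 0.5
    v = [c / r for c in v]  # normalize so that |x|^2 + |xi|^2 = 1
    g = gram(sigma(v[:n], v[n:]))
    size = len(g)
    return max(abs(g[i][j] - (1.0 if i == j else 0.0))
               for i in range(size) for j in range(size))
```

For instance, `max_deviation_from_identity(4)` evaluates the $8 \times 8$ symbol of $D_4$ at a random point of $S^7$; the deviation should be at the level of rounding error.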


From here, we have the following extension of Proposition 11.10.


Proposition 11.13. Let $P \in OPS^m(\mathbb{R}^n)$ be a $k \times k$ elliptic system, with
associated symbol map $\sigma_{\tilde P} : S^{2n-1} \to SU(k)$. If $k = 2^{n-1}$, there is a unique
integer $\ell$ such that either


(11.75)
$\ell > 0$ and $\sigma_{\tilde P}$ is homotopic to $\sigma_{D_n^\ell}$,
$\ell = 0$ and $\sigma_{\tilde P}$ is homotopic to a constant map,
$\ell < 0$ and $\sigma_{\tilde P}$ is homotopic to $\sigma_{(D_n^*)^{|\ell|}}$.

We denote this $\ell$ by
(11.76) $\ell = \operatorname{ind} \sigma_{\tilde P}$.


Then

(11.77) Index $P$ = Index $\tilde P$ = $\operatorname{ind} \sigma_{\tilde P}$.


Ifk< 2”-! then Index P = Index P = Index P, where 


(11.78) Op = ( 1) 


I being the (2"~1 —k) x (2"~1—k) identity matrix, and the considerations above 
apply to give Index P, hence Index P. 
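If $k > 2^{n-1}$, then there is a unique integer $\ell$ such that either


(11.79)
$\ell > 0$ and $\sigma_{\tilde P}$ is homotopic to $\begin{pmatrix} \sigma_{D_n^\ell} & \\ & I \end{pmatrix}$,
$\ell = 0$ and $\sigma_{\tilde P}$ is homotopic to a constant map,
$\ell < 0$ and $\sigma_{\tilde P}$ is homotopic to $\begin{pmatrix} \sigma_{(D_n^*)^{|\ell|}} & \\ & I \end{pmatrix}$,


$I$ being the $(k - 2^{n-1}) \times (k - 2^{n-1})$ identity matrix, and analogues of (11.76)-
(11.77) hold.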


Remark: An integral formula for Index P is given in [Fed]; see also [Ho]. 

Also of use in index theory is the following complement to Proposition 11.11.

Proposition 11.14. Given $k \ge 1$,
(11.80) $j \notin \{1, 3, \dots, 2k-1\} \Longrightarrow \pi_j(U(k))$ is finite.


Thanks to Shrawan Kumar for mentioning this and for explaining the proof,
which we now sketch. One ingredient is the result that


(11.81) $\pi_j(S^{2k-1})$ is finite for all $j \ne 2k-1$.

See [Spa], p. 515. The proof of (11.80) goes by induction on $k$. The case $k = 1$
is clear. The case $k = 2$ follows from (11.54), which reduces (11.80) with $k = 2$
to the assertion that $\pi_j(SU(2)) = \pi_j(S^3)$ is finite for $j \ne 3$. To do the inductive
step, we assume that


(11.82) $j \notin \{1, 3, \dots, 2k-3\} \Longrightarrow \pi_j(U(k-1))$ is finite,


and aim to deduce (11.80). Another ingredient for this is the homotopy exact
sequence for $U(k)/U(k-1) = S^{2k-1}$, which includes the segment


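(11.83) $\pi_{j+1}(S^{2k-1}) \to \pi_j(U(k-1)) \to \pi_j(U(k)) \to \pi_j(S^{2k-1})$,

cf. [Mil], p. 128. We tensor with $\mathbb{Q}$, denoting $\pi_j(X) \otimes \mathbb{Q}$ by $\pi_j^{\mathbb{Q}}(X)$:


(11.84) $\pi_{j+1}^{\mathbb{Q}}(S^{2k-1}) \to \pi_j^{\mathbb{Q}}(U(k-1)) \to \pi_j^{\mathbb{Q}}(U(k)) \to \pi_j^{\mathbb{Q}}(S^{2k-1})$.


By (11.81),
(11.85) $\pi_j^{\mathbb{Q}}(S^{2k-1}) = 0$ if $j \ne 2k-1$.

Thus

(11.86) $j \notin \{2k-2, 2k-1\} \Longrightarrow \pi_j^{\mathbb{Q}}(U(k)) \approx \pi_j^{\mathbb{Q}}(U(k-1))$.


With this, (11.82) leads to
(11.87) $\pi_j^{\mathbb{Q}}(U(k)) = 0$ if $j \notin \{1, 3, \dots, 2k-3\}$ and $j \notin \{2k-2, 2k-1\}$.


On the other hand, setting $j = 2k-2$ in (11.84) gives


(11.88) $\mathbb{Q} \to \pi_{2k-2}^{\mathbb{Q}}(U(k-1)) \to \pi_{2k-2}^{\mathbb{Q}}(U(k)) \to 0$,
so
(11.89) $\pi_{2k-2}^{\mathbb{Q}}(U(k-1)) = 0 \Longrightarrow \pi_{2k-2}^{\mathbb{Q}}(U(k)) = 0$,


giving (11.80).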


See the exercises for an application of Proposition 11.14. 


Remark: S. Kumar has also shown the author how further arguments yield,
for $k \ge 2$,


(11.90) $\pi_{2k+1}(U(k)) = 0$ if $k$ is odd, and $\pi_{2k+1}(U(k)) = \mathbb{Z}/(2)$ if $k$ is even.

In case $k = 2$, one has


(11.91) $\pi_5(U(2)) \approx \pi_5(SU(2)) \approx \pi_5(S^3) \approx \mathbb{Z}/(2)$.


See [Spa], p. 520. 




Exercises 


1. Give a Clifford algebra description of the operator $D_n$ in (11.64).
2. Show that if $k \ge n$, there exists for each $\ell \in \mathbb{Z}$ a $k \times k$ elliptic system
$P \in OPS^m(\mathbb{R}^n)$ such that Index $P = \ell$.
3. Suppose you know that

$\pi_{2n-1}(U(k))$ is a finite group.

(By (11.73) this would require $k < n$.) Show that if $P \in OPS^m(\mathbb{R}^n)$ is a $k \times k$
elliptic system,

Index $P = 0$.

(Hint. Index $P^j = j$ Index $P$.)
4. Using Exercise 3 and Proposition 11.14, show that if $P \in OPS^m(\mathbb{R}^n)$ is a $k \times k$
elliptic system,

$k < n \Longrightarrow$ Index $P = 0$.


References 
[An] M. Anderson, Ricci curvature bounds and Einstein metrics on compact manifolds,
J. AMS 2(1989), 455-490.
[ABP] M. Atiyah, R. Bott, and V. Patodi, On the heat equation and the index theorem,
Inventiones Math. 19(1973), 279-330.
[ABS] M. Atiyah, R. Bott, and A. Shapiro, Clifford modules, Topology 3(1964), 3-38.
[AHS] M. Atiyah, N. Hitchin, and I. Singer, Self-duality in four-dimensional Riemannian
geometry, Proc. Royal Soc. London A 362(1978), 425-461.
[AS] M. Atiyah and I. Singer, The index of elliptic operators I, Ann. Math. 87(1968),
484-530; II, Ann. Math. 87(1968), 546-664; IV, Ann. Math. 93(1971), 119-138;
V, Ann. Math. 93(1971), 139-149.
[BDT] P. Baum, R. Douglas, and M. Taylor, Cycles and relative cycles in analytic K-homology,
J. Diff. Geom. 30(1989), 761-804.
[BHS] P. Baum, N. Higson, and T. Schick, On the equivalence of geometric and analytic
K-homology, Pure Appl. Math. Q. 3 (2007), 1-24.
[Ber] F. Berezin, The Method of Second Quantization, Academic, New York, 1966.
[BGM] M. Berger, P. Gauduchon, and E. Mazet, Le Spectre d’une Variété Riemannienne,
LNM #194, Springer, New York, 1971.
[BGV] N. Berline, E. Getzler, and M. Vergne, Heat Kernels and Dirac Operators,
Springer, New York, 1992.
[BV] N. Berline and M. Vergne, A computation of the equivariant index of the Dirac
operator, Bull. Soc. Math. France 113(1985), 305-345.
[BV2] N. Berline and M. Vergne, A proof of Bismut local index theorem for a family of
Dirac operators, Topology 26(1987), 438-464.
[Bes] A. Besse, Einstein Manifolds, Springer, New York, 1987.
[Bi] J. Bismut, The Atiyah-Singer theorems for classical elliptic operators, a probabilistic
approach, J. Func. Anal. 57(1984), 56-99.
[Bi2] J. Bismut, The Atiyah-Singer index theorem for families of Dirac operators: two
heat equation proofs, Invent. Math. 83(1986), 91-151.


[BiC] J. Bismut and J. Cheeger, Families index for manifolds with boundary,
superconnections, and cones, I, J. Func. Anal. 89(1990), 313-363; II, J. Func.
Anal. 90(1990), 306-354.
[B1] B. Blackadar, K-theory for Operator Algebras, Springer, New York, 1986.
[BTu] R. Bott and L. Tu, Differential Forms in Algebraic Topology, Springer, New York,
1982.
[Ch] J. Cheeger, Analytic torsion and the heat equation, Ann. Math. 109(1979),
259-322.
[Cher] S. S. Chern, A simple intrinsic proof of the Gauss-Bonnet formula for closed
Riemannian manifolds, Ann. Math. 45(1944), 747-752.
[Chy] C. Chevalley, Theory of Lie Groups, Princeton University Press, Princeton, NJ,
1946.
[Con] A. Connes, Noncommutative Geometry, Academic, New York, 1994.
[CS] H. Cycon, R. Froese, W. Kirsch, and B. Simon, Schrödinger Operators, Springer,
New York, 1987.
[D] S. Donaldson, The Seiberg-Witten equations and 4-manifold topology, Bull. AMS
33(1996), 45-70.
[Don] H. Donnelly, Local index theorems for families, Mich. Math. J. 35(1988), 11-20.
[EGH] T. Eguchi, P. Gilkey, and A. Hanson, Gravitation, Gauge Theories, and Differential
Geometry, Physics Reports, Vol. 66, no. 6(1980).
[Fed] B. Fedosov, Direct proof of the formula for the index of an elliptic system in
Euclidean space, Funct. Anal. Appl. 4(1970), 339-341.
[FU] D. Freed and K. Uhlenbeck, Instantons and Four-Manifolds, Springer, New York,
1984.
[Gt1] E. Getzler, Pseudodifferential operators on supermanifolds and the Atiyah-Singer
index theorem, Comm. Math. Phys. 92(1983), 163-178.
[Gt2] E. Getzler, A short proof of the local Atiyah-Singer index theorem, Topology
25(1986), 111-117.
[Gil] P. Gilkey, Invariance Theory, the Heat Equation, and the Atiyah-Singer Index
Theorem, CRC, Boca Raton, FL, 1995.
[Gu] R. Gunning, Lectures on Riemann Surfaces, Princeton University Press, Princeton,
NJ, 1967.
[Har] R. Hartshorne, Algebraic Geometry, Springer, New York, 1977.
[Hi] N. Hicks, Notes on Differential Geometry, Van Nostrand, New York, 1965.
[Ho] L. Hörmander, The Weyl calculus of pseudo-differential operators, Comm. Pure
Appl. Math. 32(1979), 359-443.
[Hir] F. Hirzebruch, Topological Methods in Algebraic Geometry, Springer, New York,
1966.
[KN] S. Kobayashi and N. Nomizu, Foundations of Differential Geometry, Interscience,
New York, Vol. 1, 1963; Vol. 2, 1969.
[Ko1] K. Kodaira, The theorem of Riemann-Roch on compact analytic surfaces, Am. J.
Math. 73(1951), 813-875.
[Ko2] K. Kodaira, The theorem of Riemann-Roch for adjoint systems on 3-dimensional
algebraic varieties, Ann. Math. 56(1952), 298-342.
[Kot] T. Kotake, An analytical proof of the classical Riemann-Roch theorem, Proc.
Symp. Pure Math. 16(1970), 137-146.
[LM] H. B. Lawson and M. L. Michelsohn, Spin Geometry, Princeton University Press,
Princeton, NJ, 1989.
[Lic] A. Lichnerowicz, Spineurs harmoniques, C.R. Acad. Sci. Paris, Ser. A 257(1963),
7-9.


[MS] H. McKean and I. Singer, Curvature and the eigenvalues of the Laplacian, J. Diff.
Geom. 1(1967), 43-69.
[Mel] R. Melrose, Index Theory on Manifolds with Corners, A. K. Peters, Boston, 1994.
[Mil] J. Milnor, Morse Theory, Princeton University Press, Princeton, NJ, 1963.
[MiS] J. Milnor and J. Stasheff, Characteristic Classes, Princeton University Press,
Princeton, NJ, 1974.
[Mor] J. Morgan, The Seiberg-Witten Equations and Applications to the Topology of
Smooth Four-Manifolds, Princeton University Press, Princeton, NJ, 1996.
[Pal] R. Palais, ed., Seminar on the Atiyah-Singer Index Theorem, Princeton University
Press, Princeton, NJ, 1963.
[Pt1] V. Patodi, Curvature and the eigenforms of the Laplace operator, J. Diff. Geom.
5(1971), 233-249.
[Pt2] V. Patodi, An analytic proof of the Riemann-Roch-Hirzebruch theorem for Kähler
manifolds, J. Diff. Geom. 5(1971), 251-283.
[Poo] W. Poor, Differential Geometric Structures, McGraw-Hill, New York, 1981.
[Roe] J. Roe, Elliptic Operators, Topology, and Asymptotic Methods, Longman, New
York, 1988.
[Rog] A. Rogers, A superspace path integral proof of the Gauss-Bonnet-Chern theorem,
J. Geom. Phys. 4(1987), 417-437.
[Spa] E. Spanier, Algebraic Topology, McGraw-Hill, New York, 1966.
[Spi] M. Spivak, A Comprehensive Introduction to Differential Geometry, Vol. 1-5,
Publish or Perish, Berkeley, 1979.
[Stb] S. Sternberg, Lectures on Differential Geometry, Prentice-Hall, Englewood Cliffs,
NJ, 1964.
[T] M. Taylor, Pseudodifferential operators and K-homology, Proc. Symp. Pure Math.
51, Part 1 (1990), 561-583.
[Tro] A. Tromba, A classical variational approach to Teichmüller theory, pp. 155-185 in
LNM #1365, Springer, New York, 1989.
[Wit] E. Witten, Constraints on supersymmetry breaking, Nucl. Phys. B202(1982), 253.


11

Brownian Motion and Potential Theory 


Introduction 


Diffusion can be understood on several levels. The study of diffusion on a 
macroscopic level, of a substance such as heat, involves the notion of the flux 
of the quantity. If $u(t,x)$ measures the intensity of the quantity that is diffusing,
the flux $J$ across the boundary of a region $\mathcal{O}$ in $x$-space satisfies the identity


(0.1) $\frac{\partial}{\partial t} \int_{\mathcal{O}} u(t,x)\, dV(x) = -\int_{\partial\mathcal{O}} \nu \cdot J\, dS(x)$,


as long as the substance is being neither created nor destroyed. By the divergence
theorem, this implies

(0.2) $\frac{\partial u}{\partial t} = -\operatorname{div} J$.

The mechanism of diffusion creates a flux in the direction from greater
concentration to lesser concentration. In the simplest model, the quantitative relation
specified is that the flux is proportional to the $x$-gradient of $u$:


(0.3) $J = -D \operatorname{grad} u$,

with $D > 0$. Applying (0.2), we obtain for $u$ the PDE


(0.4) $\frac{\partial u}{\partial t} = D\, \Delta u$


in case D is constant. In such a case we can make D = 1, by rescaling, and this 
PDE is the one usually called “the heat equation.” 
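
The passage from the conservation law (0.2) and Fick's law (0.3) to the heat equation (0.4) can be mirrored in a discrete model. The following is a minimal sketch, assuming one space dimension, a uniform grid, and no flux through the two ends; the function name and parameters are illustrative and not from the text.

```python
def diffuse(u, D, dx, dt, steps):
    """Explicit flux-form scheme for u_t = D u_xx on a 1D grid.

    Assumptions (illustrative only): uniform spacing dx, no flux through
    either end, and dt <= dx**2 / (2 * D) for stability.
    """
    u = list(u)
    for _ in range(steps):
        # Discrete form of (0.3): flux across each interior cell interface.
        flux = [-D * (u[i + 1] - u[i]) / dx for i in range(len(u) - 1)]
        flux = [0.0] + flux + [0.0]  # nothing enters or leaves at the ends
        # Discrete form of (0.2): change in u is minus the divergence of the flux.
        u = [u[i] - dt * (flux[i + 1] - flux[i]) / dx for i in range(len(u))]
    return u

# A unit of "heat" concentrated in one cell spreads out over time.
initial = [0.0] * 20 + [1.0] + [0.0] * 20
final = diffuse(initial, D=1.0, dx=1.0, dt=0.2, steps=50)
```

Because each interface flux is subtracted from one cell and added to its neighbor, the total `sum(final)` equals `sum(initial)` up to rounding, which is the discrete counterpart of (0.1) with no flux through the boundary.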

Many real diffusions result from jitterings of microscopic or submicroscopic 
particles, in a fashion that appears random. This motivates a probabilistic attack 
on diffusion, including creating probabilistic tools to analyze the heat equation. 
This is the topic of the present chapter. 


© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
M. E. Taylor, Partial Differential Equations I, Applied Mathematical Sciences 116,
https://doi.org/10.1007/978-3-031-33700-0_11




In §1 we give a construction of Wiener measure on the space of paths in $\mathbb{R}^n$,
governed by the hypothesis that a particle located at $x \in \mathbb{R}^n$ at time $t_1$ will have
the probability $P(t,x,U)$ of being in an open set $U \subset \mathbb{R}^n$ at time $t_1 + t$, where


(0.5) $P(t,x,U) = \int_U p(t,x,y)\, dy$,


and $p(t,x,y)$ is the fundamental solution to the heat equation. We prove that,
with respect to Wiener measure, almost every path is continuous, and we establish
a modulus of continuity. Our choice of $e^{t\Delta}$ rather than $e^{t\Delta/2}$ to define such
probabilities differs from the most popular convention and leads to minor differences
in various formulas. Of course, translation between the two conventions is
quite easy.

In §2 we establish the Feynman-Kac formula, for the solution to


(0.6) $\frac{\partial u}{\partial t} = \Delta u + V(x) u$,

in terms of an integral over path space. A limiting argument made in §3 gives us
formulas for the solution to (0.4) on a bounded domain $\Omega$, with Dirichlet boundary
conditions. This also leads to formulas for solutions to


(0.7) $\Delta u = f$ on $\Omega$, $u = 0$ on $\partial\Omega$,
and
(0.8) $\Delta u = 0$ on $\Omega$, $u = g$ on $\partial\Omega$.


A different, and more natural, formula for the solution to (0.8) is derived in §5,
after the development in §4 of a tool known as the "strong Markov property." In
§6 we present a study of the Newtonian capacity of a compact set $K \subset \mathbb{R}^n$, in
the case $n \ge 3$, which is related to the probability that a Brownian path starting
outside $K$ will hit $K$. We give Wiener's criterion for a point $y$ in $\partial\Omega$ to be regular
for the Dirichlet problem (0.8), in terms of the capacity of $K_r = \{z \in \partial\Omega :
|z - y| \le r\}$, as $r \to 0$, which has a natural probabilistic proof.
In §7 we introduce the notion of the stochastic integral, such as


(0.9) $\int_0^t f(s, \omega(s))\, d\omega(s)$,


which is not straightforward since almost all Brownian paths fail to have locally
bounded variation. We show how the solution to


(0.10) $\frac{\partial u}{\partial t} = \Delta u + X u$


can be given in terms of an integral over path space, whose integrand involves a
stochastic integral, in case $X$ is a first-order differential operator. The derivation
of this formula, like the derivation of the Feynman-Kac formula in §2, uses a tool
from functional analysis known as the Trotter product formula, which we establish
in Appendix A at the end of this chapter.

In §8 we consider a more general sort of stochastic integral, needed to solve
stochastic differential equations:


(0.11) $dX = b(t, X)\, dt + \sigma(t, X)\, d\omega$,


which we study in §9. Via Ito's formulas, stochastic differential equations can be
used to treat diffusion equations of the form


(0.12) $\frac{\partial u}{\partial t} = \sum A_{jk}(x)\, \partial_j \partial_k u + \sum b_j(x)\, \partial_j u + V(x) u$,


in terms of path space integrals. We look at this in §10. Results there, specialized
to (0.10), yield a formula with a different appearance than that derived in §7. The
identity of these two formulas leads to a formula of Cameron-Martin-Girsanov,
representing the "Jacobian determinant" of a certain nonlinear transformation of
path space.

In §11 we consider diffusion on Riemannian manifolds. We produce Wiener
measure on path space, by a process parallel to that used in §1 for the case
$M = \mathbb{R}^n$, centered about the formula (0.5). A crucial difference is that the heat
kernel $p(t,x,y)$ has a simple explicit formula for $\mathbb{R}^n$. For more general Riemannian
manifolds, we make use of various results established in Chapters 6 and 8.
We assume $M$ is a complete Riemannian manifold, and add other hypotheses as
desirable, to yield needed heat kernel estimates. We establish "stochastic continuity,"
as an alternative to path continuity. This easier result still allows a derivation
of the Feynman-Kac formula. We also treat diffusion with drift, involving semigroups
generated by $\Delta + X$, where $X$ is a vector field on $M$, satisfying convenient
hypotheses. A final topic we take up in §11 is relativistic diffusion on Minkowski
space, which we relate to diffusion on hyperbolic space.

A number of sources use stochastic ODE as an essential tool in the study of dif- 
fusion on manifolds, but we do not take this up here. The reader who gets through 
this chapter will be in a good position to appreciate such treatments, which can be 
found in [Stk3], [Hsu], and [Em]. 

Another important topic that we do not treat here is Malliavin’s stochastic 
calculus of variations, introduced in [Mal], which has had numerous interesting 
applications to PDE. We refer the reader to [Stk2] and [B] for material on this, 
and further references. 


1. Brownian motion and Wiener measure 


One way to state the probabilistic connection with the heat equation


(1.1) $\frac{\partial u}{\partial t} = \Delta u$


is in terms of the heat kernel, $p(t,x,y)$, satisfying


(1.2) $e^{t\Delta} f(x) = \int_M p(t,x,y) f(y)\, dV(y)$.


If $\Delta$ in (1.1) is the Friedrichs extension of the Laplacian on any Riemannian
manifold $M$, the maximum principle implies


(1.3) $p(t,x,y) \ge 0$.


In many cases, including all compact $M$ and $M = \mathbb{R}^n$, we also have


(1.4) $\int_M p(t,x,y)\, dV(y) = 1$.


Consequently, for each $x \in M$, $p(t,x,y)\, dV(y)$ defines a probability distribution,
which we can interpret as giving the probability that a particle starting at the point
$x$ at time 0 will be in a given region in $M$ at time $t$.

Restricting our attention to the case $M = \mathbb{R}^n$, we proceed to construct a
probability measure, known as "Wiener measure," on the set of paths
$\omega : [0,\infty) \to \mathbb{R}^n$, undergoing a random motion, sometimes called Brownian
motion, described as follows. Given $t_1 < t_2$ and that $\omega(t_1) = x_1$, the probability
density for the location of $\omega(t_2)$ is


(1.5) $e^{t\Delta}\delta_{x_1}(x) = p(t, x - x_1) = (4\pi t)^{-n/2} e^{-|x - x_1|^2/4t}, \quad t = t_2 - t_1$.


The motion of a random path for $t_1 \le t \le t_2$ is supposed to be independent of its
past history. Thus, given $0 < t_1 < t_2 < \cdots < t_k$, and given Borel sets $E_j \subset \mathbb{R}^n$,
the probability that a path, starting at $x = 0$ at $t = 0$, lies in $E_j$ at time $t_j$ for each
$j \in [1, k]$ is


(1.6) $\int_{E_1} \cdots \int_{E_k} p(t_k - t_{k-1}, x_k - x_{k-1}) \cdots p(t_1, x_1)\, dx_k \cdots dx_1$.


It is not obvious that there is a countably additive measure characterized by these
properties, and Wiener's result was a great achievement. The construction we give
here is a slight modification of one in Appendix A of [Nel2].




Anticipating that Wiener measure is supported on the set of continuous paths,
we will take a path to be characterized by its locations at all positive rational $t$.
Thus, we consider the set of "paths"


(1.7) $\mathfrak{P} = \prod_{t \in \mathbb{Q}^+} \dot{\mathbb{R}}^n$.


Here, $\dot{\mathbb{R}}^n$ is the one-point compactification of $\mathbb{R}^n$ (i.e., $\dot{\mathbb{R}}^n = \mathbb{R}^n \cup \{\infty\}$). Thus
$\mathfrak{P}$ is a compact, metrizable space. We construct Wiener measure $W$ as a positive
Borel measure on $\mathfrak{P}$.

By the Riesz theorem, it suffices to construct a positive linear functional
$E : C(\mathfrak{P}) \to \mathbb{R}$, on the space $C(\mathfrak{P})$ of real-valued, continuous functions on
$\mathfrak{P}$, satisfying $E(1) = 1$. We first define $E$ on the subspace $C^\#$, consisting of
continuous functions that depend on only finitely many of the factors in (1.7); that is,
functions on $\mathfrak{P}$ of the form


(1.8) $\varphi(\omega) = F(\omega(t_1), \dots, \omega(t_k)), \quad t_1 < \cdots < t_k$,


where $F$ is continuous on $\prod_1^k \dot{\mathbb{R}}^n$, and $t_j \in \mathbb{Q}^+$. To be consistent with (1.6), we
take


(1.9) $E(\varphi) = \int \cdots \int p(t_1, x_1)\, p(t_2 - t_1, x_2 - x_1) \cdots p(t_k - t_{k-1}, x_k - x_{k-1})\, F(x_1, \dots, x_k)\, dx_k \cdots dx_1$.


If $\varphi(\omega)$ in (1.8) actually depends only on $\omega(t_\nu)$ for some proper subset $\{t_\nu\}$ of
$\{t_1, \dots, t_k\}$, there arises a formula for $E(\varphi)$ with a different appearance from
(1.9). The fact that these two expressions are equal follows from the semigroup
property of $e^{t\Delta}$. From this it follows that $E : C^\# \to \mathbb{R}$ is well defined. It is also a
positive linear functional, satisfying $E(1) = 1$.
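
The semigroup property invoked here amounts, for the kernel in (1.5), to the convolution identity $\int p(s, y)\, p(t, x - y)\, dy = p(s + t, x)$. A minimal numerical sketch in the case $n = 1$ follows; the midpoint-rule quadrature and its parameters are illustrative choices, not part of the text.

```python
import math

def p(t, x):
    """Heat kernel on the line for e^{t Delta}: (4 pi t)^(-1/2) exp(-x^2 / 4t)."""
    return math.exp(-x * x / (4.0 * t)) / math.sqrt(4.0 * math.pi * t)

def convolve_kernels(s, t, x, half_width=40.0, n=8000):
    """Midpoint-rule approximation of the integral of p(s, y) p(t, x - y) dy.

    The integration window [-half_width, half_width] and the number of
    cells n are illustrative choices; the integrand is negligible outside.
    """
    h = 2.0 * half_width / n
    total = 0.0
    for i in range(n):
        y = -half_width + (i + 0.5) * h
        total += p(s, y) * p(t, x - y)
    return total * h

# Integrating out an intermediate time should reproduce the kernel at time s + t.
lhs = convolve_kernels(0.3, 0.7, 0.5)
rhs = p(1.0, 0.5)
```

Here `lhs` and `rhs` should agree to high accuracy, which is the one-variable instance of the statement that dropping an intermediate time $t_\nu$ from (1.9) does not change $E(\varphi)$.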

Now, by the Stone-Weierstrass theorem, $C^\#$ is dense in $C(\mathfrak{P})$. Since
$E : C^\# \to \mathbb{R}$ is a positive linear functional and $E(1) = 1$, it follows that $E$
has a unique continuous extension to $C(\mathfrak{P})$, possessing these properties. Thus
there is a unique probability measure $W$ on $\mathfrak{P}$ such that


(1.10) $E(\varphi) = \int_{\mathfrak{P}} \varphi(\omega)\, dW(\omega)$.


This is the Wiener measure.


Proposition 1.1. The set $\mathfrak{P}_0$ of paths from $\mathbb{Q}^+$ to $\mathbb{R}^n$, which are uniformly
continuous on bounded subsets of $\mathbb{Q}^+$ (and which thus extend uniquely to continuous
paths from $[0, \infty)$ to $\mathbb{R}^n$), is a Borel subset of $\mathfrak{P}$ with Wiener measure 1.


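For a set $S$, let $\operatorname{osc}_S(\omega)$ denote $\sup_{s,t \in S} |\omega(s) - \omega(t)|$. Set

(1.11) $E(a,b,\varepsilon) = \{\omega \in \mathfrak{P} : \operatorname{osc}_{[a,b]}(\omega) > 2\varepsilon\}$;


here $[a,b]$ denotes $\{s \in \mathbb{Q}^+ : a \le s \le b\}$. Its complement is


(1.12) $E^c(a,b,\varepsilon) = \bigcap_{t,s \in [a,b]} \{\omega \in \mathfrak{P} : |\omega(s) - \omega(t)| \le 2\varepsilon\}$,


which is closed in $\mathfrak{P}$. Below we will demonstrate the following estimate on the
Wiener measure of $E(a,b,\varepsilon)$:


(1.13) $W(E(a,b,\varepsilon)) \le 2\rho\bigl(\tfrac{\varepsilon}{2}, |b - a|\bigr)$,

where

(1.14) $\rho(\varepsilon, \delta) = \sup_{t \le \delta} \int_{|x| > \varepsilon} p(t,x)\, dx$,


with $p(t,x) = e^{t\Delta}\delta(x)$, as in (1.5). In fact, the sup is assumed at $t = \delta$, so


(1.15) $\rho(\varepsilon, \delta) = \int_{|y| > \varepsilon/\sqrt{\delta}} p(1,y)\, dy = \psi_n\bigl(\tfrac{\varepsilon}{\sqrt{\delta}}\bigr)$,


where
(1.16) $\psi_n(r) = (4\pi)^{-n/2} \int_{|y| > r} e^{-|y|^2/4}\, dy \le \alpha_n r^n e^{-r^2/4}$,


as $r \to \infty$.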
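The relevance of the analysis of $E(a,b,\varepsilon)$ is that if we set


(1.17) $F(k,\varepsilon,\delta) = \{\omega \in \mathfrak{P} : \exists\, J \subset [0,k] \cap \mathbb{Q}^+,\ \ell(J) \le \delta,\ \operatorname{osc}_J(\omega) > 4\varepsilon\}$,

where $\ell(J)$ is the length of the interval $J$, then

(1.18) $F(k,\varepsilon,\delta) = \bigcup \{E(a,b,2\varepsilon) : [a,b] \subset [0,k],\ |b - a| \le \delta\}$


is an open set, and, via (1.13), we have


(1.19) $W(F(k,\varepsilon,\delta)) \le 2k\, \frac{\rho(\varepsilon,\delta)}{\delta}$.


Furthermore, with $F^c(k,\varepsilon,\delta) = \mathfrak{P} \setminus F(k,\varepsilon,\delta)$,


(1.20) $\mathfrak{P}_0 = \{\omega : \forall\, k < \infty,\ \forall\, \varepsilon > 0,\ \exists\, \delta > 0 \text{ such that } \omega \in F^c(k,\varepsilon,\delta)\} = \bigcap_k \bigcap_{\varepsilon = 1/\nu} \bigcup_{\delta = 1/\mu} F^c(k,\varepsilon,\delta)$


is a Borel set (in fact, an $F_{\sigma\delta}$ set), and we can conclude that $W(\mathfrak{P}_0) = 1$ from
(1.19), given the observation that, for any $\varepsilon > 0$,


(1.21) $\frac{\rho(\varepsilon,\delta)}{\delta} \to 0$, as $\delta \to 0$,


which follows immediately from (1.15) and (1.16). Thus, to complete the proof
of Proposition 1.1, it remains to establish the estimate (1.13).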
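
The decay in (1.21) is easy to observe numerically when $n = 1$. In that case the tail integral in (1.15) has a closed form in terms of the complementary error function: substituting $y = 2u$ gives $\rho(\varepsilon, \delta) = \operatorname{erfc}\bigl(\varepsilon / (2\sqrt{\delta})\bigr)$. The following is a minimal sketch under that one-dimensional assumption; the sample values of $\varepsilon$ and $\delta$ are arbitrary.

```python
import math

def rho(eps, delta):
    """rho(eps, delta) from (1.15) in the case n = 1.

    Uses the closed form erfc(eps / (2 sqrt(delta))) for the mass of
    p(1, .) outside |y| <= eps / sqrt(delta); this is specific to n = 1.
    """
    return math.erfc(eps / (2.0 * math.sqrt(delta)))

# The ratio rho(eps, delta) / delta for fixed eps and shrinking delta.
ratios = [rho(0.1, d) / d for d in (1e-2, 1e-3, 1e-4)]
```

The entries of `ratios` decrease rapidly, reflecting the Gaussian factor $e^{-\varepsilon^2/4\delta}$ in (1.16), which overwhelms the factor $1/\delta$.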


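Lemma 1.2. Given $\varepsilon, \delta > 0$, take $\nu$ numbers $t_j \in \mathbb{Q}^+$, $0 \le t_1 < \cdots < t_\nu$, such
that $t_\nu - t_1 \le \delta$. Let


(1.22) $A = \{\omega \in \mathfrak{P} : |\omega(t_1) - \omega(t_j)| > \varepsilon, \text{ for some } j = 1, \dots, \nu\}$.


Then
(1.23) $W(A) \le 2\rho\bigl(\tfrac{\varepsilon}{2}, \delta\bigr)$.

Proof. Let

(1.24)
$B = \{\omega : |\omega(t_1) - \omega(t_\nu)| > \tfrac{\varepsilon}{2}\}$,
$C_j = \{\omega : |\omega(t_j) - \omega(t_\nu)| > \tfrac{\varepsilon}{2}\}$,
$D_j = \{\omega : |\omega(t_1) - \omega(t_j)| > \varepsilon \text{ and } |\omega(t_1) - \omega(t_k)| \le \varepsilon,\ \forall\, k \le j - 1\}$.


Then $A \subset B \cup \bigcup_{j=1}^{\nu} (C_j \cap D_j)$, so


(1.25) $W(A) \le W(B) + \sum_{j=1}^{\nu} W(C_j \cap D_j)$.


Clearly, $W(B) \le \rho(\varepsilon/2, \delta)$. Furthermore, via (1.8)-(1.9), if we set


$D(\omega(t_1), \dots, \omega(t_j)) = 1$ if $\omega \in D_j$, 0 otherwise,
$C(\omega(t_j), \omega(t_\nu)) = 1$ if $\omega \in C_j$, 0 otherwise,


we have $C(x_j, x_\nu) = C_1(x_j - x_\nu)$ and


(1.26)
$W(C_j \cap D_j)$
$= \int \cdots \int D(x_1, \dots, x_j)\, C(x_j, x_\nu)\, p(t_1, x_1)\, p(t_2 - t_1, x_2 - x_1) \cdots p(t_j - t_{j-1}, x_j - x_{j-1})\, p(t_\nu - t_j, x_\nu - x_j)\, dx_\nu\, dx_j \cdots dx_1$
$\le \rho\bigl(\tfrac{\varepsilon}{2}, \delta\bigr) \int \cdots \int D(x_1, \dots, x_j)\, p(t_1, x_1) \cdots p(t_j - t_{j-1}, x_j - x_{j-1})\, dx_j \cdots dx_1$
$= \rho\bigl(\tfrac{\varepsilon}{2}, \delta\bigr)\, W(D_j)$,
so
(1.27) $\sum_j W(C_j \cap D_j) \le \rho\bigl(\tfrac{\varepsilon}{2}, \delta\bigr)$,


since the $D_j$ are mutually disjoint. This proves (1.23).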


Let us note an intuitive approach to (1.26). Since $D_j$ describes properties of
$\omega(t)$ for $t \in [t_1, t_j]$ and $C_j$ describes a property of $\omega(t_\nu) - \omega(t_j)$, these sets
describe independent events, so $W(C_j \cap D_j) = W(C_j)\, W(D_j)$; meanwhile
$W(C_j) \le \rho(\varepsilon/2, \delta)$.

We continue the demonstration of (1.13). Now, given such $t_j$ as in the statement
of Lemma 1.2, if we set


(1.28) $E = \{\omega : |\omega(t_j) - \omega(t_k)| > 2\varepsilon, \text{ for some } j, k \in [1, \nu]\}$,

it follows that


(1.29) $W(E) \le 2\rho\bigl(\tfrac{\varepsilon}{2}, \delta\bigr)$,


since $E$ is a subset of $A$, given by (1.22). Now, $E(a,b,\varepsilon)$, given by (1.11), is a
countable increasing union of sets of the form (1.28), obtained, say, by letting
$\{t_1, \dots, t_\nu\}$ consist of all $t \in [a,b]$ that are rational with denominator $\le K$,
and taking $K \nearrow +\infty$. Thus we have (1.13), and the proof of Proposition 1.1 is
complete.

We make the natural identification of paths $\omega \in \mathfrak{P}_0$ with continuous paths
$\omega : [0, \infty) \to \mathbb{R}^n$. Note that a function $\varphi$ on $\mathfrak{P}_0$ of the form (1.8), with
$t_j \in \mathbb{R}^+$, not necessarily rational, is a pointwise limit on $\mathfrak{P}_0$ of functions in $C^\#$,
as long as $F$ is continuous on $\prod_1^k \dot{\mathbb{R}}^n$, and consequently such $\varphi$ is measurable.
Furthermore, (1.9) continues to hold, by the dominated convergence theorem.

An alternative approach to the construction of $W$ would be to replace (1.7)
by $\tilde{\mathfrak{P}} = \prod \{\dot{\mathbb{R}}^n : t \in \mathbb{R}^+\}$. With the product topology, this is compact but not
metrizable. The set of continuous paths is a Borel subset of $\tilde{\mathfrak{P}}$, but not a Baire set,
so some extra measure-theoretic considerations arise if one takes this route.

Looking more closely at the estimate (1.19) of the measure of the set $F(k,\varepsilon,\delta)$,
defined by (1.17), we note that you can take $\varepsilon = K\sqrt{\delta \log 1/\delta}$, in which case


(1.30) $\rho(\varepsilon, \delta) = \psi_n\bigl(K\sqrt{\log \tfrac{1}{\delta}}\bigr) \le c_n \bigl(\log \tfrac{1}{\delta}\bigr)^{n/2} \delta^{K^2/4}$.


Then we obtain the following refinement of Proposition 1.1.


Proposition 1.3. For almost all $\omega \in \mathfrak{P}_0$, we have the modulus of continuity
$8\sqrt{\delta \log 1/\delta}$, that is, given $0 \le s, t \le k < \infty$,


(1.31) $\limsup_{|s - t| = \delta \to 0} \Bigl( |\omega(s) - \omega(t)| - 8\sqrt{\delta \log \tfrac{1}{\delta}} \Bigr) \le 0$.


In fact, (1.30) gives $W(S_k) = 1$, where $S_k$ is the set of paths satisfying
(1.31), with 8 replaced by $8 + 1/k$, and then $\bigcap_k S_k$ is precisely the set of paths
satisfying (1.31).

This result is not quite sharp; P. Levy showed that, for almost all $\omega \in \mathfrak{P}_0$, with
$\mu(\delta) = 2\sqrt{\delta \log 1/\delta}$, $0 \le s, t \le k < \infty$,


(1.32) $\limsup_{|s - t| \to 0} \frac{|\omega(s) - \omega(t)|}{\mu(|s - t|)} = 1$.


See [McK] for a proof. We also refer to [McK] for a proof of the result, due to
Wiener, that almost all paths $\omega$ are nowhere differentiable.


By comparison with (1.31), note that if we define functions $X_t$ on $\mathfrak{P}_0$, taking
values in $\mathbb{R}^n$, by


(1.33) $X_t(\omega) = \omega(t)$,


then a simple application of (1.8)-(1.10) yields


(1.34) $\|X_t\|_{L^2(\mathfrak{P}_0)} = \Bigl( \int |x|^2 p(t,x)\, dx \Bigr)^{1/2} = \sqrt{2nt}$,
and more generally
(1.35) $\|X_t - X_s\|_{L^2(\mathfrak{P}_0)} = \sqrt{2n}\, |s - t|^{1/2}$.
Note that (1.35) depends on $n$, while (1.32) does not.
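
The identity (1.34) can be checked by simulation. The following is a minimal Monte Carlo sketch, assuming paths are sampled on a finite time grid with independent Gaussian increments of variance $2\,\Delta t$ per coordinate (the variance dictated by the kernel (1.5)); the function names, sample sizes, and seed are illustrative choices.

```python
import math
import random

def sample_endpoint(n, t, steps, rng):
    """Sample omega(t) in R^n for a path started at 0.

    Each coordinate receives independent Gaussian increments of variance
    2 * dt, matching the e^{t Delta} convention (not e^{t Delta / 2}).
    """
    dt = t / steps
    sd = math.sqrt(2.0 * dt)
    pos = [0.0] * n
    for _ in range(steps):
        pos = [c + rng.gauss(0.0, sd) for c in pos]
    return pos

def mean_square_norm(n, t, paths=20000, steps=10, seed=1):
    """Monte Carlo estimate of E_0 |X_t|^2, to compare with 2 n t."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(paths):
        w = sample_endpoint(n, t, steps, rng)
        total += sum(c * c for c in w)
    return total / paths

estimate = mean_square_norm(3, 0.5)  # exact value from (1.34): 2 * 3 * 0.5 = 3
```

With the more common $e^{t\Delta/2}$ convention the increments would have variance $\Delta t$ and the expected value would be $nt$ instead of $2nt$.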
Via a simple translation of coordinates, we have a similar construction for the
set of Brownian paths $\omega$ starting at a general point $x \in \mathbb{R}^n$, yielding the positive
functional $E_x : C(\mathfrak{P}) \to \mathbb{R}$, and Wiener measure $W_x$, such that


(1.36) $E_x(\varphi) = \int_{\mathfrak{P}} \varphi(\omega)\, dW_x(\omega)$.


When $\varphi(\omega)$ is given by (1.8), $E_x(\varphi)$ has the form (1.9), with the function $p(t_1, x_1)$
replaced by $p(t_1, x_1 - x)$. To put it another way, $E_x(\varphi)$ has the form (1.9) with
$F(x_1, \dots, x_k)$ replaced by $F(x_1 + x, \dots, x_k + x)$.

We will often use such notation as


$E_x\bigl(f(\omega(t))\bigr)$


instead of $\int_{\mathfrak{P}_0} f(X_t(\omega))\, dW_x(\omega)$ or $E_x\bigl(f(X_t(\omega))\bigr)$.
The following simple observation is useful.


Proposition 1.4. If $\varphi \in C(\mathfrak{P})$, then $E_x(\varphi)$ is continuous in $x$.


Proof. Continuity for $\varphi \in C^\#$, the set of functions of the form (1.8), is clear from
(1.9) and its extension to $x \ne 0$ discussed above. Since $C^\#$ is dense in $C(\mathfrak{P})$, the
result follows easily.


Exercises 


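1. Given $a > 0$, define a transformation $D_a : \mathfrak{P}_0 \to \mathfrak{P}_0$ by

$(D_a \omega)(t) = a\, \omega(a^{-2} t)$.

Show that $D_a$ preserves the Wiener measure $W$. This transformation is called Brownian
scaling.

2. Let $\tilde{\mathfrak{P}}_0 = \{\omega \in \mathfrak{P}_0 : \lim_{s \to \infty} s^{-1}\omega(s) = 0\}$. Show that $W(\tilde{\mathfrak{P}}_0) = 1$.
Define a transformation $\rho : \tilde{\mathfrak{P}}_0 \to \tilde{\mathfrak{P}}_0$ by

$(\rho\omega)(t) = t\, \omega(t^{-1})$,

for $t > 0$. Show that $\rho$ preserves the Wiener measure $W$.

3. Given $a > 0$, define a transformation $R_a : \mathfrak{P}_0 \to \mathfrak{P}_0$ by

$(R_a \omega)(t) = \omega(t)$, for $0 \le t \le a$,
$(R_a \omega)(t) = 2\omega(a) - \omega(t)$, for $t \ge a$.

Show that $R_a$ preserves the Wiener measure $W$.

4. Show that $L^p(\mathfrak{P}_0, dW_0)$ is separable, for $1 \le p < \infty$. (Hint: $\mathfrak{P}$ is a compact metric
space. Show that $C(\mathfrak{P})$ is separable.)

5. If $0 \le a_1 < b_1 \le a_2 < b_2$, show that $X_{b_1} - X_{a_1}$ is orthogonal to $X_{b_2} - X_{a_2}$ in
$L^2(\mathfrak{P}_0, dW, \mathbb{R}^n)$, where $X_t(\omega) = \omega(t)$, as in (1.33).

6. Verify the following identities (when $n = 1$):

(1.37) $E_x\bigl(e^{\xi(\omega(t) - \omega(s))}\bigr) = e^{|t - s|\xi^2}$,

(1.38) $E_x\bigl([\omega(t) - \omega(s)]^{2k}\bigr) = \frac{(2k)!}{k!}\, |t - s|^k$,

(1.39) $E_0\bigl(\omega(s)\omega(t)\bigr) = 2\min(s, t)$.

7. Show that $e^{\lambda|\omega(t)|^2} \in L^2(\mathfrak{P}_0, dW_0)$ if and only if $\lambda < 1/8t$.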


2. The Feynman—Kac formula 


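To illustrate the application of Wiener measure to PDE, we now derive a formula,
known as the Feynman-Kac formula, for the solution operator $e^{t(\Delta - V)}$ to

(2.1) $\frac{\partial u}{\partial t} = \Delta u - Vu, \quad u(0) = f$,


given $f$ in an appropriate Banach space, such as $L^p(\mathbb{R}^n)$, $1 \le p < \infty$, or
$f \in C_o(\mathbb{R}^n)$, the space of continuous functions on $\mathbb{R}^n$ vanishing at infinity. To start,
we will assume $V$ is bounded and continuous on $\mathbb{R}^n$. Following [Nel2], we will
use the Trotter product formula


(2.2) $e^{t(\Delta - V)} f = \lim_{k \to \infty} \bigl( e^{(t/k)\Delta} e^{-(t/k)V} \bigr)^k f$.


For any $k$, $\bigl( e^{(t/k)\Delta} e^{-(t/k)V} \bigr)^k f$ is expressed as a $k$-fold integral:


(2.3) $\bigl( e^{(t/k)\Delta} e^{-(t/k)V} \bigr)^k f(x) = \int \cdots \int f(x_k)\, e^{-(t/k)V(x_k)}\, p\bigl(\tfrac{t}{k}, x_k - x_{k-1}\bigr)\, e^{-(t/k)V(x_{k-1})} \cdots e^{-(t/k)V(x_1)}\, p\bigl(\tfrac{t}{k}, x_1 - x\bigr)\, dx_1 \cdots dx_k$.

Comparison with (1.36) gives

(2.4) $\bigl( e^{(t/k)\Delta} e^{-(t/k)V} \bigr)^k f(x) = E_x(\varphi_k)$,


where

(2.5) $\varphi_k(\omega) = f(\omega(t))\, e^{-S_k(\omega)}, \quad S_k(\omega) = \frac{t}{k} \sum_{j=1}^{k} V\bigl(\omega\bigl(\tfrac{jt}{k}\bigr)\bigr)$.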

We are ready to prove the Feynman—Kac formula. 


Proposition 2.1. If $V$ is bounded and continuous on $\mathbb{R}^n$, and $f \in C(\mathbb{R}^n)$ vanishes
at infinity, then, for all $x \in \mathbb{R}^n$,


(2.6) $e^{t(\Delta - V)} f(x) = E_x\Bigl( f(\omega(t))\, e^{-\int_0^t V(\omega(\tau))\, d\tau} \Bigr)$.


Proof. We know that $e^{t(\Delta - V)} f$ is equal to the limit of (2.4) as $k \to \infty$, in the
sup norm. Meanwhile, since almost all $\omega \in \mathfrak{P}$ are continuous paths, $S_k(\omega) \to
\int_0^t V(\omega(\tau))\, d\tau$ boundedly and a.e. on $\mathfrak{P}$. Hence, for each $x \in \mathbb{R}^n$, the right side
of (2.4) converges to the right side of (2.6). This finishes the proof.
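
The right side of (2.4) lends itself to direct simulation. The following is a minimal Monte Carlo sketch for $n = 1$, assuming paths are sampled at the times $jt/k$ with Gaussian increments of variance $2t/k$ (per (1.5)); the function name, sample sizes, and seed are illustrative. As a check it uses a constant potential $V \equiv c$ and $f(y) = e^{-y^2}$, for which $e^{t(\Delta - V)} f(x) = e^{-ct} (1 + 4t)^{-1/2} e^{-x^2/(1 + 4t)}$ by a direct Gaussian convolution.

```python
import math
import random

def feynman_kac_mc(f, V, x, t, steps=20, paths=20000, seed=2):
    """Monte Carlo estimate of E_x[ f(w(t)) exp(-S_k(w)) ] for n = 1.

    S_k is the Riemann sum (t/k) * sum_j V(w(j t / k)) from (2.5), with
    k = steps. Increments have variance 2 * dt (e^{t Delta} convention).
    """
    rng = random.Random(seed)
    dt = t / steps
    sd = math.sqrt(2.0 * dt)
    total = 0.0
    for _ in range(paths):
        w, s = x, 0.0
        for _ in range(steps):
            w += rng.gauss(0.0, sd)  # advance the path to the next grid time
            s += dt * V(w)           # accumulate the Riemann sum S_k
        total += f(w) * math.exp(-s)
    return total / paths

# Check against a case with a closed form: constant potential V = c.
c, x0, t0 = 0.5, 0.3, 0.5
estimate = feynman_kac_mc(lambda y: math.exp(-y * y), lambda y: c, x0, t0)
exact = math.exp(-c * t0) * math.exp(-x0 * x0 / (1.0 + 4.0 * t0)) / math.sqrt(1.0 + 4.0 * t0)
```

For a nonconstant bounded continuous $V$ the same routine applies unchanged; only the closed-form comparison is special to the constant case.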


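Note that if $V$ is real-valued and in $L^\infty(\mathbb{R}^n)$, then $e^{t(\Delta - V)}$ is defined on $L^\infty(\mathbb{R}^n)$, by duality from its action on $L^1(\mathbb{R}^n)$, and

(2.7)  $f_\nu \in C_o(\mathbb{R}^n),\ f_\nu \nearrow 1 \Longrightarrow e^{t(\Delta - V)} f_\nu \nearrow e^{t(\Delta - V)} 1.$

Thus, if $V$ is real-valued, bounded, and continuous, then, for all $x \in \mathbb{R}^n$,

(2.8)  $e^{t(\Delta - V)} 1(x) = E_x\Bigl(e^{-\int_0^t V(\omega(\tau))\,d\tau}\Bigr).$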


We can extend these identities to some larger classes of V. First we consider 
the nature of the right side of (2.6) for more general V. 


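Lemma 2.2. Fix $t \in [0,\infty)$. If $V \in L^\infty(\mathbb{R}^n)$, then

(2.9)  $I_V(\omega) = \int_0^t V(\omega(\tau))\,d\tau$

is well defined in $L^\infty(\mathfrak{P})$. If $V_\nu$ is a bounded sequence in $L^\infty(\mathbb{R}^n)$ and $V_\nu \to V$ in measure, then $I_{V_\nu} \to I_V$ boundedly and in measure on $\mathfrak{P}$. This is true for each measure $W_x$, $x \in \mathbb{R}^n$.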


Proof. Here, $L^\infty$ is the set of equivalence classes (mod a.e. equality) of bounded measurable functions, that is, elements of $\mathcal{L}^\infty(\mathbb{R}^n)$. Suppose $W \in \mathcal{L}^\infty(\mathbb{R}^n)$ is a pre-image of $V$. Then $\int_0^t W(\omega(\tau))\,d\tau = \iota_W(\omega)$ is defined and measurable, and $\|\iota_W\|_{\mathcal{L}^\infty(\mathfrak{P})} \le \|W\|_{\mathcal{L}^\infty(\mathbb{R}^n)}\, t$. If $W^\#$ is also a pre-image of $V$, then $W = W^\#$ almost everywhere on $\mathbb{R}^n$. Look at $U$, defined on $\mathfrak{P} \times \mathbb{R}^+$ by

$U(\omega, s) = W(\omega(s)) - W^\#(\omega(s)).$

This is measurable. Let $K \subset \mathbb{R}^n$ be the set where $W(x) \ne W^\#(x)$; this has measure 0. Now, for fixed $s$, the set of $\omega \in \mathfrak{P}$ such that $\omega(s) \in K$ has Wiener measure 0. By Fubini's theorem it follows that $U = 0$ a.e. on $\mathfrak{P} \times \mathbb{R}^+$, and hence, for almost all $\omega \in \mathfrak{P}$, $U(\omega, \cdot) = 0$ a.e. on $\mathbb{R}^+$. Thus $\int_0^t W^\#(\omega(\tau))\,d\tau = \int_0^t W(\omega(\tau))\,d\tau$ for a.e. $\omega \in \mathfrak{P}$, so $I_V$ is well defined in $L^\infty(\mathfrak{P})$ for each $V \in L^\infty(\mathbb{R}^n)$. Clearly, $\|I_V\|_{L^\infty} \le \|V\|_{L^\infty}\, t$.

If $V_\nu \to V$ boundedly and in measure, in view of the previous argument we can assume without loss of generality that, upon passing to a subsequence, $V_\nu(x) \to V(x)$ for all $x$. Consider

$U_\nu(\omega, s) = V(\omega(s)) - V_\nu(\omega(s)),$

which is bounded in $L^\infty(\mathfrak{P} \times \mathbb{R}^+)$. This converges to 0 for each $(\omega, s) \in \mathfrak{P} \times \mathbb{R}^+$, so by Fubini's theorem again, $\int_0^t U_\nu(\omega, s)\,ds \to 0$ for a.e. $\omega$. This completes the proof.


A similar argument yields the following. 


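Lemma 2.3. If $V \in L^1_{\mathrm{loc}}(\mathbb{R}^n)$ is bounded from below, then

(2.10)  $e_V(\omega) = e^{-\int_0^t V(\omega(\tau))\,d\tau}$

is well defined in $L^\infty(\mathfrak{P})$. If $V_\nu \in L^1_{\mathrm{loc}}(\mathbb{R}^n)$ are uniformly bounded below and $V_\nu \to V$ in $L^1_{\mathrm{loc}}$, then $e_{V_\nu} \to e_V$ boundedly and in measure on $\mathfrak{P}$.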


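Thus, if $V \in L^1_{\mathrm{loc}}(\mathbb{R}^n)$, $V \ge -K > -\infty$, take bounded, continuous $V_\nu$ such that $V_\nu \ge -K$ and $V_\nu \to V$ in $L^1_{\mathrm{loc}}$. We have $\|e^{t(\Delta - V_\nu)}\| \le e^{Kt}$ for all $\nu$, where $\|\cdot\|$ can be the operator norm on $L^p(\mathbb{R}^n)$ or on $C_o(\mathbb{R}^n)$. Now, if we replace $V$ by $V_\nu$ in (2.6), then Lemma 2.3 implies that, for any $f \in C_0^\infty(\mathbb{R}^n)$, the right side converges, for each $x$, namely,

(2.11)  $E_x\Bigl(f(\omega(t))\, e^{-\int_0^t V_\nu(\omega(\tau))\,d\tau}\Bigr) \longrightarrow P(t)f(x), \quad \text{as } \nu \to \infty.$

Clearly $|P(t)f(x)| \le e^{Kt} E_x(|f|) \le e^{Kt}\|f\|_{L^\infty}$. Consequently, for each $x \in \mathbb{R}^n$, if $f \in C_0^\infty(\mathbb{R}^n)$,

(2.12)  $e^{t(\Delta - V_\nu)} f(x) \to P(t)f(x) = E_x\Bigl(f(\omega(t))\, e^{-\int_0^t V(\omega(\tau))\,d\tau}\Bigr).$

It follows that $P(t) : C_0^\infty(\mathbb{R}^n) \to L^\infty(\mathbb{R}^n)$. Since

(2.13)  $|e^{t(\Delta - V_\nu)} f(x)| \le e^{Kt}\, e^{t\Delta}|f|(x),$

we also have $P(t) : C_0^\infty(\mathbb{R}^n) \to L^1(\mathbb{R}^n)$. Furthermore, we can pass to the limit in the PDE $\partial u_\nu/\partial t = \Delta u_\nu - V_\nu u_\nu$ for $u_\nu = e^{t(\Delta - V_\nu)} f$, to obtain for $u(t) = P(t)f$ the PDE

(2.14)  $\frac{\partial u}{\partial t} = \Delta u - Vu, \quad u(0) = f.$

If $\Delta - V$, with domain $\mathcal{D} = \mathcal{D}(\Delta) \cap \mathcal{D}(V)$, is self-adjoint, or has self-adjoint closure $A$, the uniqueness result of Proposition 9.11 in Appendix A, Functional Analysis, guarantees that $P(t)f = e^{tA}f$. For examples of such self-adjointness results on $\Delta - V$, see Chap. 8, §2, and the exercises following that section. Thus the identity (2.6) extends to such $V$, for example, to $V \in L^\infty(\mathbb{R}^n)$; so does the identity (2.8).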
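The extended identity (2.8) lends itself to a rough numerical check. The following is a minimal Monte Carlo sketch, assuming $n = 1$ and the potential $V(x) = \varepsilon x^2$ (locally integrable and bounded below), for which the closed form (2.20) from the exercises is available for comparison. The function names, sample sizes, and the seed are illustrative choices, not part of the text. Paths are simulated with increments of variance $2\,dt$, matching the convention $\partial_t u = \Delta u$ used here, and the time integral is replaced by the Riemann sum $S_k$ of (2.5).

```python
import math
import random

def feynman_kac_mc(x, t, eps, n_paths=4000, n_steps=100, seed=0):
    """Estimate E_x[exp(-eps * int_0^t w(s)^2 ds)] by simulation.

    Increments over a time step dt are Gaussian with variance 2*dt
    (generator d^2/dx^2). The integral is approximated by the
    right-endpoint Riemann sum S_k of (2.5).
    """
    rng = random.Random(seed)
    dt = t / n_steps
    sigma = math.sqrt(2.0 * dt)
    total = 0.0
    for _ in range(n_paths):
        w = x
        s_k = 0.0
        for _ in range(n_steps):
            w += sigma * rng.gauss(0.0, 1.0)
            s_k += eps * w * w * dt
        total += math.exp(-s_k)
    return total / n_paths

def closed_form(x, t, eps):
    """a(t) * exp(-b(t) x^2), with a, b as in (2.20)."""
    r = math.sqrt(eps)
    a = math.cosh(2.0 * r * t) ** -0.5
    b = 0.5 * r * math.tanh(2.0 * r * t)
    return a * math.exp(-b * x * x)
```

For example, `feynman_kac_mc(0.5, 1.0, 1.0)` and `closed_form(0.5, 1.0, 1.0)` should agree up to sampling and time-discretization error, both of which shrink as `n_paths` and `n_steps` grow.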

We can derive a similar formula for the solution operator $S(t, 0)$ to

(2.15)  $\frac{\partial u}{\partial t} = \Delta u - V(t, x)u, \quad u(0) = f,$

using the time-dependent Trotter product formula, Proposition A.5, and its consequence, Proposition A.6. Thus, we obtain

(2.16)  $S(t,0) f(x) = E_x\Bigl(f(\omega(t))\, e^{-\int_0^t V(\tau, \omega(\tau))\,d\tau}\Bigr)$

when $V(t) \in C([0, \infty), BC(\mathbb{R}^n))$, $BC(\mathbb{R}^n)$ denoting the space of bounded continuous functions on $\mathbb{R}^n$. By arguments such as those used above, we can extend this identity to larger classes of functions $V(t)$.


Exercises 
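1. Given $\varepsilon > 0$, $\lambda \in \mathbb{R}$, compute the integral operator giving

(2.17)  $e^{t(\partial_x^2 - \varepsilon x^2 - \lambda x)} f(x).$

(Hint: Use $\varepsilon x^2 + \lambda x = \varepsilon(x + \lambda/2\varepsilon)^2 - \lambda^2/4\varepsilon$ to reduce this to the problem of computing the integral operator giving

(2.18)  $e^{t(\partial_x^2 - \varepsilon x^2)} g(x).$

For this, see the material on the harmonic oscillator in §6 of Chap. 8, in particular, Mehler's formula.)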
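2. Obtain a formula for

(2.19)  $E_x\Bigl(e^{-\varepsilon \int_0^t \omega(s)^2\,ds - \lambda \int_0^t \omega(s)\,ds}\Bigr) = e^{t(\partial_x^2 - \varepsilon x^2 - \lambda x)} 1(x)$

in the case of one-dimensional Brownian motion. (Hint: Use the formula

(2.20)  $e^{t(\partial_x^2 - \varepsilon x^2)} 1(x) = a(t)\, e^{-b(t)x^2}, \quad a(t) = \bigl(\cosh 2\sqrt{\varepsilon}\,t\bigr)^{-1/2}, \quad b(t) = \tfrac{1}{2}\sqrt{\varepsilon}\,\tanh 2\sqrt{\varepsilon}\,t,$

which follows from the formula for (2.18). Alternatively, verify (2.20) directly, examining the system of ODE

$a'(t) = -2a(t)b(t), \quad b'(t) = \varepsilon - 4b(t)^2.$)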
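3. Pass to the limit $\varepsilon \searrow 0$ in (2.19), to evaluate

(2.21)  $E_x\Bigl(e^{-\lambda \int_0^t \omega(s)\,ds}\Bigr).$

Note that the monotone convergence theorem applies.


Exercises 4 and 5 will investigate

(2.22)  $\psi(\varepsilon) = W_0\Bigl(\Bigl\{\omega \in \mathfrak{P}_0 : \int_0^a \omega(s)^2\,ds < \varepsilon\Bigr\}\Bigr) = P\Bigl(\int_0^a \omega(s)^2\,ds < \varepsilon\Bigr).$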


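4. Using Exercise 2, show that, for all $\lambda > 0$,

(2.23)  $\int_0^\infty \psi'(s)\, e^{-\lambda s}\,ds = E_0\Bigl(e^{-\lambda \int_0^a \omega(s)^2\,ds}\Bigr) = \bigl(\cosh 2a\sqrt{\lambda}\bigr)^{-1/2} = \sqrt{2}\, e^{-a\sqrt{\lambda}} \bigl(1 + e^{-4a\sqrt{\lambda}}\bigr)^{-1/2}.$

Other derivations of (2.23) can be found in [CM] and [Lev].
5. The subordination identity, given as (5.22) in Chap. 3, implies

$\int_0^\infty \varphi_a(s)\, e^{-\lambda s}\,ds = \sqrt{2}\, e^{-a\sqrt{\lambda}} \quad \text{if} \quad \varphi_a(s) = \frac{a}{\sqrt{2\pi}}\, s^{-3/2}\, e^{-a^2/4s}.$

Deduce that

$\psi'(\varepsilon) = \varphi_a(\varepsilon) - \frac{1}{2}\varphi_{5a}(\varepsilon) + \frac{3}{8}\varphi_{9a}(\varepsilon) - \cdots,$

hence that

$\psi'(\varepsilon) = \frac{a}{\sqrt{2\pi}}\, \varepsilon^{-3/2} \Bigl[e^{-a^2/4\varepsilon} - \frac{5}{2}\, e^{-25a^2/4\varepsilon} + \frac{27}{8}\, e^{-81a^2/4\varepsilon} - \cdots\Bigr].$

Show that the terms in this alternating series have progressively decreasing magnitude provided $\varepsilon/a^2 \le 1/2$. (Hint: Use the power series

$(1 + y)^{-1/2} = 1 - \frac{1}{2}y + \frac{3}{8}y^2 - \cdots$

with $y = e^{-4a\sqrt{\lambda}}$.)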
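6. Suppose now that $\omega(t)$ is Brownian motion in $\mathbb{R}^n$. Show that

$E_0\Bigl(e^{-\lambda \int_0^a |\omega(s)|^2\,ds}\Bigr) = \bigl(\cosh 2a\sqrt{\lambda}\bigr)^{-n/2}.$

Deduce that in the case $n = 2$,

$\frac{d}{d\varepsilon} P\Bigl(\int_0^a |\omega(s)|^2\,ds < \varepsilon\Bigr) = \frac{2a}{\sqrt{\pi}}\, \varepsilon^{-3/2} \Bigl[e^{-a^2/\varepsilon} - 3e^{-9a^2/\varepsilon} + 5e^{-25a^2/\varepsilon} - \cdots\Bigr].$

Show that the terms in this alternating series have progressively decreasing magnitude provided $\varepsilon \le 2a^2$.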

3. The Dirichlet problem and diffusion on domains 
with boundary 


We can use results of §2 to provide connections between Brownian motion and the Dirichlet boundary problem for the Laplace operator. We begin by extending Lemma 2.3 to situations where $V_\nu \nearrow V$, with $V(x)$ possibly equal to $+\infty$ on a big set. We have the following analogue of Lemma 2.3.

Lemma 3.1. Let $V_\nu \in L^1_{\mathrm{loc}}(\mathbb{R}^n)$, $-K \le V_\nu \nearrow V$, with possibly $V(x) = +\infty$ on a set of positive measure. Then $e_V(\omega)$, given by (2.10), is well defined in $L^\infty(\mathfrak{P})$, provided we set $e^{-\infty} = 0$, and $e_{V_\nu} \to e_V$ boundedly and in measure on $\mathfrak{P}$, for each $t$.


Proof. This follows from the monotone convergence theorem. 


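Thus we again have convergence with bounds in (2.11)-(2.13). We will look at a special class of such sequences. Let $\Omega \subset \mathbb{R}^n$ be open, with smooth boundary (in fact, Lipschitz boundary will more than suffice), and set $E = \mathbb{R}^n \setminus \Omega$. Let $V_\nu \ge 0$ be continuous and bounded on $\mathbb{R}^n$ and satisfy

(3.1)  $V_\nu = 0 \text{ on } \overline{\Omega}, \quad V_\nu \ge \nu \text{ on } E_\nu, \quad V_\nu \nearrow,$

where $E_\nu$ is the set of points of distance $\ge 1/\nu$ from $\Omega$. Given $f \in L^2(\mathbb{R}^n)$, $g \in L^2(\Omega)$, set $P_\Omega f = f|_\Omega \in L^2(\Omega)$, and define $E_\Omega g \in L^2(\mathbb{R}^n)$ to be $g(x)$ for $x \in \Omega$, 0 for $x \in E = \mathbb{R}^n \setminus \Omega$.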


Proposition 3.2. Under the hypotheses above, if $f \in L^2(\mathbb{R}^n)$, then

(3.2)  $e^{t(\Delta - V_\nu)} f \to E_\Omega\, e^{t\Delta_\Omega}(P_\Omega f),$

as $\nu \to \infty$, where $\Delta_\Omega$ is the Laplace operator with Dirichlet boundary condition on $\Omega$.


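Proof. We will first show that, for any $\lambda > 0$,

(3.3)  $(\lambda - \Delta + V_\nu)^{-1} f \to E_\Omega (\lambda - \Delta_\Omega)^{-1} P_\Omega f.$

Indeed, denote the left side of (3.3) by $u_\nu$, so $(\lambda - \Delta + V_\nu)u_\nu = f$. Taking the inner product with $u_\nu$, we have

(3.4)  $\lambda\|u_\nu\|_{L^2}^2 + \|\nabla u_\nu\|_{L^2}^2 + \int V_\nu |u_\nu|^2\,dx = (f, u_\nu) \le \frac{\lambda}{2}\|u_\nu\|_{L^2}^2 + \frac{1}{2\lambda}\|f\|_{L^2}^2,$

so

(3.5)  $\frac{\lambda}{2}\|u_\nu\|_{L^2}^2 + \|\nabla u_\nu\|_{L^2}^2 + \int V_\nu |u_\nu|^2\,dx \le \frac{1}{2\lambda}\|f\|_{L^2}^2.$

Thus, for fixed $\lambda > 0$, $\{u_\nu : \nu \in \mathbb{Z}^+\}$ is bounded in $H^1(\mathbb{R}^n)$, while $\int_{E_\nu} |u_\nu|^2\,dx \le C/\nu$. Thus $\{u_\nu\}$ has a weak limit point $u \in H^1(\mathbb{R}^n)$, and $u = 0$ on $\bigcup E_\nu$. The regularity hypothesized for $\partial\Omega$ implies $u \in H_0^1(\Omega)$. Clearly, $(\lambda - \Delta)u = f$ on $\Omega$, so (3.3) follows, with weak convergence in $H^1(\mathbb{R}^n)$. But note that, parallel to (3.4),

$\lambda\|u\|_{L^2}^2 + \|\nabla u\|_{L^2}^2 = (f, u) = \lim_{\nu\to\infty} (f, u_\nu),$

so

(3.6)  $\lambda\|u\|_{L^2}^2 + \|\nabla u\|_{L^2}^2 \ge \limsup_{\nu\to\infty}\ \lambda\|u_\nu\|_{L^2}^2 + \|\nabla u_\nu\|_{L^2}^2.$

Hence, in fact, we have $H^1$-norm convergence in (3.3), and a fortiori $L^2$-norm convergence.

Now consider the set $\mathcal{F}$ of real-valued $\varphi \in C_o([0,\infty))$ such that, for all $f \in L^2(\mathbb{R}^n)$,

(3.7)  $\varphi(-\Delta + V_\nu) f \to E_\Omega\, \varphi(-\Delta_\Omega) P_\Omega f, \quad \text{in } L^2(\mathbb{R}^n)\text{-norm},$

where $\varphi(H)$ is defined via the spectral theorem for a self-adjoint operator $H$. (Material on this functional calculus can be found in §1 of Chap. 8.) The analysis above shows that, for each $\lambda > 0$, $r_\lambda(s) = (\lambda + s)^{-1}$ belongs to $\mathcal{F}$. Since $P_\Omega E_\Omega$ is the identity on $L^2(\Omega)$, it is clear that $\mathcal{F}$ is an algebra; it is also easily seen to be a closed subset of $C_o([0,\infty))$. Since it contains $r_\lambda$ for $\lambda > 0$, it separates points, so by the Stone-Weierstrass theorem all real-valued $\varphi \in C_o([0, \infty))$ belong to $\mathcal{F}$. This proves (3.2).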


The version of (2.12) we have this time is the following. 


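Proposition 3.3. Let $\Omega \subset \mathbb{R}^n$ be open, with smooth boundary, or more generally with the property that

$\{u \in H^1(\mathbb{R}^n) : \operatorname{supp} u \subset \overline{\Omega}\} = H_0^1(\Omega).$

Let $F \in C_0^\infty(\mathbb{R}^n)$, $f = F|_\Omega$. Then, for all $x \in \Omega$, $t \ge 0$,

(3.8)  $e^{t\Delta_\Omega} f(x) = E_x\Bigl(F(\omega(t))\, e^{-\int_0^t \ell_\Omega(\omega(\tau))\,d\tau}\Bigr).$

On the left, $e^{t\Delta_\Omega}$ is the solution operator to the heat equation on $\mathbb{R}^+ \times \Omega$ with Dirichlet boundary condition on $\partial\Omega$, and in the expression on the right

(3.9)  $\ell_\Omega(x) = 0 \text{ on } \overline{\Omega}, \quad +\infty \text{ on } \mathbb{R}^n \setminus \overline{\Omega} = \mathring{E}.$

Note that, for $\omega$ continuous,

(3.10)  $e^{-\int_0^t \ell_\Omega(\omega(\tau))\,d\tau} = \psi_{\overline{\Omega}}(\omega, t) = 1 \text{ if } \omega([0,t]) \subset \overline{\Omega}, \quad 0 \text{ otherwise}.$

The second identity defines $\psi_{\overline{\Omega}}(\omega, t)$. Of course, for $\omega$ continuous, $\omega([0, t]) \subset \overline{\Omega}$ if and only if $\omega([0,t] \cap \mathbb{Q}) \subset \overline{\Omega}$.

We now extend Proposition 3.3 to the case where $\Omega \subset \mathbb{R}^n$ is open, with no regularity hypothesis on $\partial\Omega$. Choose a sequence $\Omega_j$ of open regions with smooth boundary, such that $\Omega_j \subset\subset \Omega_{j+1} \subset\subset \cdots$, $\bigcup_j \Omega_j = \Omega$. Let $\Delta_j$ denote the Laplace operator on $\Omega_j$, with Dirichlet boundary condition, and let $\Delta$ denote that of $\Omega$, also with Dirichlet boundary condition.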


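Lemma 3.4. Given $f \in L^2(\Omega)$, $t \ge 0$,

(3.11)  $e^{t\Delta} f = \lim_{j\to\infty} E_j\, e^{t\Delta_j} P_j f,$

where $P_j f = f|_{\Omega_j}$, and, for $g \in L^2(\Omega_j)$, $E_j g(x) = g(x)$ for $x \in \Omega_j$, 0 for $x \in \Omega \setminus \Omega_j$.

Proof. Methods of Chap. 5, §5, show that, for $\lambda > 0$,

(3.12)  $E_j(\lambda - \Delta_j)^{-1} P_j f \to (\lambda - \Delta)^{-1} f$

in $L^2$-norm, and then (3.11) follows from this, by reasoning used in the proof of Proposition 3.2.


Suppose $f \in C_0^\infty(\Omega_L)$. Then, for $j \ge L$, $E_j e^{t\Delta_j} f \to e^{t\Delta} f$ in $L^2$-norm, as we have just seen. Furthermore, local regularity implies

(3.13)  $E_j\, e^{t\Delta_j} f \to e^{t\Delta} f \quad \text{locally uniformly on } \Omega.$

Thus, given such $f$, and any $x \in \Omega$ (hence $x \in \Omega_j$ for $j$ large),

(3.14)  $e^{t\Delta} f(x) = \lim_{j\to\infty} E_x\bigl(f(\omega(t))\, \psi_{\overline{\Omega}_j}(\omega, t)\bigr).$

Now, as $j \to \infty$,

(3.15)  $\psi_{\overline{\Omega}_j}(\omega, t) \nearrow \psi_\Omega(\omega, t),$

where we define

(3.16)  $\psi_\Omega(\omega, t) = 1 \text{ if } \omega([0,t]) \subset \Omega, \quad 0 \text{ otherwise}.$

This yields the following:


Proposition 3.5. For any open $\Omega \subset \mathbb{R}^n$, given $f \in C_0^\infty(\Omega)$, $x \in \Omega$,

(3.17)  $e^{t\Delta} f(x) = E_x\bigl(f(\omega(t))\, \psi_\Omega(\omega, t)\bigr).$

In particular, if $\Omega$ has smooth boundary, one can use either $\psi_\Omega(\omega, t)$ or $\psi_{\overline{\Omega}}(\omega, t)$ in the formula for $e^{t\Delta} f(x)$. However, if $\partial\Omega$ is not smooth, it is $\psi_\Omega(\omega, t)$ that one must use.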


It is useful to extend this result to more general $f$. Suppose $f_j \in C_0^\infty(\Omega)$, $f \in L^2(\Omega)$, and $f_j(x) \searrow f(x)$ for each $x \in \Omega$. Then, for any $t > 0$, $e^{t\Delta} f_j \to e^{t\Delta} f$ in $L^2(\Omega) \cap C^\infty(\Omega)$, while, for each $x \in \Omega$, $E_x(f_j(\omega(t))\psi_\Omega(\omega, t))$ converges $\searrow$ to the right side of (3.17), by the monotone convergence theorem. Hence (3.17) holds for all such $f$; denote this class by $\mathcal{L}(\Omega)$. Clearly, the characteristic function $\chi_K \in \mathcal{L}(\Omega)$ for each compact $K \subset \Omega$.

By the same reasoning, the class of functions in $L^2(\Omega)$ for which (3.17) holds is closed under forming monotone limits, either $f_j \nearrow f$ or $f_j \searrow f$, of sequences bounded in $L^2(\Omega)$. An argument used in Lemma 2.2 shows that modifying $f \in L^2(\Omega)$ on a set of measure zero does not change the right side of (3.17). If $S \subset \Omega$ is measurable, then

$\chi_S(x) = \lim_{j\to\infty} \chi_{K_j}(x), \quad \text{a.e.},$

for an increasing sequence of compact sets $K_j \subset S$, so (3.17) holds for $f = \chi_S$. Thus it holds for finite linear combinations of such characteristic functions, and an easy limiting argument gives the following:


Proposition 3.6. The identity (3.17) holds for all $f \in L^2(\Omega)$ when $t > 0$, $x \in \Omega$.


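Suppose now that $\Omega$ is bounded. Then, for $f \in L^p(\Omega)$, $1 \le p < \infty$,

(3.18)  $-\Delta^{-1} f = \int_0^\infty e^{t\Delta} f\,dt,$

the integral being absolutely convergent in $L^p$-norm. If $f \in C_0^\infty(\Omega)$, we hence have, for each $x \in \Omega$,

(3.19)  $-\Delta^{-1} f(x) = E_x\Bigl(\int_0^\infty f(\omega(t))\, \psi_\Omega(\omega, t)\,dt\Bigr).$

Furthermore, by an argument such as used to prove Proposition 3.6, this identity holds for almost every $x \in \Omega$, given $f \in L^2(\Omega)$, and for every $x$ if $f_j \in C_0^\infty(\Omega)$ and $f_j(x) \nearrow f(x)$ for all $x$. In particular, for $\Omega$ bounded,

(3.20)  $-\Delta^{-1} 1(x) = E_x(\vartheta_\Omega(\omega)), \quad x \in \Omega,$

where, if $\omega$ is a continuous path starting inside $\Omega$, we define

(3.21)  $\vartheta_\Omega(\omega) = \int_0^\infty \psi_\Omega(\omega, t)\,dt = \sup\{t : \omega([0, t]) \subset \Omega\} = \min\{t : \omega(t) \in \partial\Omega\}.$

In other words, $\vartheta_\Omega(\omega)$ is the first time $\omega(t)$ hits $\partial\Omega$; it is called the "first exit time." Since $\Delta^{-1}1 \in C^\infty(\Omega)$, it is clear that the first exit time for a path starting at any $x \in \Omega$ is finite for $W_x$-almost every $\omega$ when $\Omega$ is bounded. (If $\omega$ starts at a point in $\partial\Omega$ or in $\mathbb{R}^n \setminus \overline{\Omega}$, set $\vartheta_\Omega(\omega) = 0$.) Note that we can write

(3.22)  $-\Delta^{-1} f(x) = E_x\Bigl(\int_0^{\vartheta_\Omega(\omega)} f(\omega(t))\,dt\Bigr).$

If $\partial\Omega$ is smooth enough for Proposition 3.3 to hold, we have the formula (3.19), with $\psi_\Omega(\omega, t)$ replaced by $\psi_{\overline{\Omega}}(\omega, t)$, valid for all $x \in \Omega$. In particular, for smooth bounded $\Omega$,

(3.23)  $-\Delta^{-1} 1(x) = E_x(\vartheta_{\overline{\Omega}}(\omega)), \quad x \in \Omega,$

where we define

(3.24)  $\vartheta_{\overline{\Omega}}(\omega) = \inf\{t : \omega(t) \in \mathbb{R}^n \setminus \overline{\Omega}\} = \max\{t : \omega([0, t]) \subset \overline{\Omega}\}.$

(If $\omega(0) \in \mathbb{R}^n \setminus \overline{\Omega}$, set $\vartheta_{\overline{\Omega}}(\omega) = 0$.) Comparing this with (3.20), noting that $\vartheta_{\overline{\Omega}}(\omega) \ge \vartheta_\Omega(\omega)$, we have the next result.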


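Proposition 3.7. If $\Omega$ is bounded and $\partial\Omega$ is smooth enough for Proposition 3.3 to hold, then

(3.25)  $x \in \Omega \Longrightarrow \vartheta_{\overline{\Omega}}(\omega) = \vartheta_\Omega(\omega), \quad \text{for } W_x\text{-almost every } \omega,$

and

(3.26)  $x \in \partial\Omega \Longrightarrow \vartheta_{\overline{\Omega}}(\omega) = 0, \quad \text{for } W_x\text{-almost every } \omega.$


The probabilistic interpretation of this result is that, for any $x \in \Omega$, once a Brownian path $\omega$ starting at $x$ hits $\partial\Omega$, it penetrates into the interior of $\mathbb{R}^n \setminus \Omega$ within an arbitrarily short time, for $W_x$-almost all $\omega$. From here one can show that, given $x \in \partial\Omega$, $W_x$-a.e. path $\omega$ spends a positive amount of time in both $\Omega$ and $\mathbb{R}^n \setminus \overline{\Omega}$, on any time interval $[0, s_0]$, for any $s_0 > 0$, however small. This is one manifestation of how wiggly Brownian paths are.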
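The identity (3.20) can be tested by simulation in the simplest bounded domain. The sketch below assumes $n = 1$ and $\Omega = (-L, L)$, where $-u'' = 1$ with $u(\pm L) = 0$ gives $u(x) = (L^2 - x^2)/2$, so the mean first exit time from the origin of $(-1, 1)$ should be $1/2$ under this chapter's convention (increments of variance $2\,dt$). The function names, time step, and sample size are illustrative, and a discretely sampled path slightly overestimates the exit time.

```python
import math
import random

def mean_exit_time(x0, half_width=1.0, dt=1e-3, n_paths=1500, seed=1):
    """Monte Carlo estimate of E_x0[first exit time from (-L, L)]."""
    rng = random.Random(seed)
    sigma = math.sqrt(2.0 * dt)  # variance 2*dt per step
    total = 0.0
    for _ in range(n_paths):
        w, t = x0, 0.0
        while abs(w) < half_width:
            w += sigma * rng.gauss(0.0, 1.0)
            t += dt
        total += t
    return total / n_paths

def exact_mean_exit_time(x0, half_width=1.0):
    """Solution of -u'' = 1 on (-L, L) with u(-L) = u(L) = 0."""
    return (half_width ** 2 - x0 ** 2) / 2.0
```

Reducing `dt` narrows the gap between `mean_exit_time(0.0)` and `exact_mean_exit_time(0.0)`, at the cost of longer paths.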

Note that taking $f = 1$ in (3.17) gives, for all $x \in \Omega$, any open set in $\mathbb{R}^n$,

(3.27)  $e^{t\Delta} 1(x) = W_x(\{\omega : \vartheta_\Omega(\omega) > t\}), \quad x \in \Omega,$

the right side being the probability that a path starting in $\Omega$ at $x$ has first exit time $> t$. Meanwhile, if $\partial\Omega$ is regular enough for Proposition 3.3 to hold, then

(3.28)  $e^{t\Delta} 1(x) = W_x(\{\omega : \vartheta_{\overline{\Omega}}(\omega) > t\}).$

Comparing these identities extends Proposition 3.7 to unbounded $\Omega$.
The following is an interesting consequence of (3.28).


Proposition 3.8. For one-dimensional Brownian motion, starting at the origin, given $t > 0$, $\lambda > 0$,

(3.29)  $W\bigl(\{\omega : \sup_{0 \le s \le t} \omega(s) \ge \lambda\}\bigr) = 2W(\{\omega : \omega(t) \ge \lambda\}).$

Proof. The right side is $2\int_\lambda^\infty p(t,x)\,dx$, with $p(t,x) = e^{t\Delta}\delta(x) = (4\pi t)^{-1/2} e^{-x^2/4t}$, the $n = 1$ case of (1.5). The left side of (3.29) is the same as $W(\{\omega : \vartheta_{(-\infty,\lambda)}(\omega) \le t\})$, which by (3.28) is equal to $1 - e^{tL}1(0)$ if $L = d^2/dx^2$ on $(-\infty, \lambda)$, with Dirichlet boundary condition at $x = \lambda$. By the method of images we have, for $x < \lambda$,

$e^{tL}1(x) = \int_{-\infty}^\infty p(t, y)\, H(\lambda - x + y)\,dy,$

where $H(s) = 1$ for $s > 0$, $-1$ for $s < 0$. From this, the identity (3.29) readily follows.
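The reflection identity (3.29) can be checked by a rough simulation. The sketch below is illustrative only: paths are sampled on a uniform time grid with increments of variance $2\,dt$ (this chapter's normalization), so the discrete maximum slightly undercounts crossings. The right side of (3.29) is evaluated in closed form, since $\omega(t)$ is Gaussian with variance $2t$. Function names, grid size, and seed are assumptions.

```python
import math
import random

def prob_max_exceeds(lam, t, n_paths=3000, n_steps=400, seed=2):
    """Monte Carlo estimate of W(sup_{0<=s<=t} w(s) >= lam), paths from 0."""
    rng = random.Random(seed)
    sigma = math.sqrt(2.0 * t / n_steps)
    hits = 0
    for _ in range(n_paths):
        w = 0.0
        for _ in range(n_steps):
            w += sigma * rng.gauss(0.0, 1.0)
            if w >= lam:
                hits += 1
                break
    return hits / n_paths

def twice_tail(lam, t):
    """2 W(w(t) >= lam) for w(t) ~ N(0, 2t), i.e. erfc(lam / (2 sqrt(t)))."""
    return math.erfc(lam / (2.0 * math.sqrt(t)))
```

With `lam = 1.0` and `t = 1.0` the two quantities should be close, the simulated value sitting a little below the exact one because of the time discretization.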


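We next derive an expression for the Poisson integral formula, for the solution $\mathrm{PI}\, f = u$ to

(3.30)  $\Delta u = 0 \text{ on } \Omega, \quad u|_{\partial\Omega} = f.$

This can be expressed in terms of the integral kernel $G(x, y)$ of $\Delta^{-1}$ if $\partial\Omega$ is smooth. In fact, an application of Green's formula gives

(3.31)  $\mathrm{PI}\, f(x) = \int_{\partial\Omega} f(y)\, \frac{\partial}{\partial\nu_y} G(x, y)\,dS(y),$

where $\nu_y$ is the outward normal to $\partial\Omega$ at $y$. A closely related result is the following. Let $f$ be defined and continuous on a neighborhood of $\partial\Omega$. Given small $\delta > 0$, set

(3.32)  $S_\delta = \{x \in \Omega : \operatorname{dist}(x, \partial\Omega) < \delta\},$

and define $u_\delta$ by

(3.33)  $\Delta u_\delta = \delta^{-2} f_\delta \text{ on } \Omega, \quad u_\delta = 0 \text{ on } \partial\Omega, \qquad f_\delta = f \text{ on } S_\delta, \quad 0 \text{ on } \Omega \setminus S_\delta.$

Lemma 3.9. If $\partial\Omega$ is smooth, then, locally uniformly on $\Omega$,

(3.34)  $\lim_{\delta \to 0} u_\delta = -\frac{1}{2}\, \mathrm{PI}\, f.$


Proof. If $\nu$ is the outward normal, we have

(3.35)  $u_\delta(x) = \delta^{-2} \int_{\partial\Omega} \int_0^\delta G(x, y - s\nu)\, f(y)\,ds\,dS(y) + o(1)$
        $= -\delta^{-2} \int_{\partial\Omega} f(y)\, \frac{\partial}{\partial\nu_y} G(x, y) \Bigl(\int_0^\delta s\,ds\Bigr) dS(y) + o(1)$
        $= -\frac{1}{2} \int_{\partial\Omega} f(y)\, \frac{\partial}{\partial\nu_y} G(x, y)\,dS(y) + o(1),$

so the result follows from (3.31).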
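Lemma 3.9 can be made concrete in one dimension. The sketch below assumes $\Omega = (0, 1)$, where the kernel of $(d^2/dx^2)^{-1}$ with Dirichlet conditions is $G(x, y) = -\min(x,y)\,(1 - \max(x,y))$ and $\mathrm{PI}\, f$ is the affine function with the given boundary values. The data $f$ is taken locally constant near each endpoint (values `f0`, `f1`), a simplifying assumption under which $u_\delta(x) = -\tfrac12 \mathrm{PI}\, f(x)$ holds exactly for $\delta < x < 1 - \delta$. The integral defining $u_\delta$ is evaluated with the midpoint rule, which is exact here because the integrand is linear on each boundary strip.

```python
def green(x, y):
    """Kernel of the inverse of d^2/dx^2 on (0, 1), Dirichlet conditions."""
    return -min(x, y) * (1.0 - max(x, y))

def u_delta(x, delta, f0, f1, n=2000):
    """delta^{-2} * integral of G(x, y) f_delta(y) dy (midpoint rule).

    f_delta equals f0 on (0, delta), f1 on (1 - delta, 1), 0 elsewhere.
    """
    h = delta / n
    left = sum(green(x, (i + 0.5) * h) for i in range(n)) * h * f0
    right = sum(green(x, 1.0 - (i + 0.5) * h) for i in range(n)) * h * f1
    return (left + right) / delta ** 2

def poisson_integral(x, f0, f1):
    """Harmonic (affine) function on (0, 1) with boundary values f0, f1."""
    return f0 * (1.0 - x) + f1 * x
```

For instance, `u_delta(0.3, 0.05, 2.0, -1.0)` should match `-0.5 * poisson_integral(0.3, 2.0, -1.0)` up to rounding, in line with (3.34).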


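Comparing this with (3.22), we conclude that when $\partial\Omega$ is smooth,

(3.36)  $\mathrm{PI}\, f(x) = \lim_{\delta \to 0} \frac{2}{\delta^2}\, E_x\Bigl(\int_0^{\vartheta_\Omega(\omega)} f(\omega(t))\, \iota_{S_\delta}(\omega, t)\,dt\Bigr),$

where $S_\delta$ is as in (3.32), and, for $S \subset \Omega$,

(3.37)  $\iota_S(\omega, t) = 1 \text{ if } \omega(t) \in S, \quad 0 \text{ otherwise}.$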


We will discuss further formulas for PI f in §5. 


Exercises 


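1. Looking at the definitions, check that $\psi_{\overline{\Omega}}(\omega, t)$ and $\vartheta_{\overline{\Omega}}(\omega)$ are measurable when $\Omega \subset \mathbb{R}^n$ is open with smooth boundary and that $\psi_\Omega(\omega, t)$ and $\vartheta_\Omega(\omega)$ are measurable, for general open $\Omega \subset \mathbb{R}^n$.

2. Show that if $x \in O$, then

(3.38)  $\{\omega \in \mathfrak{P}_0 : \vartheta_{\overline{O}}(\omega) < t_0\} = \bigcup_{s \in [0, t_0) \cap \mathbb{Q}} \{\omega \in \mathfrak{P}_0 : \omega(s) \in \mathbb{R}^n \setminus \overline{O}\}.$

3. For any finite set $S = \{s_1, \dots, s_K\} \subset \mathbb{Q}^+$, $N \in \mathbb{Z}^+$, set

$F_{N,S}(\omega) = \Phi_{N,S}(\omega(s_1), \dots, \omega(s_K)),$
$\Phi_{N,S}(x_1, \dots, x_K) = \min\bigl(N, \min\{s_\nu : x_\nu \in \mathbb{R}^n \setminus \overline{O}\}\bigr).$

Show that, for any continuous path $\omega$,

(3.39)  $\vartheta_{\overline{O}}(\omega) = \sup_N \inf_S F_{N,S}(\omega).$

Note that the collection of such sets $S$ is countable.
4. If $\mathfrak{P}_{0,N} = \{\omega \in \mathfrak{P}_0 : \vartheta_{\overline{O}}(\omega) \le N\}$ and $O$ is bounded, show that

(3.40)  $W_x(\mathfrak{P}_0 \setminus \mathfrak{P}_{0,N}) \le CN^{-1}.$

(Hint: Use (3.23).)
5. If $\omega \in \mathfrak{P}_{0,N}$, show that

(3.41)  $\vartheta_{\overline{O}}(\omega) = \lim_{\nu \to \infty} \vartheta_{\nu,N}(\omega),$

where

$\vartheta_{\nu,N}(\omega) = \min\bigl(N, \inf\{s \in 2^{-\nu}\mathbb{Z}^+ : \omega(s) \notin \overline{O}\}\bigr).$

Write $\vartheta_{\nu,N}(\omega) = \Phi_{\nu,N}(\omega(s_1), \dots, \omega(s_L))$, where $\Phi_{\nu,N}$ has a form similar to $\Phi_{N,S}$ in Exercise 3.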

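6. For one-dimensional Brownian motion, establish the following, known as Kolmogorov's inequality:

(3.42)  $W\bigl(\{\omega : \sup_{0 \le s \le t} |\omega(s)| \ge \varepsilon\}\bigr) \le \frac{2t}{\varepsilon^2}, \quad \varepsilon > 0.$

(Hint: Write the left side of (3.42) as $W(\{\omega : \vartheta_{(-\varepsilon,\varepsilon)}(\omega) \le t\})$, and relate this to the heat equation on $\Omega = [-\varepsilon, \varepsilon]$, with Dirichlet boundary condition, in a fashion parallel to the proof of Proposition 3.8.)

Note that this estimate is nontrivial only for $t < \varepsilon^2/2$. By Brownian scaling, it suffices to consider the case $\varepsilon = 1$. Compare the estimate

$W\bigl(\{\omega : \sup_{0 \le s \le t} |\omega(s)| \ge \varepsilon\}\bigr) \le 4\int_\varepsilon^\infty p(t, x)\,dx,$

which follows from (3.29).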
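7. Given $\Omega \subset \mathbb{R}^n$ open, with complement $K$, and $\Delta$ with Dirichlet boundary condition on $\partial\Omega$, show that, for $x \in \Omega$,

(3.43)  $W_x(\{\omega : \vartheta_\Omega(\omega) = \infty\}) = H_K(x),$

where

(3.44)  $H_K(t, x) = e^{t\Delta}1(x) \searrow H_K(x), \quad \text{as } t \nearrow \infty.$

8. Suppose that $K = \mathbb{R}^n \setminus \Omega$ is compact, and suppose there exists $\widetilde{H}_K(x) \in C(\overline{\Omega})$, harmonic on $\Omega$, such that $\widetilde{H}_K = 0$ on $\partial K$ and $\widetilde{H}_K(x) \to 1$, as $|x| \to \infty$. Show that

$H_K(t, x) \ge \widetilde{H}_K(x), \quad \text{for all } t < \infty.$

(Hint: Show that $\Delta H_K(t, x) \le 0$ and that $H_K(t, x) \to 1$ as $|x| \to \infty$, and use the maximum principle.)
Deduce that if such $\widetilde{H}_K(x)$ exists, then $W_x(\{\omega : \vartheta_\Omega(\omega) = \infty\}) > 0$.

9. In the context of Exercise 8, show that if such $\widetilde{H}_K$ exists, then in fact

(3.45)  $H_K(x) = \widetilde{H}_K(x), \quad \text{for all } x \in \Omega.$

(Hint: Show that $H_K$ must be harmonic in $\Omega$ and that $\limsup_{|x| \to \infty} H_K(x) \le 1$.)
By explicit construction, produce such a function on $\mathbb{R}^n \setminus B$ when $B$ is a ball of radius $a > 0$, provided $n \ge 3$.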

10. Using Exercises 7-9, show that when $n \ge 3$,

(3.46)  $W_x(\{\omega : |\omega(t)| \to \infty \text{ as } t \to \infty\}) = 1.$

(Hint: Given $R > 0$, the probability that $|\omega(t)| \ge R$ for some $t$ is 1. If $R \gg a$, and $|\omega(t_0)| \ge R$, show that the probability that $|\omega(t_0 + s)| \le a$ for some $s > 0$ is small, using (3.43) for $K = B_a = \{x : |x| \le a\}$.) To restate (3.46), one says that Brownian motion in $\mathbb{R}^n$ is "non-recurrent," for $n \ge 3$.

11. If $n \le 2$ and $K = B_a$, show that $H_K(t, x) \to 0$ in (3.44), and hence the probability defined in (3.46) is zero. Deduce that if $n \le 2$ and $U \subset \mathbb{R}^n$ is a nonempty open set, almost every Brownian path $\omega$ visits $U$ at an infinite sequence of times $t_\nu \to \infty$.

One says that Brownian motion in $\mathbb{R}^n$ is "recurrent," for $n \le 2$.

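12. Relate the formula (3.34) for $\mathrm{PI}\, f$ to representations of $\mathrm{PI}\, f$ by double-layer potentials, discussed in §11 of Chap. 7. Where is the second layer coming from?

13. If $\Omega$ is a bounded domain with smooth boundary, show that (3.36) remains true with $S_\delta$ replaced by

$\widetilde{S}_\delta = \{x \in \mathbb{R}^n \setminus \Omega : \operatorname{dist}(x, \partial\Omega) < \delta\}$

and with $\vartheta_\Omega(\omega)$ replaced by $\vartheta_{\Omega_\delta}(\omega)$, where $\Omega_\delta = \Omega \cup \widetilde{S}_\delta$. (Hint: Start by showing that $\tilde{u}_\delta(x) \to -(1/2)\,\mathrm{PI}\, f(x)$, for $x \in \Omega$, where, in place of (3.33),

$\Delta \tilde{u}_\delta = \delta^{-2} \tilde{f}_\delta \text{ on } \Omega_\delta, \quad \tilde{u}_\delta = 0 \text{ on } \partial\Omega_\delta,$

with $\tilde{f}_\delta = f$ on $\widetilde{S}_\delta$, 0 on $\Omega$.)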


4. Martingales, stopping times, and the strong 
Markov property 


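Given $t \in [0, \infty)$, let $\mathfrak{B}_t$ be the $\sigma$-field of subsets of $\mathfrak{P}_0$ generated by sets of the form

(4.1)  $\{\omega \in \mathfrak{P}_0 : \omega(s) \in E\},$

where $s \in [0,t]$ and $E$ is a Borel subset of $\mathbb{R}^n$. One easily sees that each element of $\mathfrak{B}_t$ is a Borel set in $\mathfrak{P}_0$. As $t$ increases, $\mathfrak{B}_t$ is an increasing family of $\sigma$-fields, each consisting of sets which are $W_x$-measurable, for all $x \in \mathbb{R}^n$. Set $\mathfrak{B}_\infty = \sigma\bigl(\bigcup_{t < \infty} \mathfrak{B}_t\bigr)$.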
Given $f \in L^1(\mathfrak{P}_0, \mathfrak{B}_\infty, dW_x)$, we can define the conditional expectation

(4.2)  $E_x(f \,|\, \mathfrak{B}_t),$

a function measurable with respect to $\mathfrak{B}_t$, as follows. Denote by $W_{x,t}$ the restriction of the Wiener measure $W_x$ to the $\sigma$-field $\mathfrak{B}_t$. Then

(4.3)  $\lambda(S) = \int_S f(\omega)\,dW_x(\omega) = E_x(f\chi_S)$

defines a countably additive set function on $\mathfrak{B}_t$, which is absolutely continuous with respect to $W_{x,t}$, so by the Radon-Nikodym theorem there exists a $\mathfrak{B}_t$-measurable function $\Phi_t$, uniquely defined $W_{x,t}$-almost everywhere, such that (4.3) is equal to $\int_S \Phi_t(\omega)\,dW_{x,t}(\omega)$, for all $S \in \mathfrak{B}_t$. This function is $E_x(f \,|\, \mathfrak{B}_t)$. Clearly,

(4.4)  $f \in L^1(\mathfrak{P}_0, \mathfrak{B}_\infty, dW_x) \Longrightarrow E_x(f \,|\, \mathfrak{B}_t) \in L^1(\mathfrak{P}_0, \mathfrak{B}_t, dW_{x,t}).$

This construction of conditional expectation generalizes in the obvious way to any situation where $f$ is measurable with respect to some $\sigma$-field $\mathfrak{F}$, and is $L^1$ with respect to a given probability measure on $\mathfrak{F}$, and one wants to define the conditional expectation $E(f \,|\, \mathfrak{F}_0)$ with respect to some sub-$\sigma$-field $\mathfrak{F}_0$ of $\mathfrak{F}$.

Note that we can regard $L^1(\mathfrak{P}_0, \mathfrak{B}_t, dW_{x,t})$ naturally as a closed linear subspace of $L^1(\mathfrak{P}_0, \mathfrak{B}_\infty, dW_x)$. Then the map $f \mapsto E_x(f \,|\, \mathfrak{B}_t)$ is a projection. Similarly, we have

$f \in L^2(\mathfrak{P}_0, \mathfrak{B}_\infty, dW_x) \Longrightarrow E_x(f \,|\, \mathfrak{B}_t) \in L^2(\mathfrak{P}_0, \mathfrak{B}_t, dW_{x,t}),$

and in this case $E_x(f \,|\, \mathfrak{B}_t)$ is simply the orthogonal projection of $f$ onto $L^2(\mathfrak{P}_0, \mathfrak{B}_t, dW_{x,t})$, regarded as a linear subspace of $L^2(\mathfrak{P}_0, \mathfrak{B}_\infty, dW_x)$. The reader might think of this in light of von Neumann's proof of the Radon-Nikodym theorem, which is sketched in the exercises for §2 of Appendix A.
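The description of conditional expectation as an orthogonal projection is easy to see on a finite probability space, where a sub-$\sigma$-field is generated by a partition and conditioning amounts to averaging over each block. The sketch below is a toy model under that assumption; the sample space, weights, and function names are hypothetical and stand in for $(\mathfrak{P}_0, \mathfrak{B}_\infty, W_x)$ and $\mathfrak{B}_t$.

```python
def conditional_expectation(f, prob, blocks):
    """E(f | F_0) on a finite sample space {0, ..., len(f)-1}.

    `prob[i]` is the probability of point i, and `blocks` is a partition
    of the sample space generating the sub-sigma-field F_0. The result is
    constant on each block, equal to the prob-weighted average of f there.
    """
    result = [0.0] * len(f)
    for block in blocks:
        mass = sum(prob[i] for i in block)
        avg = sum(f[i] * prob[i] for i in block) / mass
        for i in block:
            result[i] = avg
    return result

def inner(u, v, prob):
    """Inner product in L^2 of the finite probability space."""
    return sum(ui * vi * pi for ui, vi, pi in zip(u, v, prob))
```

With `g = conditional_expectation(f, prob, blocks)`, the residual `f - g` is orthogonal to `g` (and to every function constant on the blocks), applying the map twice returns `g` again, and the integrals of `f` and `g` over each block coincide, which is the finite analogue of the defining property (4.3).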

The following is a statement that Brownian motion possesses the Markov property.


Proposition 4.1. Given $s, t > 0$, $f \in C(\mathbb{R}^n)$,

(4.5)  $E_x\bigl(f(\omega(t + s)) \,|\, \mathfrak{B}_s\bigr) = E_{\omega(s)}\bigl(f(\omega(t))\bigr), \quad \text{for } W_x\text{-almost all } \omega.$


Proof. The right side of (4.5) is $\mathfrak{B}_s$-measurable, so the identity is equivalent to the statement that

(4.6)  $\int_S f(\omega(t+s))\,dW_x(\omega) = \int_S \Bigl(\int f(\omega^\#(t))\,dW_{\omega(s)}(\omega^\#)\Bigr) dW_x(\omega),$

for all $S \in \mathfrak{B}_s$. It suffices to verify (4.6) for all $S$ of the form

$S = \{\omega \in \mathfrak{P}_0 : \omega(t_1) \in E_1, \dots, \omega(t_K) \in E_K\},$

given $t_j \in [0,s]$, $E_j$ Borel sets in $\mathbb{R}^n$. For such $S$, (4.6) follows directly from the characterization of the Wiener integral given in §1, that is, from (1.6)-(1.9) in the case $x = 0$, together with the identity

(4.7)  $\int f(\omega^\#(t))\,dW_{\omega(s)}(\omega^\#) = \int f\bigl(\omega^\#(t) + \omega(s)\bigr)\,dW(\omega^\#)$

used to define (1.36).
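We can easily extend (4.5) to

(4.8)  $E_x\bigl(F(\omega(s+t_1), \dots, \omega(s+t_k)) \,|\, \mathfrak{B}_s\bigr) = E_{\omega(s)}\bigl(F(\omega(t_1), \dots, \omega(t_k))\bigr),$

for $W_x$-almost all $\omega$, given $t_1, \dots, t_k > 0$, and $F$ continuous on $\prod_1^k \mathbb{R}^n$, as in (1.8). Also, standard limiting arguments allow us to enlarge the class of functions $F$ for which this works. We then get the following more definitive statement of the Markov property.


Proposition 4.2. For $s > 0$, define the map

(4.9)  $\sigma_s : \mathfrak{P}_0 \to \mathfrak{P}_0, \quad (\sigma_s\omega)(t) = \omega(t + s).$

Then, given $\varphi$ bounded and $\mathfrak{B}_\infty$-measurable, we have

(4.10)  $E_x(\varphi \circ \sigma_s \,|\, \mathfrak{B}_s) = E_{\omega(s)}(\varphi), \quad \text{for } W_x\text{-almost all } \omega.$

The following is a useful restatement of Proposition 4.2.

Corollary 4.3. For $s > 0$, define the map

(4.11)  $\vartheta_s : \mathfrak{P}_0 \to \mathfrak{P}_0, \quad (\vartheta_s\omega)(t) = \omega(t + s) - \omega(s).$

Then, given $\varphi \in L^1(\mathfrak{P}_0, dW_0)$, we have

(4.12)  $E_x(\varphi \circ \vartheta_s \,|\, \mathfrak{B}_s) = E_0(\varphi).$

In particular,

(4.13)  $E_x\bigl(f(\vartheta_s\omega(t)) \,|\, \mathfrak{B}_s\bigr) = E_0\bigl(f(\omega(t))\bigr).$

Note that (4.12) implies $\vartheta_s$ is measure preserving, in the sense that

(4.14)  $W_x\bigl(\vartheta_s^{-1}(S)\bigr) = W_0(S),$

for $W_0$-measurable sets $S$. The map $\vartheta_s$ is not one-to-one, of course, but it is onto the set of paths in $\mathfrak{P}_0$ satisfying $\omega(0) = 0$.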
The Markov property also implies certain independence properties. A function $\varphi \in L^1(\mathfrak{P}_0, dW_x)$ is said to be independent of the $\sigma$-algebra $\mathfrak{B}_t$ provided that, for all continuous $F$,

(4.15)  $\int_S F(\varphi(\omega))\,dW_x(\omega) = W_x(S)\, E_x(F \circ \varphi), \quad \forall\, S \in \mathfrak{B}_t.$

An equivalent condition is

(4.16)  $E_x\bigl(F(\varphi)\psi\bigr) = E_x\bigl(F(\varphi)\bigr) E_x(\psi), \quad \forall\, \psi \in L^1(\mathfrak{P}_0, \mathfrak{B}_t, dW_x),$

given $F(\varphi)\psi \in L^1(\mathfrak{P}_0, dW_x)$, and another equivalent condition is

(4.17)  $E_x\bigl(F(\varphi) \,|\, \mathfrak{B}_t\bigr) = E_x\bigl(F(\varphi)\bigr).$

In turn, this identity holds whenever the left side is constant. From Corollary 4.3 we deduce:


Corollary 4.4. For $s > 0$, $\vartheta_s\omega(t) = \omega(t + s) - \omega(s)$ is independent of $\mathfrak{B}_s$.

Proof. By (4.13),

(4.18)  $E_x\bigl(F(\omega(t + s) - \omega(s)) \,|\, \mathfrak{B}_s\bigr) = E_0\bigl(F(\omega(t))\bigr),$

which is constant.


The Markov property gives rise to martingales. By definition (valid in general for an increasing family $\mathfrak{B}_t$ of $\sigma$-fields), a martingale is a family $F_t \in L^1(\mathfrak{P}_0, \mathfrak{B}_t, dW_x)$ such that

(4.19)  $E_x(F_t \,|\, \mathfrak{B}_s) = F_s \quad \text{when } s < t.$

If $E_x(F_t \,|\, \mathfrak{B}_s) \ge F_s$ for $s < t$, $\{F_t\}$ is called a submartingale over $\mathfrak{B}_t$. The following is a very useful class of martingales.


Proposition 4.5. Let $h(t,x)$ be smooth in $t \ge 0$, $x \in \mathbb{R}^n$, and satisfy $|h(t, x)| \le C_\varepsilon e^{\varepsilon|x|^2}$ for all $\varepsilon > 0$, and the backward heat equation

(4.20)  $\frac{\partial h}{\partial t} = -\Delta h.$

Then $\mathfrak{h}_t(\omega) = h(t, \omega(t))$ is a martingale over $\mathfrak{B}_t$.

Proof. The hypothesis on $h(t, x)$ implies that, for $t, s > 0$,

(4.21)  $h(s, x) = \int p(t, y)\, h(t + s, x - y)\,dy,$

where $p(t, x) = e^{t\Delta}\delta(x)$ is given by (1.5). Now

(4.22)  $E_x(\mathfrak{h}_{t+s} \,|\, \mathfrak{B}_s) = E_x\bigl(h(t + s, \omega(t + s)) \,|\, \mathfrak{B}_s\bigr) = E_{\omega(s)}\bigl(h(t + s, \omega(t))\bigr),$

for $W_x$-almost all $\omega$, by (4.5). This is equal to

(4.23)  $\int p\bigl(t, y - \omega(s)\bigr)\, h(t + s, y)\,dy$

by the characterization (1.9) of expectation, adjusted as in (1.36), and by (4.21) this is equal to $h(s, \omega(s)) = \mathfrak{h}_s(\omega)$.


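Corollary 4.6. For one-dimensional Brownian motion, the following are martingales over $\mathfrak{B}_t$:

(4.24)  $\mathfrak{x}_t(\omega) = \omega(t), \quad \mathfrak{q}_t(\omega) = \omega(t)^2 - 2t, \quad \mathfrak{z}_t(\omega) = e^{a\omega(t) - a^2 t},$

given $a > 0$.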
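Each martingale in Corollary 4.6 comes from Proposition 4.5, so the underlying functions $h(t,x) = x$, $x^2 - 2t$, and $e^{ax - a^2 t}$ should satisfy the backward heat equation (4.20) with $n = 1$. The sketch below checks this numerically with central differences; the helper name, step size, and sample points are illustrative, and the residual is only expected to vanish up to discretization and rounding error.

```python
import math

def backward_heat_residual(h, t, x, step=1e-3):
    """Central-difference estimate of dh/dt + d^2h/dx^2 at (t, x).

    The residual is (approximately) zero exactly when h satisfies the
    backward heat equation dh/dt = -d^2h/dx^2 at that point.
    """
    dh_dt = (h(t + step, x) - h(t - step, x)) / (2.0 * step)
    d2h_dx2 = (h(t, x + step) - 2.0 * h(t, x) + h(t, x - step)) / step ** 2
    return dh_dt + d2h_dx2

a = 1.5  # any a > 0 works; this value is an arbitrary choice
candidates = {
    "x": lambda t, x: x,
    "x^2 - 2t": lambda t, x: x * x - 2.0 * t,
    "exp(ax - a^2 t)": lambda t, x: math.exp(a * x - a * a * t),
}
residuals = {name: backward_heat_residual(h, 0.5, 0.3)
             for name, h in candidates.items()}
```

By contrast, $h(t,x) = x^2$ without the $-2t$ correction leaves a residual of 2, reflecting the fact that $\omega(t)^2$ alone is a submartingale rather than a martingale.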


One important property of martingales is the following martingale maximal inequality.


Proposition 4.7. If $F_t$ is a martingale over $\mathfrak{B}_t$, then, given any countable set $\{t_j\} \subset \mathbb{R}^+$, the "maximal function"

(4.25)  $F^*(\omega) = \sup_j F_{t_j}(\omega)$

satisfies, for all $\lambda > 0$,

(4.26)  $W_x(\{\omega : F^*(\omega) > \lambda\}) \le \frac{1}{\lambda}\, \|F_t\|_{L^1(\mathfrak{P}_0, dW_x)}.$

Of course, the assumption that $F_t$ is a martingale implies that $\|F_t\|_{L^1}$ is independent of $t$.


Proof. It suffices to demonstrate this for an arbitrary finite subset $\{t_j\}$ of $\mathbb{R}^+$. Thus we can work with $f_j(\omega) = F_{t_j}(\omega)$, $\mathfrak{B}_j = \mathfrak{B}_{t_j}$, $1 \le j \le N$, and take $t_1 < t_2 < \cdots < t_N$, and the martingale hypothesis is that $E_x(f_k \,|\, \mathfrak{B}_j) = f_j$ when $j < k$. There is no loss in assuming $f_N(\omega) \ge 0$, so all $f_j(\omega) \ge 0$. Now consider

(4.27)  $S_\lambda = \{\omega : f^*(\omega) > \lambda\} = \{\omega : \text{some } f_j(\omega) > \lambda\}.$

There is a pairwise-disjoint decomposition

(4.28)  $S_\lambda = \bigcup_{j=1}^N S_{\lambda j}, \quad S_{\lambda j} = \{\omega : f_j(\omega) > \lambda \text{ but } f_\ell(\omega) \le \lambda \text{ for } \ell < j\}.$

Note that $S_{\lambda j}$ is $\mathfrak{B}_j$-measurable. Consequently, we have

(4.29)  $\int_{S_\lambda} f_N(\omega)\,dW_x(\omega) = \sum_{j=1}^N \int_{S_{\lambda j}} f_N(\omega)\,dW_x(\omega) = \sum_{j=1}^N \int_{S_{\lambda j}} f_j(\omega)\,dW_x(\omega) \ge \lambda \sum_{j=1}^N W_x(S_{\lambda j}) = \lambda\, W_x(S_\lambda).$

This yields (4.26), in this special case, and the proposition is hence proved.


Applying the martingale maximal inequality to $\mathfrak{z}_t(\omega) = e^{a\omega(t) - a^2 t}$, we obtain the following.


Corollary 4.8. For one-dimensional Brownian motion, given $t > 0$,

(4.30)  $W_0\bigl(\{\omega \in \mathfrak{P}_0 : \sup_{0 \le s \le t} \omega(s) - as > \lambda\}\bigr) \le e^{-a\lambda}.$


Proof. The set whose measure is estimated in (4.30) is

$\bigl\{\omega \in \mathfrak{P}_0 : \sup_{0 \le s \le t} e^{a\omega(s) - a^2 s} > e^{a\lambda}\bigr\}.$

Since paths in $\mathfrak{P}_0$ are continuous, one can take the sup over $[0, t] \cap \mathbb{Q}$, which is countable, so (4.26) applies. Note that $E_0(\mathfrak{z}_t) = 1$.
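The bound (4.30) can be compared with a simulated probability. The sketch below is a rough illustration with hypothetical parameter choices ($a = \lambda = 1$, $t = 2$): paths start at 0, have increments of variance $2\,dt$ as elsewhere in this chapter, and are monitored only on a time grid, so the estimate is slightly low. It is meant to show the inequality holding with some room, not to measure how sharp it is.

```python
import math
import random

def prob_drifted_sup_exceeds(a, lam, t, n_paths=3000, n_steps=400, seed=3):
    """Estimate W_0(sup_{0<=s<=t} (w(s) - a*s) > lam) on a time grid."""
    rng = random.Random(seed)
    dt = t / n_steps
    sigma = math.sqrt(2.0 * dt)
    hits = 0
    for _ in range(n_paths):
        w = 0.0
        for j in range(1, n_steps + 1):
            w += sigma * rng.gauss(0.0, 1.0)
            if w - a * j * dt > lam:
                hits += 1
                break
    return hits / n_paths

def maximal_bound(a, lam):
    """Right side of (4.30)."""
    return math.exp(-a * lam)
```

For these parameters the estimate from `prob_drifted_sup_exceeds(1.0, 1.0, 2.0)` should fall below `maximal_bound(1.0, 1.0)`, which is about 0.368.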


We turn to a discussion of the strong Markov property of Brownian motion. For this, we need the notion of a stopping time. A function $\tau$ on $\mathfrak{P}_0$ with values in $[0, +\infty]$ is called a stopping time provided that, for each $t \ge 0$, $\{\omega \in \mathfrak{P}_0 : \tau(\omega) < t\}$ belongs to the $\sigma$-field $\mathfrak{B}_t$. It follows from (3.39) that $\vartheta_{\overline{O}}$ is a stopping time. So is $\vartheta_O$.

Given a stopping time $\tau$, define $\mathfrak{B}_{\tau+}$ to be the $\sigma$-algebra of sets $S \in \mathfrak{B}_\infty$ such that $S \cap \{\omega : \tau(\omega) < t\}$ belongs to $\mathfrak{B}_t$ for each $t \ge 0$. Note that $\tau$ is measurable with respect to $\mathfrak{B}_{\tau+}$. The hypothesis that $\tau$ is a stopping time means precisely that the whole set $\mathfrak{P}_0$ satisfies the criteria for membership in $\mathfrak{B}_{\tau+}$. We note that any $t \in [0, \infty)$, regarded as a constant function on $\mathfrak{P}_0$, is a stopping time and that, in this case, $\mathfrak{B}_{t+} = \bigcap_{s > t} \mathfrak{B}_s$.

The following analogue of Propositions 4.1 and 4.2 is one statement of the strong Markov property.


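Proposition 4.9. If $\tau$ is a stopping time such that $\tau(\omega) < \infty$ for $W_x$-almost all $\omega$, and if $t \ge 0$, then

(4.31)  $E_x\bigl(f(\omega(\tau + t)) \,|\, \mathfrak{B}_{\tau+}\bigr) = E_{\omega(\tau)}\bigl(f(\omega(t))\bigr),$

for $W_x$-almost all $\omega$. More generally, with

$(\sigma_\tau\omega)(t) = \omega(t + \tau),$

and $\varphi$ bounded and $\mathfrak{B}_\infty$-measurable, we have

(4.32)  $E_x(\varphi \circ \sigma_\tau \,|\, \mathfrak{B}_{\tau+}) = E_{\omega(\tau)}(\varphi),$

for $W_x$-almost all $\omega$.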

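As in (4.6), the content of (4.31) is that

(4.33)  $\int_S f(\omega(\tau + t))\,dW_x(\omega) = \int_S \Bigl(\int f(\omega^\#(t))\,dW_{\omega(\tau)}(\omega^\#)\Bigr) dW_x(\omega),$

given $S \in \mathfrak{B}_{\tau+}$. In other words, given that $S \cap \{\omega : \tau(\omega) < t'\} \in \mathfrak{B}_{t'}$, for each $t' \ge 0$. There is no loss in taking $x = 0$, and we can rewrite (4.33) as

(4.34)  $\int_S f(\omega(\tau + t))\,dW(\omega) = \int_S \int f\bigl(\omega^\#(t) + \omega(\tau)\bigr)\,dW(\omega^\#)\,dW(\omega).$

It is useful to approximate $\tau$ by discretization:

(4.35)  $\tau_\nu(\omega) = 2^{-\nu}k, \quad \text{if } 2^{-\nu}(k - 1) \le \tau(\omega) < 2^{-\nu}k.$

Thus

(4.36)  $\{\omega : \tau_\nu(\omega) < t\} = \{\omega : \tau(\omega) < 2^{-\nu}k\} \in \mathfrak{B}_t,$

so each $\tau_\nu$ is a stopping time. Note that

(4.37)  $A_{\nu k} = \{\omega : \tau_\nu(\omega) = 2^{-\nu}k\} = \{\omega : \tau(\omega) < 2^{-\nu}k\} \setminus \{\omega : \tau(\omega) < 2^{-\nu}(k-1)\}$

belongs to $\mathfrak{B}_{2^{-\nu}k}$.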
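The discretization (4.35) acts on each value of $\tau$ separately, so it can be written as a one-line function of a number. The sketch below (with a hypothetical function name) rounds a finite value `tau` up to the next point of the grid $2^{-\nu}\mathbb{Z}$ strictly above it, exactly as the condition $2^{-\nu}(k-1) \le \tau < 2^{-\nu}k$ prescribes.

```python
import math

def discretize_stopping_time(tau, nu):
    """Return tau_nu = 2^{-nu} k with 2^{-nu}(k - 1) <= tau < 2^{-nu} k."""
    k = math.floor(tau * 2 ** nu) + 1
    return k / 2 ** nu
```

For example, `discretize_stopping_time(0.3, 2)` returns `0.5`. As `nu` increases, the values decrease to `tau` while always staying strictly above it, with `tau_nu - tau <= 2**-nu`; this is the convergence used when passing from $\tau_\nu$ back to $\tau$.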
If $\tau$ is replaced by $\tau_\nu$, the left side of (4.34) becomes

(4.38)  $\sum_{k \ge 0} \int_{S \cap A_{\nu k}} f\bigl(\omega(t + 2^{-\nu}k)\bigr)\,dW(\omega),$

and the right side of (4.34) becomes

(4.39)  $\sum_{k \ge 0} \int_{S \cap A_{\nu k}} \int f\bigl(\omega^\#(t) + \omega(2^{-\nu}k)\bigr)\,dW(\omega^\#)\,dW(\omega).$

Note that if $S \in \mathfrak{B}_{\tau+}$, then $S \cap A_{\nu k} \in \mathfrak{B}_{2^{-\nu}k}$. Thus, the fact that each term in the sum (4.38) is equal to the corresponding term in (4.39) follows from (4.6). Consequently, we have

(4.40)  $\int_S f(\omega(\tau_\nu + t))\,dW(\omega) = \int_S \int f\bigl(\omega^\#(t) + \omega(\tau_\nu)\bigr)\,dW(\omega^\#)\,dW(\omega),$

for all $\nu$, if $S \in \mathfrak{B}_{\tau+}$. The desired identity (4.34) follows by taking $\nu \to \infty$, if $f \in C(\mathbb{R}^n)$. Passing from this to (4.32) is then done as in the proof of Proposition 4.2.

In particular, the extension of (4.31) analogous to (4.8), in the special case $F(x_1, x_2) = f(x_2 - x_1)$, yields the identity

(4.41)  $\int_S f\bigl(\omega(\tau + t) - \omega(\tau)\bigr)\,dW(\omega) = \int_S \int f(\omega^\#(t))\,dW(\omega^\#)\,dW(\omega) = E_0\bigl(f(\omega(t))\bigr) \cdot W(S),$

given $S \in \mathfrak{B}_{\tau+}$. This, together with the extension to $F(x_1, \dots, x_k)$, says that $\omega(\tau + t) - \omega(\tau) = \beta(t)$ has the probability distribution of a Brownian motion, independent of $\mathfrak{B}_{\tau+}$. This is a common form in which the strong Markov property is stated.

It is sometimes useful to consider stopping times for which $\{\omega : \tau(\omega) = \infty\}$ has positive measure. In such a case, the extension of Proposition 4.9 is that (4.32) holds for $W_x$-almost all $\omega$ in the set $\{\omega : \tau(\omega) < \infty\}$. Thus, for example, (4.33) and (4.34) hold, given $S \in \mathfrak{B}_{\tau+}$ and $S \subset \{\omega : \tau(\omega) < \infty\}$.

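We next look at some operator-theoretic properties of

(4.42)  $Q_t : L^2(\mathfrak{P}_0, dW_0) \to L^2(\mathfrak{P}_0, dW_0), \quad Q_t\varphi = E_0(\varphi \,|\, \mathfrak{B}_t),$
        $\Theta_t : L^2(\mathfrak{P}_0, dW_0) \to L^2(\mathfrak{P}_0, dW_0), \quad \Theta_t\varphi(\omega) = \varphi(\vartheta_t\omega),$

where $\vartheta_t$ is given by (4.11). For each $t \ge 0$, $Q_t$ is an orthogonal projection, and $Q_sQ_t = Q_tQ_s = Q_s$, for $s \le t$. Note that (4.13) implies

(4.43)  $Q_t\Theta_t = Q_0,$

since $Q_0$ is the orthogonal projection of $L^2(\mathfrak{P}_0, dW_0)$ onto

(4.44)  $R(Q_0) = \text{set of constant functions}.$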


Proposition 4.10. The family ©;, t € [0, co), is a strongly continuous semigroup 
of isometries of L?(3Bo,dWo), with 


(4.45) R(O1) C Ker(Q; — Qo) = {y: Eo(y|Bz) = const}. 


434 11. Brownian Motion and Potential Theory 
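Proof. That $\Theta_t$ is an isometry follows from the measure-preserving property
(4.14). If we apply $Q_0$ to (4.43), we get $Q_0\Theta_t = Q_0$; hence $(Q_t - Q_0)\Theta_t = 0$,
which yields (4.45).
The semigroup property follows from a straightforward calculation:
(4.46)  $\vartheta_\sigma\vartheta_s\omega = \vartheta_{\sigma+s}\omega \Longrightarrow \Theta_\sigma\Theta_s = \Theta_{\sigma+s}.$
The convergence
(4.47)  $\Theta_s\varphi \to \Theta_t\varphi$ in $L^2(\mathfrak{P}_0, dW_0)$, as $s \to t$,
is easy to demonstrate for $\varphi(\omega)$ of the form (1.8), that is,
(4.48)  $\varphi(\omega) = f\bigl(\omega(t_1), \dots, \omega(t_k)\bigr),$
with $f$ continuous on $\mathbb{R}^n \times \cdots \times \mathbb{R}^n$ ($k$ factors). In fact, $\varphi(\vartheta_s(\omega)) = \varphi_s(\omega) \to \varphi_t(\omega)$
boundedly and pointwise on $\mathfrak{P}_0$ for such $\varphi$. Since the set of $\varphi$ of the form
(4.48) is dense in $L^2(\mathfrak{P}_0, dW_0)$, (4.47) follows.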


Proposition 4.11. The family of orthogonal projections $Q_t$ is strongly continuous
in $t \in [0, \infty)$.


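Proof. It is easy to verify that, for any $\varphi \in L^2(\mathfrak{P}_0, dW_0)$,
(4.49)  $Q_s\varphi \to Q_{t-}\varphi = E_0(\varphi\,|\,\mathfrak{B}_{t-})$, as $s \nearrow t$,

provided $t > 0$, and

(4.50)  $Q_s\varphi \to Q_{t+}\varphi = E_0(\varphi\,|\,\mathfrak{B}_{t+})$, as $s \searrow t$,

where

(4.51)  $\mathfrak{B}_{t-} = \sigma\Bigl(\bigcup_{s<t}\mathfrak{B}_s\Bigr), \qquad \mathfrak{B}_{t+} = \bigcap_{s>t}\mathfrak{B}_s.$

It is also easy to verify that $\mathfrak{B}_{t-} = \mathfrak{B}_t$, for $t > 0$, so $Q_s\varphi \to Q_t\varphi$ as $s \nearrow t$. On
the other hand, it is not true that $\mathfrak{B}_{t+} = \mathfrak{B}_t$, so the continuity of $Q_t\varphi$ from above
requires more work.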

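Suppose $t_j \in \mathbb{Q}^+$ and

(4.52)  $0 \le t_1 < t_2 < \cdots < t_\ell \le t < t_{\ell+1} < \cdots < t_{\ell+k}.$
Let $f_j \in C(\dot{\mathbb{R}}^n)$. Consider any function on $\mathfrak{P}$ of the form

(4.53)  $\varphi(\omega) = A_\ell(\omega)\,B_{k\ell}(\omega),$
        $A_\ell = f_1(\omega(t_1)) \cdots f_\ell(\omega(t_\ell)), \quad B_{k\ell} = f_{\ell+1}(\omega(t_{\ell+1})) \cdots f_{\ell+k}(\omega(t_{\ell+k})).$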




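Denote by $\mathcal{C}^\#$ the linear span of the set of such functions. For $\varphi$ of the form (4.53),
we have

(4.54)  $E_0(\varphi\,|\,\mathfrak{B}_t) = A_\ell(\omega)\,E_0(B_{k\ell}\,|\,\mathfrak{B}_t).$

If $t_{\ell+\nu} = t + s_\nu$, $1 \le \nu \le k$, we have, by (4.8),

(4.55)  $E_0(B_{k\ell}\,|\,\mathfrak{B}_t) = E_{\omega(t)}\bigl(f_{\ell+1}(\omega(s_1)) \cdots f_{\ell+k}(\omega(s_k))\bigr)$, a.e. on $\mathfrak{P}_0$.
Now, if $t < t + h < t_{\ell+1}$, we also have

(4.56)  $E_0(\varphi\,|\,\mathfrak{B}_{t+h}) = A_\ell(\omega)\,E_0(B_{k\ell}\,|\,\mathfrak{B}_{t+h}) = A_\ell(\omega)\,E_{\omega(t+h)}(\psi_{k\ell}),$

where

(4.57)  $\psi_{k\ell}(\omega) = f_{\ell+1}(\omega(s_1 - h)) \cdots f_{\ell+k}(\omega(s_k - h)).$

Now, as in (1.9),

(4.58)  $E_x(\psi_{k\ell}) = \int\cdots\int p(s_1 - h, x_1 - x)\,p(s_2 - s_1, x_2 - x_1) \cdots p(s_k - s_{k-1}, x_k - x_{k-1})\, f_{\ell+1}(x_1) \cdots f_{\ell+k}(x_k)\,dx_k \cdots dx_1.$

The continuity in $(x, h)$ is clear. Since paths in $\mathfrak{P}_0$ are continuous, we have, by
linearity, that

(4.59)  $\varphi \in \mathcal{C}^\# \Longrightarrow E_0(\varphi\,|\,\mathfrak{B}_t) = \lim_{h \searrow 0} E_0(\varphi\,|\,\mathfrak{B}_{t+h})$, $W_0$-a.e.

Now the Stone-Weierstrass theorem implies that $\mathcal{C}^\#$ is dense in $C(\mathfrak{P})$, which is
dense in $L^2(\mathfrak{P}, dW_0) = L^2(\mathfrak{P}_0, dW_0)$. Thus we have

(4.60)  $E_0(\varphi\,|\,\mathfrak{B}_{t+}) = E_0(\varphi\,|\,\mathfrak{B}_t)$, $W_0$-a.e.,

for every $\varphi \in L^2(\mathfrak{P}_0, dW_0)$, and the proposition is proved.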


Exercises 


1. Show that the martingale maximal inequality applied to rz(w) = w(t) yields 
1 


Wo({w € Po: sup w(s) > ) > b/4t/n}) < 7 
O<s<t 


Compare with the precise result in (3.29). 




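2. With $\mathfrak{B}_{t-}$ characterized by (4.51), show that $\mathfrak{B}_{t-} = \mathfrak{B}_t$, as stated in the proof of
Proposition 4.11. (Hint: In the characterization (4.1) of $\mathfrak{B}_t$, one can restrict attention to
$E$ open in $\mathbb{R}^n$.)
3. Using (4.60), show that
$S \in \mathfrak{B}_{0+} \Longrightarrow W_0(S) = 0$ or $1$.
This is called Blumenthal's 0-1 law. If $E \subset \mathbb{R}^n$ is a closed set, show that
$\{\omega \in \mathfrak{P}_0 : \omega(t_\nu) \in E \text{ for some } t_\nu \searrow 0\}$
is a set in $\mathfrak{B}_{0+}$. (Hint: Consider $\{\omega \in \mathfrak{P}_0 : \operatorname{dist}(\omega(t), E) \ge \delta > 0 \text{ for } t \in [2^{-\nu}\varepsilon, \varepsilon] \cap \mathbb{Q}\} = S(E, \delta, \varepsilon, \nu)$.)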
4. Let N be the collection of (Wo-outer measurable) subsets of 80 with Wo-measure 


zero. Form the family of o-algebras Bi = B, UN, called the augmentation of Bt. 
Show that Bi > B+ and, with notation parallel to (4.51), 


Sasa. 


Note: The augmentation of 8; is bigger than the completion of B+. 
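5. Let $\mathfrak{A}_t$ be the $\sigma$-algebra of subsets of $\mathfrak{P}_0$ generated by sets of the form (4.1) for $s \ge t$,
and set $\mathfrak{A}_\infty = \bigcap_{t>0} \mathfrak{A}_t$. Using Blumenthal's 0-1 law and Exercise 2 of §1, show that

$S \in \mathfrak{A}_\infty \Longrightarrow W_0(S) = 0$ or $1$.
If $E \subset \mathbb{R}^n$ is a closed set, show that
$\{\omega \in \mathfrak{P}_0 : \omega(t_\nu) \in E \text{ for some } t_\nu \nearrow \infty\}$

is a set in $\mathfrak{A}_\infty$.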


5. First exit time and the Poisson integral 


At the end of §3 we produced a formula for PI $f$, giving the solution $u$ to
(5.1)  $\Delta u = 0$ in $\Omega$, $\quad u = f$ on $\partial\Omega$,

at least in case $\Omega$ is a bounded domain in $\mathbb{R}^n$ with smooth boundary. Here we
produce a formula that is somewhat neater than (3.36) and that is also amenable
to extension to general bounded, open $\Omega \subset \mathbb{R}^n$, with no smoothness assumed on
$\partial\Omega$. In the smooth case, the formula is

(5.2)  $\operatorname{PI} f(x) = E_x\bigl(f(\omega(\vartheta_\Omega))\bigr), \quad x \in \Omega,$

where $\vartheta_\Omega(\omega)$ is the first exit time defined by (3.24).

From an intuitive point of view, the formula (5.2) has a very easy and natural
justification. To show that the right side of (5.2), which we denote by $u(x)$, is
harmonic on $\Omega$, it suffices to verify the mean-value property. Let $x \in \Omega$ be the




center of a closed ball $B \subset \Omega$. We claim that $u(x)$ is equal to the mean value of
$u|_{\partial B}$. Indeed, a continuous path $\omega$ starting from $x$ and reaching $\partial\Omega$ must cross
$\partial B$, say at a point $y = \omega(\vartheta_B)$. The future behavior of such paths is independent
of their past, so the probability distribution of the first contact point $\omega(\vartheta_\Omega)$, when
averaged over starting points in $\partial B$, should certainly coincide with the probability
distribution of such a first contact point in $\partial\Omega$, for paths starting at $x$ (the
distribution of whose first contact point with $\partial B$ must be constant, by symmetry).

The key to converting this into a mathematical argument is to note that the time
$\vartheta_B(\omega)$ is not constant, so one needs to make use of the strong Markov property
as a tool to establish the mean-value property of the function $u(x)$ defined by the
right side of (5.2).

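Let us first make some comments on the right side $u(x)$ of (5.2). By (3.40) we
have

(5.3)  $\Bigl| u(x) - \int_{\mathfrak{P}_{0,N}} f(\omega(\vartheta_\Omega))\,dW_x(\omega) \Bigr| \le C\,\|f\|_{L^\infty(\partial\Omega)}\,N^{-1}.$

Let us extend $f \in C(\partial\Omega)$ to an element $f \in C_o(\mathbb{R}^n)$, without increasing the sup
norm. By (3.41), we have

(5.4)  $f(\omega(\vartheta_\Omega)) = \lim_{\nu\to\infty} f\bigl(\omega(\vartheta_{\nu,N}(\omega))\bigr)$, for $\omega \in \mathfrak{P}_{0,N}$,

where $\vartheta_{\nu,N}(\omega) = \min\bigl(N, \inf\{s \in 2^{-\nu}\mathbb{Z}^+ : \omega(s) \notin \Omega\}\bigr)$. Thus, if the integral in
(5.3) is denoted by $u_N(x)$, then

(5.5)  $u_N(x) = \lim_{\nu\to\infty} u_{N\nu}(x) = \lim_{\nu\to\infty} \int f\bigl(\omega(\vartheta_{\nu,N}(\omega))\bigr)\,dW_x(\omega).$

Here the limit exists pointwise in $x \in \Omega$. Now each $u_{N\nu}$ is continuous on $\Omega$,
indeed on $\mathbb{R}^n$. Consequently, $u(x)$ given by the right side of (5.2) is at least a
bounded, measurable function of $x$.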

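To continue the analysis, given $x \in \Omega$, we define a probability measure $\nu_{x,\Omega}$
on $\partial\Omega$ by

(5.6)  $E_x\bigl(f(\omega(\vartheta_\Omega))\bigr) = \int_{\partial\Omega} f(y)\,d\nu_{x,\Omega}(y),$

for $f \in C(\partial\Omega)$.


Lemma 5.1. If $x \in \mathcal{O} \subset\subset \Omega$ and $\mathcal{O}$ and $\Omega$ are open, then

(5.7)  $\nu_{x,\Omega} = \int_{\partial\mathcal{O}} \nu_{y,\Omega}\,d\nu_{x,\mathcal{O}}(y).$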
00 




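Proof. The identity (5.7) is equivalent to the statement that, for $f \in C(\partial\Omega)$,
(5.8)  $E_x\bigl(f(\omega(\vartheta_\Omega))\bigr) = \int_{\partial\mathcal{O}} E_y\bigl(f(\omega(\vartheta_\Omega))\bigr)\,d\nu_{x,\mathcal{O}}(y).$
The right side is equal to
(5.9)  $E_x\bigl(g(\omega(\vartheta_\mathcal{O}))\bigr), \quad g(y) = E_y\bigl(f(\omega(\vartheta_\Omega))\bigr).$
In other words,
(5.10)  $g(\omega(\vartheta_\mathcal{O})) = E_{\omega(\vartheta_\mathcal{O})}(\varphi), \quad \varphi(\omega) = f\bigl(\omega(\vartheta_\Omega(\omega))\bigr).$
Now we use the strong Markov property, in the form (4.32), namely,
$E_{\omega(\tau)}(\varphi) = E_x(\varphi \circ \sigma_\tau\,|\,\mathfrak{B}_{\tau+}),$

for $W_x$-almost all $\omega$, where $(\sigma_\tau\omega)(t) = \omega(t + \tau)$ and $\tau$ is a stopping time. This
implies

(5.11)  $\int_{\mathfrak{P}_0} E_{\omega(\tau)}(\varphi)\,dW_x(\omega) = \int_{\mathfrak{P}_0} E_x(\varphi \circ \sigma_\tau\,|\,\mathfrak{B}_{\tau+})\,dW_x(\omega) = E_x(\varphi \circ \sigma_\tau).$

Applied to $\tau = \vartheta_\mathcal{O}$, this shows that (5.9) is equal to $E_x(\varphi \circ \sigma_{\vartheta_\mathcal{O}})$. Now, with
$\tilde\omega(t) = \sigma_{\vartheta_\mathcal{O}}\omega(t) = \omega(t + \vartheta_\mathcal{O}(\omega))$, we have, for $\mathcal{O} \subset\subset \Omega$, $\vartheta_\Omega(\tilde\omega) = \vartheta_\Omega(\omega) - \vartheta_\mathcal{O}(\omega)$,
as long as $\omega$ is a continuous path starting in $\mathcal{O}$. Hence

(5.12)  $\varphi(\tilde\omega) = f\bigl(\tilde\omega(\vartheta_\Omega(\omega) - \vartheta_\mathcal{O}(\omega))\bigr) = f\bigl(\omega(\vartheta_\Omega(\omega))\bigr) = \varphi(\omega).$

Thus (5.9) is equal to $E_x(\varphi)$, which is the left side of (5.6), and the lemma is
proved.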


Consequently, the right side $u(x)$ of (5.2) is a bounded, measurable function
of $x$ satisfying the mean-value property. An integration yields that such $u(x)$ is
equal to the mean value of $u$ over any ball $D \subset \Omega$, centered at $x$, from which it
follows that $u(x)$ is continuous in $\Omega$. Then the mean-value property guarantees
that $u$ is harmonic on $\Omega$. To verify (5.2), it remains to show that $u(x)$ has the
correct boundary values.


Lemma 5.2. Assume $\partial\Omega$ is smooth. Given $y \in \partial\Omega$, we have $u(y) = f(y)$, and $u$
is continuous at $y \in \overline{\Omega}$.


Proof. That $u(y) = f(y)$ follows from the fact that $\vartheta_\Omega(\omega) = 0$ for $W_y$-almost
all $\omega$, according to Proposition 3.7. To show that $u(x) \to u(y)$ as $x \to y$ from
within $\Omega$, we argue as follows.

By (3.23), for $x \in \Omega$, $E_x(\vartheta_\Omega) = -\Delta^{-1}1(x)$. Hence this quantity approaches
0 as $x \to y$. Thus, given $\varepsilon_j > 0$, there exists $\delta > 0$ such that

(5.13)  $|x - y| \le \delta \Longrightarrow W_x\bigl(\{\omega : \vartheta_\Omega(\omega) > \varepsilon_1\}\bigr) < \varepsilon_2.$


Meanwhile, in a short time, 0 < s < €1, a path w(s) is not likely to wander far. In 
fact, by (3.28) plus a scaling argument, 


W., ={w € Bo: sup |w(s) —w(0)| > et/7} 
(5.14) 0<s<e1 
= Wz(We,) < w(e1), 


where w(e) > Oase > 0. 
Thus, if |2 — y| < 6, with probability > 1 — e2 — w(e1), a path starting at 


x will, within time ¢,, hit OO, without leaving the ball Bays (x) of radius ey! a 


centered at x. Now, a given f € C(0Q) varies only a little over {z € OD : 


lz—y|< ef +6} if €, and 6 are small enough. Therefore, indeed u(x) > u(y), 


as © —> y. 
We have completed the demonstration of the following. 


Proposition 5.3. If $\Omega$ is a bounded region in $\mathbb{R}^n$ with smooth boundary and $f \in C(\partial\Omega)$,
then PI $f$ is given by (5.2).
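Formula (5.2) can be checked numerically in a simple case. The sketch below assumes $\Omega$ is the unit disk in $\mathbb{R}^2$ (a choice made here for illustration) and compares a quadrature of the classical Poisson kernel with a Monte Carlo estimate of $E_x(f(\omega(\vartheta_\Omega)))$. The sampler is the walk-on-spheres method, which is not described in the text: it repeatedly uses the strong Markov property together with the fact that the exit point from a ball centered at the starting point is uniformly distributed on its boundary. All function names are hypothetical.

```python
import math
import random


def poisson_integral_disk(f, x, m=2000):
    """Midpoint-rule quadrature of the Poisson integral on the unit disk.

    f is the boundary datum as a function of the polar angle;
    x = (x1, x2) is a point with |x| < 1.
    """
    r2 = x[0] ** 2 + x[1] ** 2
    total = 0.0
    for k in range(m):
        angle = 2.0 * math.pi * (k + 0.5) / m
        dist2 = (x[0] - math.cos(angle)) ** 2 + (x[1] - math.sin(angle)) ** 2
        total += (1.0 - r2) / dist2 * f(angle)
    return total / m


def sample_exit_angle(x, rng, eps=1e-4):
    """Approximate sample of the polar angle of the exit point of a path from x."""
    px, py = x
    while True:
        d = 1.0 - math.hypot(px, py)  # radius of the largest ball around (px, py) inside the disk
        if d < eps:                   # within eps of the boundary: project radially and stop
            return math.atan2(py, px)
        phi = rng.uniform(0.0, 2.0 * math.pi)  # exit point from that ball is uniform on its boundary
        px += d * math.cos(phi)
        py += d * math.sin(phi)


def expected_boundary_value(f, x, n_paths=10000, seed=0):
    """Monte Carlo estimate of E_x f(omega(exit time)) with a fixed seed."""
    rng = random.Random(seed)
    return sum(f(sample_exit_angle(x, rng)) for _ in range(n_paths)) / n_paths


if __name__ == "__main__":
    x = (0.3, 0.2)
    print(poisson_integral_disk(math.cos, x), expected_boundary_value(math.cos, x))
```

For boundary data $f = \cos$ of the polar angle, the harmonic extension is $u(x) = x_1$, so both numbers should be close to the first coordinate of the starting point, up to Monte Carlo error.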


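Recall from §5 of Chap. 5 the construction of
(5.15)  $\operatorname{PI} : C(\partial\Omega) \longrightarrow L^\infty(\Omega) \cap C^\infty(\Omega)$

when $\Omega$ is an arbitrary bounded, open subset of $\mathbb{R}^n$, with perhaps a very nasty
boundary. As shown there, we can take

(5.16)  $\Omega_1 \subset\subset \Omega_2 \subset\subset \cdots \subset\subset \Omega_j \nearrow \Omega$

such that each boundary $\partial\Omega_j$ is smooth, and, if $f$ is extended from $\partial\Omega$ to an
element of $C_o(\mathbb{R}^n)$, then
(5.17)  $x \in \Omega \Longrightarrow \operatorname{PI} f(x) = \lim_{j\to\infty} u_j(x),$

where $u_j \in C(\overline{\Omega}_j)$ is the Poisson integral of $f|_{\partial\Omega_j}$. In (5.17) one has uniform
convergence on compact sets $K \subset \Omega$, the right side being defined for $j \ge j_0$,
where $K \subset \Omega_{j_0}$. The details were carried out in Chap. 5 for $f \in C^\infty(\mathbb{R}^n)$, but
approximation by smooth functions plus use of the maximum principle readily
extends this to $f \in C_o(\mathbb{R}^n)$.

If we apply Proposition 5.3 to $\Omega_j$, we conclude that, for $f \in C_o(\mathbb{R}^n)$, $x \in \Omega$,

(5.18)  $\operatorname{PI} f(x) = \lim_{j\to\infty} E_x\bigl(f(\omega(\vartheta_{\Omega_j}))\bigr).$

On the other hand, it is straightforward from the definitions that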


(5.19)  $\vartheta_{\Omega_j}(\omega) \nearrow \vartheta_\Omega(\omega)$, for all $\omega \in \mathfrak{P}_0$.


Therefore, via the dominated convergence theorem, we can pass to the limit in 
(5.18), proving the following. 


Proposition 5.4. If $\Omega$ is any bounded, open region in $\mathbb{R}^n$ and $f \in C(\partial\Omega)$, then


(5.20)  $\operatorname{PI} f(x) = E_x\bigl(f(\omega(\vartheta_\Omega))\bigr), \quad x \in \Omega.$


We recall from Chap. 5 the notion of a regular boundary point. A point
$y \in \partial\Omega$ is regular provided PI $f$ is continuous at $y$, for all $f \in C(\partial\Omega)$.
We discussed several criteria for a boundary point to be regular, particularly in
Propositions 5.11-5.16 of Chap. 5. Here is another criterion.


Proposition 5.5. If $\Omega \subset \mathbb{R}^n$ is a bounded open set, $y \in \partial\Omega$, then $y$ is a regular
boundary point if and only if


(5.21)  $E_x(\vartheta_\Omega) \to 0$, as $x \to y$, $x \in \Omega$.


Proof. Recall from (3.20) that $E_x(\vartheta_\Omega) = -\Delta^{-1}1(x)$. Thus (5.21) holds if and
only if this function is a weak barrier at $y \in \partial\Omega$, as defined in Chap. 5, right
after (5.26). Therefore, (5.21) here implies $y$ is a regular point. On the other hand,
$\Delta^{-1}1(x)$ can be written as the sum $x_1^2/2 + u_0(x)$, where $u_0 = -(1/2)\operatorname{PI}(x_1^2|_{\partial\Omega})$,
so if (5.21) fails, $y$ is not a regular point.
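As a concrete instance of (5.21), the following worked computation assumes $\Omega$ is the ball of radius $R$ centered at the origin (a choice made for this illustration, not taken from the text). It shows that $E_x(\vartheta_\Omega)$ is explicit there and tends to zero at every boundary point, consistent with every boundary point of a ball being regular.

```latex
% Assumption for illustration: \Omega = \{x \in \mathbb{R}^n : |x| < R\}.
% Since \Delta |x|^2 = 2n, the function v(x) = (|x|^2 - R^2)/(2n) satisfies
\Delta v = 1 \ \text{in } \Omega, \qquad v = 0 \ \text{on } \partial\Omega,
% so v = \Delta^{-1} 1, and therefore
E_x(\vartheta_\Omega) = -\Delta^{-1}1(x) = \frac{R^2 - |x|^2}{2n} \longrightarrow 0
\quad \text{as } x \to y \in \partial\Omega .
% Consistency with the decomposition used in the proof:
% PI(x_1^2|_{\partial\Omega}) = x_1^2 - (|x|^2 - R^2)/n, hence
\frac{x_1^2}{2} + u_0(x)
  = \frac{x_1^2}{2} - \frac12\Bigl(x_1^2 - \frac{|x|^2 - R^2}{n}\Bigr)
  = \frac{|x|^2 - R^2}{2n}.
```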


One might both compare and contrast this proof with that of Lemma 5.2. In
that case, where $\partial\Omega$ was assumed smooth, the known regularity of each boundary
point was exploited to guarantee that $E_x(\vartheta_\Omega) \to 0$ as $x \to y \in \partial\Omega$, which then
was exploited to show that $u(x) \to u(y)$ as $x \to y$.

In the next section, we will derive another criterion for y to be regular, in terms 
of “capacity.” 


Exercises 


1. Explore connections between the formulas for PI $f(x)$, for $f \in C(\partial\Omega)$, when $\Omega$ is
bounded and $\partial\Omega$ smooth, given by (3.36) and by (5.2), respectively.


6. Newtonian capacity 


The (Newtonian) capacity of a set is a measure of size that is very important in 
potential theory and closely related to the probability of a Brownian path hitting 
that set. In our development here, we restrict attention to the case $n \ge 3$ and define
the capacity of a compact set $K \subset \mathbb{R}^n$. We first assume that $K$ is the closure of
an open set with smooth boundary. 




Proposition 6.1. Assume $n \ge 3$. If $K \subset \mathbb{R}^n$ is compact with smooth boundary
$\partial K$, then there exists a unique function $U_K$, harmonic on $\mathbb{R}^n \setminus K$, such that
$U_K(x) \to 1$ as $x \to K$ and $U_K(x) \to 0$ as $|x| \to \infty$.


Proof. We can assume that the origin $0 \in \mathbb{R}^n$ is in the interior of $K$. Then the
inversion $\psi(x) = x/|x|^2$ interchanges 0 and the point at infinity, and the
transformation


(6.1)  $v(x) = |x|^{-(n-2)}\,w(|x|^{-2}x)$


preserves harmonicity. We let $w$ be the unique harmonic function on the bounded
domain $\psi(\mathbb{R}^n \setminus K)$, with boundary value $w(x) = |x|^{-(n-2)}$ on $\psi(\partial K)$. Then
$v$, defined by (6.1), is the desired solution. The uniqueness is immediate, via the
maximum principle.
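The claim that the transformation (6.1) preserves harmonicity can be spot-checked numerically. The sketch below is only an illustration with hypothetical helper names: it applies (6.1) to a harmonic polynomial in $\mathbb{R}^3$ and evaluates a central-difference Laplacian at a point away from the origin. A small finite-difference value is consistent with harmonicity but is of course not a proof.

```python
import math


def kelvin_transform(w, n):
    """Return v(x) = |x|^{-(n-2)} w(x / |x|^2), as in (6.1)."""
    def v(x):
        r2 = sum(c * c for c in x)
        return r2 ** (-(n - 2) / 2.0) * w([c / r2 for c in x])
    return v


def laplacian_fd(u, x, h=1e-3):
    """Second-order central-difference approximation of the Laplacian of u at x."""
    total = 0.0
    for i in range(len(x)):
        xp = list(x)
        xm = list(x)
        xp[i] += h
        xm[i] -= h
        total += u(xp) - 2.0 * u(x) + u(xm)
    return total / h ** 2


if __name__ == "__main__":
    w = lambda z: z[0] * z[1]          # a harmonic polynomial
    v = kelvin_transform(w, 3)
    point = [1.2, -0.7, 0.9]
    print(laplacian_fd(v, point))      # should be close to zero
```

With $w \equiv 1$ the same transformation gives $v(x) = |x|^{-(n-2)}$, which is the capacitary potential of the unit ball outside that ball.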


Note that the construction yields
(6.2)  $|U_K(x)| \le C|x|^{-(n-2)}, \quad |\partial_r U_K(x)| \le C|x|^{-(n-1)}, \quad |x| \to \infty.$


The $n = 3$ case of this result was done in §1 of Chap. 9.

Another approach to the proof of Proposition 6.1 would be to represent $U_K(x)$
as a single-layer potential, as in (11.44) of Chap. 7. This was noted in a remark
after the proof of Proposition 11.5 in that chapter.

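Now that we have established the existence of such $U_K$, Exercises 7-9 of §3
apply, to yield

(6.3)  $U_K^t(x) \nearrow U_K(x)$, as $t \nearrow \infty$,
where, for $x \in \mathcal{O} = \mathbb{R}^n \setminus K$,

(6.4)  $U_K^t(x) = 1 - e^{t\Delta_\mathcal{O}}1(x) = W_x\bigl(\{\omega : \vartheta_\mathcal{O}(\omega) < t\}\bigr).$

Here, $\Delta_\mathcal{O}$ is the Laplace operator on $\mathcal{O}$, with Dirichlet boundary condition. The
last identity follows from (3.27). We can replace the first exit time $\vartheta_\mathcal{O}$ by the first
hitting time:

(6.5)  $\mathfrak{h}_K(\omega) = \vartheta_{\mathbb{R}^n \setminus K}(\omega).$
Consequently,
(6.6)  $U_K(x) = W_x\bigl(\{\omega : \mathfrak{h}_K(\omega) < \infty\}\bigr);$

that is, for $x \in \mathcal{O}$, $U_K(x)$ is the probability that a Brownian path $\omega$, starting at $x$,
eventually hits $K$.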




We set $U_K(x) = 1$ for $x \in K$. Then (6.6) holds for $x \in K$ also. It follows
that $U_K \in C_o(\mathbb{R}^n)$, and $\Delta U_K$ is a distribution supported on $\partial K$. In fact, Green's
formula yields, for $\varphi \in C_0^\infty(\mathbb{R}^n)$,

(6.7)  $(U_K, \Delta\varphi) = -\int_{\partial K} \varphi(y)\,\partial_\nu U_K(y)\,dS(y),$

where $\nu$ is the unit normal to $\partial K$, pointing into $K$. By Zaremba's principle,
$\partial_\nu U_K(y) > 0$, for all $y \in \partial K$, so we see that $\Delta U_K = -\mu_K$, where $\mu_K$ is a
positive measure supported on $\partial K$. The total mass of $\mu_K$ is called the capacity
of $K$:

(6.8)  $\operatorname{cap} K = \int_K d\mu_K(x).$

Since, with $C_n = [(n-2)\cdot\operatorname{Area}(S^{n-1})]^{-1}$,

(6.9)  $U_K(x) = -\Delta^{-1}\mu_K = C_n \int |x - y|^{-(n-2)}\,d\mu_K(y),$
we have

(6.10)  $C_n \iint \frac{d\mu_K(x)\,d\mu_K(y)}{|x - y|^{n-2}} = \int U_K(x)\,d\mu_K(x) = \operatorname{cap} K,$

the left side being proportional to the potential energy of a collection of charged
particles, with density $d\mu_K$, interacting by a repulsive force with potential $C_n|x - y|^{-(n-2)}$.
The function $U_K(x)$ is called the capacitary potential of $K$. Note that
we can also use Green's theorem to get

(6.11)  $\|\nabla U_K\|^2_{L^2(\mathbb{R}^n)} = \int_K U_K(x)\,d\mu_K(x) = \operatorname{cap} K.$
K 
Note that if $K_1 \subset K_2$ have capacitary potentials $U_j$, $\Delta U_j = -\mu_j$, then $U_2 = 1$
on $K_1$, so

(6.12)  $\operatorname{cap} K_1 = \int U_2(x)\,d\mu_1(x) = -(U_2, \Delta U_1) = \int U_1(x)\,d\mu_2(x) \le \operatorname{cap} K_2,$

since $U_1(x) \le 1$. Thus capacity is a monotone set function.
Before establishing more formulas involving capacity, we extend it to general
compact $K \subset \mathbb{R}^n$. We can write $K = \bigcap_j K_j$, where $K_1 \supset\supset K_2 \supset\supset \cdots \supset\supset K_j \searrow K$,
each $K_j$ being compact with smooth boundary. Clearly, $U_j = U_{K_j}$ is a
decreasing sequence of functions $\le 1$, and by (6.11), $\nabla U_j$ is bounded in $L^2(\mathbb{R}^n)$.
Furthermore, $\Delta U_j = -\mu_j$, where $\mu_j$ is a positive measure supported on $\partial K_j$, of
total mass $\operatorname{cap} K_j$, which is nonincreasing, by (6.12). Consequently, we have a
limit:

(6.13)  $\lim_{j\to\infty} U_j = U_K,$

defined a priori pointwise, but also holding in various topologies, such as the
weak* topology of $L^\infty(\mathbb{R}^n)$. We have $U_K \in L^\infty(\mathbb{R}^n)$, $0 \le U_K(x) \le 1$; $\nabla U_K \in L^2(\mathbb{R}^n)$,
and $\Delta U_K = -\mu$, where $\mu$ is a positive measure, supported on $K$. Furthermore,
$\mu_j \to \mu$ in the weak* topology, and $U_K = -\Delta^{-1}\mu$. Any neighborhood
of $K$ contains some $K_j$. Thus, if $K_1' \supset\supset K_2' \supset\supset \cdots \supset\supset K_j' \searrow K$ is another
choice, one is seen to obtain the same limit $U_K$, hence the same measure $\mu$, which
we denote as $\mu_K$. We set

(6.14)  $\operatorname{cap} K = \int d\mu_K.$


Note that, as in (6.12), $\operatorname{cap} K = \int U_j(x)\,d\mu_K(x)$, for each $j$. Thus, as before,
$\operatorname{cap} K = \int U_K(x)\,d\mu_K(x)$, this time by the monotone convergence theorem.
Consequently,

(6.15)  $U_K(x) = 1$ $\mu_K$-almost everywhere.
Clearly, $\operatorname{cap} K \le \inf \operatorname{cap} K_j$. In fact, we claim
(6.16)  $\operatorname{cap} K = \inf \operatorname{cap} K_j.$

This is easy to see; $\mu_j$ converges to $\mu_K$ pointwise on $C_o(\mathbb{R}^n)$; choose $g \in C_o(\mathbb{R}^n)$,
equal to 1 on $K_1$; then

(6.17)  $\operatorname{cap} K = (g, \mu_K) = \lim (g, \mu_j) = \lim \operatorname{cap} K_j,$

proving (6.16). We consequently extend the monotonicity property:

Proposition 6.2. For general compact $K \subset L$, we have $\operatorname{cap} K \le \operatorname{cap} L$.

Proof. We can take compact approximants with smooth boundary, $K_j \searrow K$,
$L_j \searrow L$, such that $K_j \subset L_j$. By (6.12) we have $\operatorname{cap} K_j \le \operatorname{cap} L_j$, and this
persists in the limit by (6.16). We also have $U_K(x) \le U_L(x)$ for all $x$. Using
(6.15), we obtain

(6.18)  $\operatorname{cap} K = \int U_L(x)\,d\mu_K(x).$




One possibility is that $\operatorname{cap} K = 0$. This happens if and only if $\mu_K = 0$, thus
if and only if $U_K = 0$ almost everywhere. If $\operatorname{cap} K > 0$, we continue to call $U_K$
the capacitary potential of $K$.

We record some more ways in which $U_j \to U_K$. First, it certainly holds in
the weak* topology on $L^\infty(\mathbb{R}^n)$. Hence $\nabla U_j \to \nabla U_K$ in $\mathcal{D}'(\mathbb{R}^n)$. By (6.11),
$\nabla U_j$ is bounded in $L^2(\mathbb{R}^n)$; hence $\nabla U_j \to \nabla U_K$ weakly in $L^2(\mathbb{R}^n)$. Since also
$U_j \in C_o(\mathbb{R}^n)$, we have

(6.19)  $\|\nabla U_K\|^2_{L^2} = \lim_{j\to\infty} (\nabla U_j, \nabla U_K) = \lim_{j\to\infty} -(U_j, \Delta U_K) = \lim_{j\to\infty} \int U_j(x)\,d\mu_K(x) = \operatorname{cap} K,$

the last identity holding as in the derivation of (6.15). Thus (6.11) is extended to
general compact $K$. Furthermore, this implies

(6.20)  $\nabla U_j \to \nabla U_K$ in $L^2(\mathbb{R}^n)$-norm.
Hence
(6.21)  $\mu_j \to \mu_K$ in $H^{-1}(\mathbb{R}^n)$-norm.


We now extend the identities (6.3) and (6.6) to general compact K, in reverse 
order. 


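Proposition 6.3. The identity (6.6) holds for general compact $K \subset \mathbb{R}^n$.


Proof. Since (6.6) has been established for the compact $K_j$ with smooth boundary,
we have

(6.22)  $1 - U_j(x) = W_x(\mathfrak{A}_{K_j}), \quad \mathfrak{A}_{K_j} = \{\omega \in \mathfrak{P}_0 : \omega(\mathbb{R}^+) \subset \mathbb{R}^n \setminus K_j\}.$

Clearly, if $K_j \searrow K$, $\mathfrak{A}_{K_1} \subset \mathfrak{A}_{K_2} \subset \cdots \subset \mathfrak{A}_{K_j} \nearrow \widetilde{\mathfrak{A}}_K$, where $\widetilde{\mathfrak{A}}_K$ is a proper
subset of $\mathfrak{A}_K = \{\omega \in \mathfrak{P}_0 : \omega(\mathbb{R}^+) \subset \mathbb{R}^n \setminus K\}$. However, for $n \ge 3$, Brownian
motion is nonrecurrent, as was established in Exercise 10 of §3. Thus $|\omega(t)| \to \infty$
as $t \to \infty$, for $W_x$-almost all $\omega$, so in fact $W_x(\mathfrak{A}_K \setminus \widetilde{\mathfrak{A}}_K) = 0$, and hence
$1 - U_K(x) = W_x(\mathfrak{A}_K)$, which is equivalent to (6.6).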


Proposition 6.4. The identity (6.3) holds for general compact $K \subset \mathbb{R}^n$.
Proof. We define $U_K^t(x)$ to be $1 - e^{t\Delta_\mathcal{O}}1(x)$, as in (6.4); the second identity in
(6.4) continues to hold, by (3.27). Now, clearly, the family of sets $S_t = \{\omega \in \mathfrak{P}_0 : \mathfrak{h}_K(\omega) < t\}$
is increasing as $t \nearrow \infty$, with union

$\bigcup_t S_t = \{\omega \in \mathfrak{P}_0 : \mathfrak{h}_K(\omega) < \infty\},$

and this gives (6.3).
We next establish the subadditivity of capacity.


Proposition 6.5. If K and L are compact, then 


(6.23)  $U_{K\cup L}(x) \le U_K(x) + U_L(x)$
and
(6.24)  $\operatorname{cap}(K \cup L) \le (\operatorname{cap} K) + (\operatorname{cap} L).$


Proof. The inequality (6.23) follows directly from (6.6) and the subadditivity of
Wiener measure. Now, as in (6.12), we have

(6.25)  $\int U_K(x)\,d\mu_{K\cup L}(x) = -(U_K, \Delta U_{K\cup L}) = \int U_{K\cup L}(x)\,d\mu_K(x) = \operatorname{cap} K,$

the last identity by (6.18), with $L$ replaced by $K \cup L$. Hence

$\operatorname{cap} K + \operatorname{cap} L = \int \bigl[U_K(x) + U_L(x)\bigr]\,d\mu_{K\cup L}(x),$

so the estimate (6.23) implies (6.24).


Note that even if $K$ and $L$ are disjoint, typically there is inequality in (6.23),
hence in (6.24). In fact, if $K$ and $L$ are disjoint compact sets,

(6.26)  $(\operatorname{cap} K) + (\operatorname{cap} L) = \operatorname{cap}(K \cup L) + R, \quad R = \int_K U_L(x)\,d\mu_{K\cup L}(x) + \int_L U_K(x)\,d\mu_{K\cup L}(x),$

the quantity $R$ being $> 0$ unless either $\operatorname{cap} K = 0$ or $\operatorname{cap} L = 0$. Unlike measures,
the capacity is not an additive set function on disjoint compact sets.

We began this section with the statement that the capacity of $K$ is closely
related to the probability of a Brownian path hitting $K$. We have directly tied
$U_K(x)$ to this probability, via (6.6). We now provide a two-sided estimate on
$U_K(x)$ in terms of $\operatorname{cap} K$.


Proposition 6.6. Let $\delta(x) = \sup\{|x - y| : y \in K\}$, and let $d(x)$ denote the
distance of $x \in \mathbb{R}^n$ from $K$. Then

(6.27)  $\frac{C_n}{\delta(x)^{n-2}}\,(\operatorname{cap} K) \le U_K(x) \le \frac{C_n}{d(x)^{n-2}}\,(\operatorname{cap} K).$


Proof. The formula $U_K(x) = C_n \int |x - y|^{-(n-2)}\,d\mu_K(y)$ represents $U_K(x)$ as
$C_n(\operatorname{cap} K)$ times a weighted average of $|x - y|^{-(n-2)}$ over $K$. Now, for $y \in K$,
$d(x) \le |x - y| \le \delta(x)$, so (6.27) follows.
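The two-sided estimate (6.27) can be checked by hand when $K$ is a closed ball of radius $a$ centered at the origin, where everything is explicit. The sketch below assumes the normalization $C_n = [(n-2)\,\mathrm{Area}(S^{n-1})]^{-1}$, which is the one that makes $C_n(\operatorname{cap} B_1) = 1$ as used later in the section; for the ball one then has $U_K(x) = (a/|x|)^{n-2}$ outside $K$ and $\operatorname{cap} K = (n-2)\,\mathrm{Area}(S^{n-1})\,a^{n-2}$. All function names are hypothetical.

```python
import math


def sphere_area(n):
    """Area of the unit sphere S^{n-1} in R^n."""
    return 2.0 * math.pi ** (n / 2.0) / math.gamma(n / 2.0)


def C(n):
    """Assumed normalization: C_n |x|^{-(n-2)} is the kernel of (-Laplacian)^{-1}."""
    return 1.0 / ((n - 2) * sphere_area(n))


def cap_ball(a, n):
    """Newtonian capacity of a closed ball of radius a in R^n, n >= 3."""
    return (n - 2) * sphere_area(n) * a ** (n - 2)


def U_ball(x_norm, a, n):
    """Capacitary potential of the ball of radius a, evaluated at |x| = x_norm."""
    return 1.0 if x_norm <= a else (a / x_norm) ** (n - 2)


def bounds_6_27(x_norm, a, n):
    """Lower and upper bounds from (6.27) for a point with |x| = x_norm > a."""
    delta = x_norm + a   # largest distance from x to the ball
    d = x_norm - a       # distance from x to the ball
    scale = C(n) * cap_ball(a, n)
    return scale / delta ** (n - 2), scale / d ** (n - 2)


if __name__ == "__main__":
    lower, upper = bounds_6_27(2.5, 1.0, 3)
    print(lower, U_ball(2.5, 1.0, 3), upper)
```

Both bounds become sharp as $|x| \to \infty$, since $\delta(x)/d(x) \to 1$ there.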


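We want to compare this with the probability that a Brownian path hits $\partial K$
in the interval $[0, t]$. If $t$ is large, we know that $|\omega(t)|$ is probably large, given
that $n \ge 3$, and hence $\omega(s)$ probably will not hit $K$ for any $s > t$. Thus we
expect this probability (which is equal to $U_K^t(x)$) to be close to $U_K(x)$. We derive
a quantitative estimate as follows. Since $1 - U_K^t(x) = e^{t\Delta_\mathcal{O}}1(x)$, we have, for
$s > 0$,

(6.28)  $U_K^{t+s}(x) - U_K^t(x) = e^{t\Delta_\mathcal{O}}1(x) - e^{(t+s)\Delta_\mathcal{O}}1(x) = e^{t\Delta_\mathcal{O}}U_K^s(x),$
and taking $s \nearrow \infty$, we get
(6.29)  $U_K(x) - U_K^t(x) = e^{t\Delta_\mathcal{O}}U_K(x).$

Hence, if we denote the heat kernel on $\mathcal{O} = \mathbb{R}^n \setminus K$ by $p_\mathcal{O}(t, x, y)$, and that on
$\mathbb{R}^n$ by $p(t, x - y)$, as in (1.5),

(6.30)  $U_K(x) - U_K^t(x) = \int_\mathcal{O} p_\mathcal{O}(t, x, y)\,U_K(y)\,dy \le \int p(t, x - y)\,U_K(y)\,dy$
        $= C_n \iint \frac{p(t, x - y)}{|y - z|^{n-2}}\,dy\,d\mu_K(z) \le (\operatorname{cap} K)\,\sigma_K(t, x),$
where

(6.31)  $\sigma_K(t, x) = C_n \sup_{z \in K} \int \frac{p(t, x - y)}{|y - z|^{n-2}}\,dy = \sup_{z \in K} \int_t^\infty p(s, x - z)\,ds,$

the last integral being another way of writing $e^{t\Delta}(-\Delta)^{-1}\delta(x - z)$ when $n \ge 3$.
An upper bound on $\sigma_K(t, x)$ is $\int_t^\infty (4\pi s)^{-n/2}\,ds$, so we have

(6.32)  $0 \le U_K(x) - U_K^t(x) \le \frac{2}{n-2}\,(4\pi)^{-n/2}\,t^{-n/2+1}\,(\operatorname{cap} K).$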
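For the record, here is the elementary integral behind the constant in (6.32), written out as a worked step. It uses only the pointwise bound $p(s, x) \le (4\pi s)^{-n/2}$ on the free heat kernel, and the $n = 3$ line is included as an example.

```latex
\int_t^\infty (4\pi s)^{-n/2}\,ds
  = (4\pi)^{-n/2}\,\Bigl[\frac{s^{1-n/2}}{1-n/2}\Bigr]_{s=t}^{s=\infty}
  = \frac{2}{n-2}\,(4\pi)^{-n/2}\,t^{-n/2+1} \qquad (n \ge 3).
% Example, n = 3: the gap between the hitting probability up to time t
% and the probability of ever hitting K decays like t^{-1/2}:
0 \le U_K(x) - U_K^t(x) \le 2\,(4\pi)^{-3/2}\,t^{-1/2}\,(\operatorname{cap} K).
```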


There is an interesting estimate on the smallest eigenvalue of $-\Delta$ on the
complement of a compact set $K$, in terms of $\operatorname{cap} K$, which we now describe.
Let $Q = \{x \in \mathbb{R}^n : 0 \le x_j \le 1\}$ be the closed unit cube in $\mathbb{R}^n$, and let $K \subset Q$
be compact. We consider the boundary condition on functions on $Q \setminus K$:

(6.33)  $u = 0$ on $\partial K$, $\quad \frac{\partial u}{\partial\nu} = 0$ on $\partial Q \setminus \partial K$.

To define this precisely, let $H^1(Q, K)$ denote the closure in $H^1(Q)$ of the set
of functions in $C^\infty(Q)$ vanishing on a neighborhood of $K$. Then the quadratic
form $(du, dv)_{L^2}$ restricted to $H^1(Q, K) \times H^1(Q, K)$ defines an unbounded,
self-adjoint operator $L$, which we denote $-\Delta_{Q,K}$, with $\mathcal{D}(L^{1/2}) = H^1(Q, K) \subset H^1(Q)$.
Hence $-\Delta_{Q,K}$ has compact resolvent and thus a discrete spectrum. Let
$\lambda_0(K)$ be its smallest eigenvalue.


Proposition 6.7. The smallest eigenvalue $\lambda_0(K)$ of $-\Delta$ on $Q \setminus K$, with boundary
condition (6.33), satisfies the estimate


(6.34)  $\lambda_0(K) \ge \gamma_n \operatorname{cap} K,$


for some $\gamma_n > 0$.


Proof. Let $p_{Q,K}(t, x, y)$ denote the heat kernel of $\Delta_{Q,K}$. With $\mathcal{O} = \mathbb{R}^n \setminus K$, let
$p_\mathcal{O}(t, x, y)$ denote the heat kernel of $\Delta$ on $\mathcal{O}$, with Dirichlet boundary condition,
as in (6.30). We claim that

(6.35)  $\int_Q p_{Q,K}(t, x, y)\,dy \le \int_{\mathbb{R}^n} p_\mathcal{O}(t, x, y)\,dy.$

To see this, define $\widetilde{K}$ by the method of images, so in each unit cube with integer
vertices we have a reflected image of $K$, and, with $\widetilde{\mathcal{O}} = \mathbb{R}^n \setminus \widetilde{K}$,

(6.36)  $p_{Q,K}(t, x, y) = \sum_j p_{\widetilde{\mathcal{O}}}(t, x, R_j y), \quad x, y \in Q,$

where the transformations $R_j$ are appropriate reflections. Then (6.35) follows
from the obvious pointwise estimate $p_{\widetilde{\mathcal{O}}}(t, x, y) \le p_\mathcal{O}(t, x, y)$. Now, if we set

(6.37)  $M(t) = \sup_{x \in Q} \int_{\mathbb{R}^n} p_\mathcal{O}(t, x, y)\,dy,$

it follows that

(6.38)  $\sup_x \int_Q p_{Q,K}(t, x, y)\,dy \le M(t), \quad \sup_y \int_Q p_{Q,K}(t, x, y)\,dx \le M(t),$

the latter by symmetry. It is well known that the operator norm of $e^{t\Delta_{Q,K}}$ is
bounded by the quantities (6.38). (See Proposition 5.1 in Appendix A.) Thus

(6.39)  $\|e^{t\Delta_{Q,K}}\| \le M(t).$
To relate this to capacity, note that

(6.40)  $M(t) = \sup_{x \in Q}\,\bigl(1 - U_K^t(x)\bigr).$

Now, applying the first estimate of (6.27), in concert with the estimate (6.32), we
have

(6.41)  $M(t) \le 1 - C_n\,n^{-n/2+1}\,(\operatorname{cap} K) + \frac{2}{n-2}\,(4\pi)^{-n/2}\,t^{-n/2+1}\,(\operatorname{cap} K).$
In particular, there exists a finite $T = T_n$ and $\kappa > 0$ such that
(6.42)  $M(T) \le 1 - \kappa(\operatorname{cap} K) \le e^{-\kappa \operatorname{cap} K}.$

Since this is an upper bound on $\|e^{T\Delta_{Q,K}}\|$, we have $\lambda_0(K) \ge (\kappa/T) \operatorname{cap} K$,
proving (6.34).


As an application of this, we establish the following result of Molchanov on a 
class of Dirichlet problems with compact resolvent. 


Proposition 6.8. Let $\Omega$ be an unbounded, open subset of $\mathbb{R}^n$, with complement $S$.
Suppose that there exists $\psi(a) \nearrow \infty$ as $a \searrow 0$, such that, for each $a \in (0, 1]$, if
$\mathbb{R}^n$ is tiled by cubes $Q_{aj}$ of edge $a$, we have


(6.43) cap(Qa; ‘al S) > (a)a2"—2) 


for all but finitely many $j$. Then the Laplace operator $\Delta$ on $\Omega$, with Dirichlet
boundary condition, has compact resolvent.


Proof. By scaling $Q_{aj}$ to a unit cube, we see that if (6.43) holds, then $-\Delta$ on
$Q_{aj} \setminus S$, with Dirichlet boundary condition on $\partial S$, Neumann on $\partial Q_{aj} \setminus S$, has
smallest eigenvalue > (cap Qaj 9 S)a~?"—?), which, by hypothesis (6.43) 
is $\ge \gamma_n\psi(a)$ for all but finitely many $j$. The variational characterization of the
spectrum implies that the spectral subspace of $L^2(\Omega)$ on which $-\Delta$ has spectrum
in $[0, \gamma_n\psi(a)]$ is finite-dimensional, for each $a > 0$, and this implies that $\Delta$ has
compact resolvent.


In our continued study of which boundary points of a region $\Omega$ are regular, it
will be useful to have the following variant of Proposition 6.6. Here, $B_r$ is the ball
of radius $r$ centered at the origin in $\mathbb{R}^n$; see Fig. 6.1.

Proposition 6.9. Let $K$ be a compact subset of the ball $B_1$. Let $V_K(x)$ denote the
probability that a Brownian path, starting at $x \in \mathbb{R}^n$, hits $K$ before hitting the
shell $\partial B_4 = \{x : |x| = 4\}$. Then there is a constant $\gamma_n > 0$ such that

(6.44)  $x \in B_1 \Longrightarrow V_K(x) \ge \gamma_n\,(\operatorname{cap} K).$

Proof. Note that, by (5.20), $V_K$ is also defined by

(6.45)  $\Delta V_K = 0$ on $B_4 \setminus K$, $\quad V_K = 1$ on $K$, $\quad V_K = 0$ on $\partial B_4$.




FIGURE 6.1 The Set K 


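We will compare $V_K(x)$ with $U_K(x)$. By (6.27), we have

(6.46)  $x \in B_1 \Longrightarrow U_K(x) \ge 2^{-(n-2)}\,C_n\,(\operatorname{cap} K)$
and
(6.47)  $x \in \partial B_4 \Longrightarrow U_K(x) \le 3^{-(n-2)}\,C_n\,(\operatorname{cap} K).$

By (6.47) and the maximum principle, we have, for $x \in B_4 \setminus K$,

(6.48)  $V_K(x) \ge \frac{U_K(x) - q(K)}{1 - q(K)}, \quad q(K) = 3^{-(n-2)}\,C_n\,(\operatorname{cap} K).$

Now $C_n(\operatorname{cap} K) \le C_n(\operatorname{cap} B_1) = 1$ (compare with Exercise 1 at the end of this
section), so using (6.46) we readily obtain (6.44), with

(6.49)  $\gamma_n = \bigl(1 - 3^{-(n-2)}\bigr)^{-1}\bigl(2^{-(n-2)} - 3^{-(n-2)}\bigr)\,C_n.$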


In particular, since $C_3 = 1/(4\pi)$ when $C_n(\operatorname{cap} B_1) = 1$, we have $\gamma_3 = C_3/4 = 1/(16\pi)$.
Of course, since $V_K(x) \le U_K(x)$, we also have

(6.50)  $x \in B_4,\ \operatorname{dist}(x, K) \ge \rho \Longrightarrow V_K(x) \le C_n\,\rho^{-(n-2)}\,(\operatorname{cap} K).$

This upper bound is valid for $K \subset B_4$; we don't need $K \subset B_1$.

Now suppose $y \in K$ is the center of concentric balls $B_j$, of radius $2^{-j}r$,
where $r > 0$ is fixed, $0 \le j \le \nu$. See Fig. 6.2. Pick $x \in B_\nu$. We want to estimate
the probability that a Brownian path starting at $x$ will exit $B_0$ before hitting $K$.
Let's call the probability $p_{\mathrm{miss}}(x, K)$. Using Proposition 6.9 and scaling, we see
that, given $x \in B_j$, the probability that it hits $\partial B_{j-2}$ before hitting $K \cap B_j$ is



FIGURE 6.2 Setup for the Wiener Test 


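$\le 1 - \gamma_n\,r_j^{-(n-2)} \cdot \operatorname{cap}(K \cap B_j)$, where $r_j = 2^{-j}r$. Using the independence of
this event and of the event that, given $x \in \partial B_{j-2}$, the path will hit $\partial B_{j-4}$ before
hitting $K \cap B_{j-2}$, which follows from the strong Markov property, we have an
upper bound

(6.51)  $p_{\mathrm{miss}}(x, K) \le \prod_{j \in S_\nu} \bigl(1 - \gamma_n\,r^{-(n-2)}\,2^{(n-2)j} \cdot \operatorname{cap}(K \cap B_j)\bigr),$

where $S_\nu = \{j : 0 \le j \le \nu,\ j = \nu \bmod 2\}$. A similar argument dominates
$p_{\mathrm{miss}}(x, K)$ by a product over $\{1, \dots, \nu\} \setminus S_\nu$, so

(6.52)  $p_{\mathrm{miss}}(x, K)^2 \le \prod_{j=0}^{\nu} \bigl(1 - \gamma_n\,r^{-(n-2)}\,2^{(n-2)j} \cdot \operatorname{cap}(K \cap B_j)\bigr).$

Note that, as $\nu \to \infty$, the right side of (6.52) tends to zero, precisely when the sum

(6.53)  $\sum_{j=0}^{\infty} 2^{(n-2)j} \operatorname{cap}(K \cap B_j)$

is infinite. We are now ready to state the Wiener criterion for regular points.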


Proposition 6.10. Let $\Omega$ be a bounded, open set in $\mathbb{R}^n$, and let $y \in \partial\Omega$. If $\Omega$ is
inside a ball $B$, set $K = B \setminus \Omega$. Then $y$ is a regular point for $\Omega$ if and only if the
infinite series (6.53) is divergent, where $B_j = \{x \in \mathbb{R}^n : |x - y| \le 2^{-j}\}$.


Proof. First suppose (6.53) is divergent. Fix $f \in C(\partial\Omega)$, and look at


(6.54)  $u(x) = \operatorname{PI} f(x) = E_x\bigl(f(\omega(\mathfrak{h}_K))\bigr).$




Given $\varepsilon > 0$, fix $r > 0$ so that $f$ varies by less than $\varepsilon$ on $\{z \in \partial\Omega : |z - y| \le r\}$.
By (6.52), if $\delta > 0$ is small enough and $|x - y| \le \delta$, then the probability that a
Brownian path $\omega(t)$, starting at $x$, crosses $\partial B_0 = \{z : |z - y| = r\}$ before hitting
$K$ is $< \varepsilon$. Consequently,

(6.55)  $|x - y| \le \delta \Longrightarrow \bigl|E_x\bigl(f(\omega(\mathfrak{h}_K))\bigr) - f(y)\bigr| \le \varepsilon + \varepsilon \cdot \sup|f|.$

This shows that PI $f(x) \to f(y)$ as $x \to y$, for any $f \in C(\partial\Omega)$, so $y$ is regular.

For the converse, if (6.53) converges, we claim there is a $J < \infty$ such that
there exist points in $\Omega \cap B_J$, arbitrarily close to $y$, which are starting points of
Brownian paths whose probability of hitting $K$ before exiting $B_J$ is $< 1/2$.

Consider the shells $A_j = \{x : 2^{-j-1} \le |x - y| \le 2^{-j}\}$; $B_j = \bigcup_{\ell \ge j} A_\ell$.
We will estimate the probability that a point picked at random in $A_\ell$ is the starting
point of a Brownian path that hits $K$ before exiting $B_J$, where $\ell$ is chosen $> J$.
Since we are assuming $n \ge 3$, by the analysis behind nonrecurrence in Exercises
7-10 of §3, the probability that a path starting in $A_\ell$ ever hits $B_{\ell+3}$ is $< 1/4$. Thus
if we alter $K$ to $K_\ell = K \setminus B_{\ell+3}$, the probability that a Brownian path starting in
$A_\ell$ hits $K_\ell$ before $\partial B_J$ is not decreased by more than $1/4$. We aim to show that
this new probability is $< 1/4$ if $J$ is chosen large enough.

Now there is no further decrease in probability that the path hits $K_\ell$ before $\partial B_J$
if we instead have it start at a random point in $B_{\ell+5}$, since almost all such paths
will pass into $A_\ell$, in a uniformly distributed fashion through its inner boundary. So
we deal with the modified problem of estimating the probability $p$ that a Brownian
path, starting at a random point in $B_{\ell+5}$, hits $K_\ell = K \setminus B_{\ell+3}$ before exiting $B_J$.

We partition the set $\{j : J \le j \le \ell+3\}$ into two sets, where $j$ is even or odd;
call these subsets $\mathcal{J}_0$ and $\mathcal{J}_1$, respectively. Then form

(6.56)  $\mathcal{A}_0 = \bigcup_{j \in \mathcal{J}_0} A_j, \quad \mathcal{A}_1 = \bigcup_{j \in \mathcal{J}_1} A_j.$

We estimate the probability $p_\mu$ that a path starting in $B_{\ell+5}$ hits $K_\ell \cap \mathcal{A}_\mu$ before
hitting $\partial B_J$. We have

(6.57)  $p_\mu \le \sum_{j \in \mathcal{J}_\mu} p_{\mu j},$

where $p_{\mu j}$ is the probability that, given $|x - y| = (3/4) \cdot 2^{-j-1}$ (i.e., $x$ is on a
shell $S_{j+1}$ halfway between the two boundary components of $A_{j+1}$), then a path
starting at $x$ hits $K \cap A_j$ before hitting $S_{j-1}$. By (6.50) and a dilation argument,
we have an estimate of the form

(6.58)  $p_{\mu j} \le \tilde\gamma_n\,2^{j(n-2)} \operatorname{cap}(K \cap A_j).$

Thus the probability $p$ that we want to estimate satisfies




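(6.59)  $p \le \tilde\gamma_n \sum_{j=J}^{\ell+3} 2^{j(n-2)} \operatorname{cap}(K \cap A_j).$

Of course, $\operatorname{cap}(K \cap A_j) \le \operatorname{cap}(K \cap B_j)$, so if (6.53) is assumed to converge,
we can pick $J$ sufficiently large that the right side of (6.59) is guaranteed to be
$< 1/4$.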

From here it is easy to pick $f \in C(\partial\Omega)$ such that $f(y) = 1$ but (6.54) does not
converge to 1 as $x \to y$. This completes the proof of Proposition 6.10 and also
shows that the hypothesis of convergence or divergence of (6.53) can be replaced
by such a hypothesis on

(6.60)  $\sum_{j=0}^{\infty} 2^{(n-2)j} \cdot \operatorname{cap}(K \cap A_j).$
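To see the criterion at work, here are two standard test cases, sketched under assumptions that are not part of the text above: the scaling rule $\operatorname{cap}(\lambda K) = \lambda^{n-2}\operatorname{cap} K$ (which follows from $U_{\lambda K}(x) = U_K(x/\lambda)$) and, in the first case, an exterior-ball type hypothesis on $K$ near $y$.

```latex
% Case 1 (assumed hypothesis): for some c > 0 and all large j,
% K \cap B_j contains a closed ball of radius c\,2^{-j}.
% Monotonicity (Proposition 6.2) and scaling give
2^{(n-2)j}\,\operatorname{cap}(K \cap B_j)
   \ \ge\ 2^{(n-2)j}\,(c\,2^{-j})^{n-2}\,\operatorname{cap} B_1
   \ =\ c^{\,n-2}\,\operatorname{cap} B_1 \ >\ 0,
% so the terms of (6.53) do not tend to zero, the series diverges,
% and y is a regular point.

% Case 2: K \cap B_j = \{y\} for all j \ge j_0 (an isolated boundary point).
% A single point has capacity zero when n \ge 3, since
% \operatorname{cap}\{y\} \le \varepsilon^{n-2}\operatorname{cap} B_1 for every \varepsilon > 0, so
\sum_{j \ge j_0} 2^{(n-2)j}\,\operatorname{cap}(\{y\}) = 0,
% the series (6.53) converges, and y is an irregular point.
```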


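We can extend capacity to arbitrary sets $S \subset \mathbb{R}^n$. The inner capacity $\operatorname{cap}^-(S)$
is defined by

(6.61)  $\operatorname{cap}^-(S) = \sup\{\operatorname{cap} K : K \text{ compact},\ K \subset S\}.$

Clearly, $\operatorname{cap}^-(K) = \operatorname{cap} K$ for compact $K$. If $U \subset \mathbb{R}^n$ is open, we also set
$\operatorname{cap} U = \operatorname{cap}^-(U)$. Now the outer capacity $\operatorname{cap}^+(S)$ is defined by

(6.62)  $\operatorname{cap}^+(S) = \inf\{\operatorname{cap} U : U \text{ open},\ S \subset U\}.$

It is easy to see that $\operatorname{cap}^+(S) \ge \operatorname{cap}^-(S)$ for all $S$. If $\operatorname{cap}^+(S) = \operatorname{cap}^-(S)$,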
then S' is said to be capacitable, and the common quantity is denoted cap S. The 
analysis leading to (6.16) shows that every compact set is capacitable; also, by 
definition, every open set is capacitable. G. Choquet proved that every Borel set is 
capacitable; in fact, his capacitability theorem extends to a more general class of 
sets, known as Souslin sets. We refer to [Mey] for a detailed presentation of this 
result. 

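The outer capacity can be shown to satisfy the property that, for any increasing
sequence of sets $S_j \subset \mathbb{R}^n$,

$S_j \nearrow S \Longrightarrow \operatorname{cap}^+(S_j) \nearrow \operatorname{cap}^+(S).$
We establish a useful special case of this.
Proposition 6.11. If $U_j$ and $U$ are open and $U_j \nearrow U$, then
$\operatorname{cap} U_j \nearrow \operatorname{cap} U.$


Proof. Given $\varepsilon > 0$, pick a compact $K \subset U$ such that $\operatorname{cap} K \ge \operatorname{cap} U - \varepsilon$. Then
$K \subset U_j$ for large $j$, so $\operatorname{cap} U_j \ge \operatorname{cap} U - \varepsilon$ for large $j$.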




We next present a result, due to M. Brelot, to the effect that the set of irregular 
boundary points of a given bounded, open set is rather small. 


Proposition 6.12. If $\Omega \subset \mathbb{R}^n$ is open and bounded, the set $I$ of irregular boundary
points in $\partial\Omega$ has inner capacity zero.


Proof. The claim is that if $K \subset I$ is compact, then $\operatorname{cap} K = 0$. By subadditivity,
it suffices to show the following: Given $y \in \partial\Omega$, there is a neighborhood $B$ of $y$
in $\mathbb{R}^n$ such that any compact $K \subset I \cap B$ has capacity zero.

We prove the result in the case that $\Omega$ is connected. Let $L = B \setminus \Omega$, and consider
the capacitary potential $U_L(x)$. In this case, $\mathbb{R}^n \setminus L$ is connected. The function
$1 - U_L(x)$ is a weak barrier at any $z \in L \cap \partial\Omega$ with the property that $U_L(x) \to 1$ as
$x \to z$, $x \in \mathbb{R}^n \setminus L$. Thus it suffices to show that the set $J = \{z \in L : U_L(z) < 1\}$
has inner capacity zero.

Let $K \subset J$ be compact. We know that $U_K(x) \le U_L(x)$ for all $x \in \mathbb{R}^n$.
Thus $U_K(x) < 1$ on $K$. Now, by (6.15), $U_K(x) = 1$ for $\mu_K$-almost all $x$, so we
conclude that $\mu_K = 0$, hence $\operatorname{cap} K = 0$. This completes the proof when $\Omega$ is
connected.

The general case can be done as follows. If $\Omega$ is not connected, it has at most
countably many connected components. One can connect the various components
via little tubes whose total (inner) capacity can be arranged, via Proposition 6.11,
to be arbitrarily small, say $< \varepsilon$. Then the set of irregular points is decreased by a
set of inner capacity $< \varepsilon$. The reader is invited to supply the details.


As noted in Proposition 5.5, the set of irregular points of $\partial\Omega$ can be characterized
as the set of points of discontinuity of a function $E$, defined on $\overline{\Omega}$ to be
$-\Delta^{-1}1(x)$ for $x \in \Omega$ and to be 0 on $\partial\Omega$. Such a set of points of discontinuity is a
Borel subset of $\overline{\Omega}$, in fact an $F_\sigma$-set. Thus the capacitability theorem applies: If
$\Omega \subset \mathbb{R}^n$ is a bounded open set, the set of irregular points of $\partial\Omega$ has capacity zero.
This sharpening of Proposition 6.12 was first established by H. Cartan.

As we stated at the beginning of this section, we have been working under
the assumption that $n \ge 3$. Two phenomena that we have exploited fail when
$n = 2$. One is that $\Delta$ has a fundamental solution $\le 0$ on all of $\mathbb{R}^n$. The other
is that Brownian motion is nonrecurrent. (Of course, these two phenomena are
related.) There is a theory of logarithmic capacity of planar sets. One way to
approach things is to consider capacities only of subsets of some fixed disk, of
large radius $R$, and use the Laplace operator on this disk, with the Dirichlet boundary
condition. Then one looks at Brownian paths only up to the first exit time from
this disk. The results of this section extend. In particular, the Wiener criterion for
$n = 2$ is the convergence or divergence of


(6.63) $\sum_{j=1}^{\infty} j\,\operatorname{cap}(K \cap A_j).$


Exercises 


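1. If $K \subset \mathbb{R}^n$ is compact, show that

$\lim_{|x|\to\infty} |x|^{n-2}\, U_K(x) = C_n \operatorname{cap} K.$

If $K = B_a$ is a ball of radius $a$, show that $\operatorname{cap} B_a = a^{n-2}/C_n$.
Show generally that if $a > 0$ and $K_a = \{ax : x \in K\}$, then $\operatorname{cap} K_a = a^{n-2} \operatorname{cap} K$.
2. Show that $\operatorname{cap} K = \operatorname{cap} \partial K$. Show that the identity $\operatorname{cap} \partial B_a = a^{n-2}/C_n$ follows
from (6.27), with $x$ the center of $B_a$.
3. Let $C_{a,r}$ be the union of two balls of radius $a$, with centers separated by a distance $r$.
Show that

$\operatorname{cap} C_{a,r} \nearrow 2 \operatorname{cap} B_a$, as $r \to \infty$.

Estimate the rate of convergence.

4. The task here is to estimate the capacity of a cylinder in $\mathbb{R}^n$, of height $b$ and radius $a$.
Suppose $C(a,b) = \{x \in \mathbb{R}^n : 0 \le x_n \le b,\ x_1^2 + \cdots + x_{n-1}^2 \le a^2\}$. Show that there
are positive constants $\alpha_n$ and $\beta_n$ such that

$\operatorname{cap} C(a,1) \sim \alpha_n a^{n-3}, \quad a \to 0,\ n \ge 4,$

$\operatorname{cap} C(a,1) \sim \beta_n a^{n-2}, \quad a \to \infty,\ n \ge 3.$

Derive an appropriate result for $n = 3$, $a \to 0$.
5. Let $\nu$ be a positive measure supported on a compact set $K \subset \mathbb{R}^n$, such that

$U_\nu(x) = -\Delta^{-1}\nu(x) = C_n \int \frac{d\nu(y)}{|x-y|^{n-2}} \le 1.$

Show that $U_\nu(x) \le U_K(x)$ for all $x \in \mathbb{R}^n$. Taking the limit as $|x| \to \infty$, deduce
from the asymptotic behavior of $U_\nu(x)$ and $U_K(x)$ (as in Exercise 1) that $\int d\nu(x) \le \operatorname{cap} K$.

6. Show that, for compact $K \subset \mathbb{R}^n$,

(6.64) $\operatorname{cap} K = \inf\Bigl\{ \int |\nabla f(x)|^2\, dx : f \in C_0^\infty(\mathbb{R}^n),\ f = 1 \text{ on nbd of } K \Bigr\}.$

(Hint: Show that a minimizing sequence $f_j$ approaches $U_K$.)
Show that the condition $f = 1$ on a neighborhood of $K$ can be replaced by $f \ge 1$
on $K$. Show that if $f \in C_0^\infty(\mathbb{R}^n)$, $\lambda > 0$,

(6.65) $\operatorname{cap}(\{x \in \mathbb{R}^n : |f(x)| \ge \lambda\}) \le \lambda^{-2}\, \|\nabla f\|_{L^2}^2.$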


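7. Show that, for compact $K \subset \mathbb{R}^n$,

(6.66) $\frac{1}{\operatorname{cap} K} = \inf\Bigl\{ C_n \iint \frac{d\lambda(x)\, d\lambda(y)}{|x-y|^{n-2}} : \lambda \in \mathcal{P}_K \Bigr\},$

where $\mathcal{P}_K$ denotes the space of probability measures supported on $K$.
(Hint: Consider the sesquilinear form

$\gamma(\mu, \lambda) = C_n \iint \frac{d\mu(x)\, d\lambda(y)}{|x-y|^{n-2}} = -(\Delta^{-1}\mu, \lambda)$

as a (positive-definite) inner product on the Hilbert space $H_K^{-1}(\mathbb{R}^n) = \{u \in H^{-1}(\mathbb{R}^n) : \operatorname{supp} u \subset K\}$. Thus

$|\gamma(\mu, \lambda)| \le \gamma(\mu, \mu)^{1/2}\, \gamma(\lambda, \lambda)^{1/2}.$

Take $\mu = (\operatorname{cap} K)^{-1}\mu_K \in \mathcal{P}_K$, where $\mu_K$ is the measure in (6.8)-(6.10). Show that
(at least, when $\partial K$ is smooth),

$\lambda \in \mathcal{P}_K \cap H_K^{-1}(\mathbb{R}^n) \Longrightarrow \gamma(\mu, \lambda) = \frac{1}{\operatorname{cap} K} \int U_K(x)\, d\lambda(x) = \frac{1}{\operatorname{cap} K},$

and conclude that $\gamma(\lambda, \lambda) \ge 1/(\operatorname{cap} K)$. Then use some limiting arguments.)
8. If $K \subset \mathbb{R}^3$ is compact, relate $\operatorname{cap} K$ to the zero frequency limit of the scattering
amplitude, defined in Chap. 9, §1.
9. Try to establish directly the equivalence between the regularity criteria given by
Propositions 5.5 and 6.10.
10. In Chap. 5, §5, a compact set $K \subset \mathbb{R}^n$ was called “negligible” provided there is no
nonzero $u \in H^{-1}(\mathbb{R}^n)$ supported on $K$. Show that if $K$ is negligible, then $\operatorname{cap} K = 0$.
Try to prove the converse.
11. Sharpen the subadditivity result (6.24) to

$\operatorname{cap}(K \cup L) + \operatorname{cap}(K \cap L) \le \operatorname{cap} K + \operatorname{cap} L,$

for compact sets $K$ and $L$. This property is called “strong subadditivity.”
(Hint: By (6.6), $U_K(x) = W_x(S_K)$, where $S_K = \{\omega : h_K(\omega) < \infty\}$. Show that
$S_{K \cup L} = S_K \cup S_L$ and $S_{K \cap L} \subset S_K \cap S_L$, and deduce that

$U_{K \cup L}(x) + U_{K \cap L}(x) \le U_K(x) + U_L(x).$

Extending the reasoning used in the proof of Proposition 6.5, deduce that

$\operatorname{cap} K + \operatorname{cap} L = \int \bigl[U_K(x) + U_L(x)\bigr]\, d\mu_{K \cup L}(x)$
$\qquad \ge \int \bigl[U_{K \cup L}(x) + U_{K \cap L}(x)\bigr]\, d\mu_{K \cup L}(x)$
$\qquad = \operatorname{cap}(K \cup L) + \operatorname{cap}(K \cap L).$)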

7. Stochastic integrals 


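We will motivate the introduction of the stochastic integral by modifying the
Feynman-Kac formula, to produce a formula for the solution operator $e^{t(\Delta + X)}$ to

(7.1) $\frac{\partial u}{\partial t} = \Delta u + Xu, \quad u(0) = f; \qquad X = \sum_j X_j(x)\, \frac{\partial}{\partial x_j}.$


As in (2.2), we use the Trotter product formula to write


(7.2) $e^{t(\Delta + X)} f = \lim_{k \to \infty} \bigl(e^{(t/k)X} e^{(t/k)\Delta}\bigr)^k f.$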



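If we assume that each coefficient $X_j$ of the vector field $X$ is bounded and uniformly
Lipschitz, then Proposition A.2 applies to (7.2), given $f \in L^p(\mathbb{R}^n)$, $1 \le p < \infty$,
or $f \in C_o(\mathbb{R}^n)$, in view of Proposition 9.13 in Appendix A. Now, for
any $k$, $\bigl(e^{(t/k)X} e^{(t/k)\Delta}\bigr)^k f$ can be expressed as a $k$-fold integral:


(7.3) $\bigl(e^{(t/k)X} e^{(t/k)\Delta}\bigr)^k f(x) = \int \cdots \int f(x_k)\, p\bigl(\tfrac{t}{k},\, x_k - x_{k-1} - \tfrac{t}{k}\xi_{k-1}\bigr) \cdots p\bigl(\tfrac{t}{k},\, x_1 - x_0 - \tfrac{t}{k}\xi_0\bigr)\, dx_1 \cdots dx_k,$


where (with $x_0 = x$)

(7.4) $\xi_j = X(x_j) + r_j, \quad r_j = O(k^{-1}).$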

Now we can write


(7.5) $p\bigl(\tfrac{t}{k},\, x_{j+1} - x_j - \tfrac{t}{k}\xi_j\bigr) = p\bigl(\tfrac{t}{k},\, x_{j+1} - x_j\bigr)\, e^{\xi_j \cdot (x_{j+1} - x_j)/2 - (t/k)|\xi_j|^2/4}.$
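The Gaussian identity (7.5) can be sanity-checked numerically. The following is a minimal sketch in one dimension, assuming the kernel of $e^{s\Delta}$ is $p(s,x) = (4\pi s)^{-1/2} e^{-x^2/4s}$; the function names are illustrative only and do not come from the text.

```python
import math

def heat_kernel(s, x):
    """Assumed 1-D kernel of e^{s*Laplacian}: (4*pi*s)^(-1/2) * exp(-x^2/(4s))."""
    return math.exp(-x * x / (4.0 * s)) / math.sqrt(4.0 * math.pi * s)

def shifted_kernel(s, x, xi):
    """Left side of (7.5) with s = t/k, x = x_{j+1} - x_j: p(s, x - s*xi)."""
    return heat_kernel(s, x - s * xi)

def tilted_kernel(s, x, xi):
    """Right side of (7.5): p(s, x) * exp(xi*x/2 - s*xi^2/4)."""
    return heat_kernel(s, x) * math.exp(xi * x / 2.0 - s * xi * xi / 4.0)

if __name__ == "__main__":
    # The two columns printed last should agree to rounding error.
    for s, x, xi in [(0.1, 0.3, 2.0), (1.0, -1.5, 0.7), (0.01, 0.05, -3.0)]:
        print(s, x, xi, shifted_kernel(s, x, xi), tilted_kernel(s, x, xi))
```

The agreement is just the completion of the square $|x - s\xi|^2 = |x|^2 - 2s\,\xi\cdot x + s^2|\xi|^2$ in the exponent of the kernel.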


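Consequently, parallel to (2.4),


(7.6) $\bigl(e^{(t/k)X} e^{(t/k)\Delta}\bigr)^k f(x) = E_x(\varphi_k),$

where

(7.7) $\varphi_k(\omega) = f(\omega(t))\, e^{A_k(\omega) - B_k(\omega)},$

with

(7.8) $A_k(\omega) = \frac{1}{2} \sum_{j=0}^{k-1} \bigl[X(\omega(\tfrac{jt}{k})) + r_j\bigr] \cdot \bigl[\omega(\tfrac{(j+1)t}{k}) - \omega(\tfrac{jt}{k})\bigr], \qquad B_k(\omega) = \frac{t}{4k} \sum_{j=0}^{k-1} \bigl[X(\omega(\tfrac{jt}{k})) + r_j\bigr]^2.$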

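Thus we expect to establish a formula of the form

(7.9) $e^{t(\Delta + X)} f(x) = E_x\bigl(f(\omega(t))\, e^{A(t,\omega) - B(t,\omega)}\bigr),$


where

(7.10) $B(t,\omega) = \frac{1}{4} \int_0^t X(\omega(s))^2\, ds,$

and

(7.11) $A(t,\omega) = \frac{1}{2} \lim_{k \to \infty} \sum_{j=0}^{k-1} X(\omega(\tfrac{jt}{k})) \cdot \bigl[\omega(\tfrac{(j+1)t}{k}) - \omega(\tfrac{jt}{k})\bigr].$


In (7.10), $X(\omega)^2$ denotes $\sum_j X_j(\omega)^2$. If the coefficients $X_j$ are real-valued, this
is equal to $|X(\omega)|^2$.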

Certainly $B_k(\omega) \to B(t,\omega)$ nicely for all $\omega \in \mathfrak{P}_0$. The limit we now need to
investigate is (7.11), which we would like to write as


(7.12) $A(t,\omega) = \frac{1}{2} \int_0^t X(\omega(s)) \cdot d\omega(s).$


However, $\omega(s)$ has unbounded variation for $W_x$-almost all $\omega$, so there remains
some analysis to be done on this object, which is a prime example of a stochastic
integral.

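We aim to make sense out of stochastic integrals of the form


(7.13) $\int_0^t g(s, \omega(s)) \cdot d\omega(s),$

beginning with

(7.14) $\int_0^t g(s) \cdot d\omega(s) = \lim_{k \to \infty} \sum_{j=0}^{k-1} g(\tfrac{jt}{k}) \cdot \bigl[\omega(\tfrac{(j+1)t}{k}) - \omega(\tfrac{jt}{k})\bigr].$


This is readily seen to be well defined in $L^2(\mathfrak{P}_0, dW_x)$, in view of the fact that
the terms $\theta_j(\omega) = \omega((j+1)t/k) - \omega(jt/k)$ satisfy


(7.15) $\|\theta_j\|^2_{L^2(\mathfrak{P}_0, dW_x)} = \frac{2t}{k}, \qquad (\theta_j, \theta_\ell)_{L^2(\mathfrak{P}_0, dW_x)} = 0, \ \text{for } j \ne \ell,$


the first by (1.38). Thus


(7.16) $\Bigl\| \sum_{j=0}^{k-1} g(\tfrac{jt}{k}) \bigl[\omega(\tfrac{(j+1)t}{k}) - \omega(\tfrac{jt}{k})\bigr] \Bigr\|^2_{L^2(\mathfrak{P}_0, dW_x)} = \frac{2t}{k} \sum_{j=0}^{k-1} \bigl|g(\tfrac{jt}{k})\bigr|^2.$


For continuous $g$, this is a Riemann sum approximating $2\int_0^t |g(s)|^2\, ds$, as $k \to \infty$.
Thus we obtain the following: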




Proposition 7.1. Given $g \in C([0,t])$, the right side of (7.14) converges in
$L^2(\mathfrak{P}_0, dW_x)$. The resulting correspondence


$g \mapsto \int_0^t g(s)\, d\omega(s)$


extends uniquely to $\sqrt{2}$ times an isometry of $L^2([0,t], dt)$ into $L^2(\mathfrak{P}_0, dW_x)$.
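The factor $\sqrt{2}$ in Proposition 7.1 can be observed in a small simulation. The sketch below is illustrative only: it assumes one-dimensional increments with variance $2\,\Delta t$ (the $e^{t\Delta}$ convention used here), and the helper names, the choice $g = \cos$, and the sample sizes are assumptions, not part of the text.

```python
import math
import random

def wiener_sum(g, t, k, rng):
    """One sample of sum_j g(t_j) * theta_j, theta_j ~ N(0, 2t/k), t_j = j*t/k."""
    dt = t / k
    sd = math.sqrt(2.0 * dt)  # variance 2*dt, matching (7.15)
    return sum(g(j * dt) * rng.gauss(0.0, sd) for j in range(k))

def second_moment(g, t, k, n_paths, seed=0):
    """Monte Carlo estimate of the squared L^2 norm on the left side of (7.16)."""
    rng = random.Random(seed)
    return sum(wiener_sum(g, t, k, rng) ** 2 for _ in range(n_paths)) / n_paths

def riemann_norm(g, t, k):
    """Right side of (7.16): (2t/k) * sum_j |g(t_j)|^2."""
    dt = t / k
    return 2.0 * dt * sum(g(j * dt) ** 2 for j in range(k))

if __name__ == "__main__":
    est = second_moment(math.cos, 1.0, 50, 4000)
    print("Monte Carlo:", est)
    print("Riemann sum:", riemann_norm(math.cos, 1.0, 50))
    print("2 * integral of cos^2 on [0,1]:", 1.0 + math.sin(2.0) / 2.0)
```

With more paths the Monte Carlo value settles near the Riemann sum, which in turn approaches $2\int_0^1 \cos^2 s\, ds$ as $k$ grows.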


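We next consider


(7.17) $S_k(\omega) = \sum_{j=0}^{k-1} g(t_j, \omega(t_j)) \cdot [\omega(t_{j+1}) - \omega(t_j)] = \sum_{j=0}^{k-1} g_j(\omega) \cdot \theta_j(\omega),$


where $g_j(\omega) = g(t_j, \omega(t_j))$, $\theta_j(\omega) = \omega(t_{j+1}) - \omega(t_j)$, $t_j = (j/k)t$. Following [Si], Chap. 5, we compute


(7.18) $\|S_k\|^2_{L^2(\mathfrak{P}_0, dW_x)} = \sum_{j,\ell} E_x\bigl(g_j(\omega)\theta_j(\omega)\, g_\ell(\omega)\theta_\ell(\omega)\bigr).$


If $\ell > j$, $\theta_\ell(\omega) = \omega(t_{\ell+1}) - \omega(t_\ell)$ is independent of the other factors in parentheses
on the right side of (7.18), so the expectation of the product is equal to
$E_x(g_j \theta_j g_\ell)\, E_x(\theta_\ell) = 0$ since $E_x(\theta_\ell) = 0$. Similarly the terms in the sum in
(7.18) vanish when $\ell < j$, so


(7.19) $\|S_k\|^2_{L^2(\mathfrak{P}_0, dW_x)} = \sum_j E_x\bigl(|g_j(\omega)|^2\bigr)\, E_x\bigl(|\theta_j|^2\bigr) = 2 \sum_j E_x\bigl(|g(t_j, \omega(t_j))|^2\bigr)\, (t_{j+1} - t_j).$


If $g$ and $\omega$ are continuous, this is a Riemann sum approximating the integral
$2\int_0^t E_x\bigl(|g(s, \omega(s))|^2\bigr)\, ds$, and we readily obtain the following result.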


Proposition 7.2. Given $g \in BC([0,t] \times \mathbb{R}^n)$, the expression (7.17) converges as
$k \to \infty$, in $L^2(\mathfrak{P}_0, dW_x)$, to a limit we denote by (7.13). Furthermore, the map


$g \mapsto \int_0^t g(s, \omega(s)) \cdot d\omega(s)$


is $\sqrt{2}$ times an isometry into $L^2(\mathfrak{P}_0, dW_x)$, when $g$ has the square norm


(7.20) $Q_x(g) = \int_0^t E_x\bigl(|g(s, \omega(s))|^2\bigr)\, ds.$




Note that $Q_x(g) = \int_0^t \int_{\mathbb{R}^n} |g(s,y)|^2\, p(s, x - y)\, dy\, ds$. In case $g = g(\omega(s))$,
we have $Q_x(g)$ given as the square of a weighted $L^2$-norm:


(7.21) $Q_x(g) = \int_{\mathbb{R}^n} |g(y)|^2\, r_t(x - y)\, dy = R_t(D)|g|^2(x),$

where

(7.22) $R_t(D) = \Delta^{-1}(e^{t\Delta} - 1), \quad r_t(x) = R_t(D)\delta(x).$


We see that $R_t(D) \in OPS^{-2}(\mathbb{R}^n)$. The convolution kernel $r_t(x)$ is smooth on
$\mathbb{R}^n \setminus 0$ and rapidly decreasing as $|x| \to \infty$. More precisely, one easily verifies that


(7.23) $r_t(x) \le C(n,t)\, |x|^{-2} e^{-|x|^2/4t}, \quad \text{for } |x| \ge \tfrac{1}{2},$

and

(7.24) $r_t(x) \le C(n,t)\, |x|^{2-n}, \quad \text{for } |x| \le \tfrac{1}{2},\ n \ge 3,$


with $|x|^{2-n}$ replaced by $\log 1/|x|$ for $n = 2$ and by 1 for $n = 1$. Of course,
$r_t(x) > 0$ for all $t > 0$, $x \in \mathbb{R}^n \setminus 0$.

In particular, the integral in (7.21) is absolutely convergent and $Q_x(g)$ is a
continuous function of $x$ provided


(7.25) $g \in L^p_{\mathrm{loc}}(\mathbb{R}^n)$, for some $p > n$, and $g \in L^2\bigl(\mathbb{R}^n, \langle x \rangle^{-2} e^{-|x|^2/4t}\, dx\bigr).$


Proposition 7.2 is adequate to treat the case where the coefficients $X_j$ are in
$BC(\mathbb{R}^n)$ and purely imaginary. Since $A_k(\omega) \to A(t,\omega)$ in $L^2(\mathfrak{P}_0, dW_x)$,


(7.26) $e^{A_k(\omega)} \to e^{A(t,\omega)}$ in measure,


and boundedly, since the terms in (7.26) all have absolute value 1. Then convergence
of (7.6) follows from the dominated convergence theorem. In such a case,
$X(\omega)^2$ in (7.10) is equal to $-|X(\omega)|^2$. We have the following.


Proposition 7.3. If $X = iY$ is a vector field on $\mathbb{R}^n$ with coefficients that are
bounded, continuous, and purely imaginary, then


(7.27) $e^{t(\Delta + iY)} f(x) = E_x\Bigl(f(\omega(t))\, e^{(i/2)\int_0^t Y(\omega(s)) \cdot d\omega(s) + (1/4)\int_0^t |Y(\omega(s))|^2\, ds}\Bigr).$


One final ingredient is required to prove Proposition 7.3, since in this case $e^{tX}$
is not a semigroup of bounded operators, so we cannot apply Proposition A.2.
However, we can apply Proposition A.3, with


$S(t)f(x) = \int f(y)\, p\bigl(t, y - x - tX(x)\bigr)\, dy.$

If $X = iY$ is purely imaginary, then, parallel to (7.5), we have

$p\bigl(t, y - x - itY(x)\bigr) = p(t, y - x)\, e^{iY(x) \cdot (y - x)/2 + t|Y(x)|^2/4}.$


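If $V$ is bounded and continuous, a simple modification of the analysis above,
combining techniques of §2, yields


(7.28) $e^{t(\Delta + X - V)} f(x) = E_x\Bigl(f(\omega(t))\, e^{A(t,\omega) - B(t,\omega) - \int_0^t V(\omega(s))\, ds}\Bigr)$


when $X$ is purely imaginary. For another interpretation of this, consider


(7.29) $H = \sum_j \Bigl(\frac{1}{i}\frac{\partial}{\partial x_j} - A_j(x)\Bigr)^2 + V = -\sum_j \Bigl(\frac{\partial^2}{\partial x_j^2} - 2iA_j \frac{\partial}{\partial x_j} - i\frac{\partial A_j}{\partial x_j} - A_j^2\Bigr) + V.$


Assume each $A_j$ is real-valued, and $A_j, \partial A_j/\partial x_j \in BC(\mathbb{R}^n)$. Then

(7.30) $e^{-tH} f(x) = E_x\bigl(f(\omega(t))\, e^{S(t,\omega)}\bigr), \qquad S(t,\omega) = -i\int_0^t A(\omega(s)) \cdot d\omega(s) - i\int_0^t (\operatorname{div} A)(\omega(s))\, ds - \int_0^t V(\omega(s))\, ds.$


Compare with the derivation in [Si], Chap. 5.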

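If the coefficients of $X$ are not assumed to be purely imaginary, we need some
more estimates. More generally, we will derive further estimates on the approximants
$S_k(\omega)$ to $\int_0^t g(s, \omega(s)) \cdot d\omega(s)$, defined by (7.17).


Lemma 7.4. If $g$ is bounded and continuous, then


(7.31) $E_x(e^{S_k}) \le e^{\gamma^2 t}, \quad \gamma = \|g\|_{L^\infty},$

and

(7.32) $E_x(e^{\lambda |S_k|}) \le 2 e^{\lambda^2 \gamma^2 t}.$


Proof. The left side of (7.31) is


(7.33) $E_x\bigl(e^{g_0\theta_0 + \cdots + g_{k-1}\theta_{k-1}}\bigr) = \sum_{\nu=0}^{\infty} \frac{1}{\nu!}\, E_x\bigl(e^{g_0\theta_0 + \cdots + g_{k-2}\theta_{k-2}}\, g_{k-1}^\nu \theta_{k-1}^\nu\bigr)$
$\qquad \le \sum_{\nu=0}^{\infty} \frac{1}{\nu!}\, E_x\bigl(e^{g_0\theta_0 + \cdots + g_{k-2}\theta_{k-2}}\bigr)\, \gamma^\nu E_x(\theta_{k-1}^\nu)$
$\qquad = E_x\bigl(e^{g_0\theta_0 + \cdots + g_{k-2}\theta_{k-2}}\bigr)\, E_x\bigl(e^{\gamma\theta_{k-1}}\bigr),$


by independence arguments such as used in the analysis of (7.18). Note that
the sums over $\nu$ above have terms that vanish for odd $\nu$. Now $E_x(e^{\gamma\theta_j}) = e^{(t_{j+1} - t_j)\gamma^2}$.
An inductive argument leads to (7.31), and (7.32) follows from this
plus $e^{|y|} \le e^{y} + e^{-y}$.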


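We next estimate the $L^2(\mathfrak{P}_0, dW_x)$-norm of $S_{2k} - S_k$. Another calculation,
parallel to (7.18)-(7.19), yields


(7.34) $\|S_{2k} - S_k\|^2_{L^2(\mathfrak{P}_0, dW_x)} = 2 \sum_j E_x\Bigl(\bigl|g(t_{j+1/2}, \omega(t_{j+1/2})) - g(t_j, \omega(t_j))\bigr|^2\Bigr)\, (t_{j+1} - t_{j+1/2}),$


where $t_j = jt/k$ as in (7.17), and $t_{j+1/2} = (j + 1/2)t/k$. If we assume a Lipschitz
condition on $g$, we obtain the following estimate.


Lemma 7.5. Assume that


(7.35) $|g(t,x) - g(s,y)|^2 \le C_0 |t - s|^2 + C_1 |x - y|^2.$

Then

(7.36) $\|S_{2k} - S_k\|^2_{L^2(\mathfrak{P}_0, dW_x)} \le \frac{t^2}{k}\Bigl(C_1 + \frac{C_0 t}{4k}\Bigr).$


Proof. This follows from (7.34) plus $E_x\bigl(|\omega(t) - \omega(s)|^2\bigr) = 2|t - s|$.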


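We can now make an estimate directly relevant to the limiting behavior of (7.7).


Lemma 7.6. Given the bound $\|g\|_{L^\infty} \le \gamma$, we have

(7.37) $\|e^{S_{2k}} - e^{S_k}\|_{L^1(\mathfrak{P}_0, dW_x)} \le \sqrt{2}\, \|S_{2k} - S_k\|_{L^2(\mathfrak{P}_0, dW_x)}\, e^{8\gamma^2 t}.$

Proof. Using $e^u - e^v = (u - v)\,\Phi(u,v)$, with $|\Phi(u,v)| \le e^{|u| + |v|}$, we have

(7.38) $\|e^{S_{2k}} - e^{S_k}\|_{L^1(\mathfrak{P}_0)} \le \|S_{2k} - S_k\|_{L^2(\mathfrak{P}_0)} \cdot \bigl[E_x\bigl(e^{2|S_{2k}| + 2|S_k|}\bigr)\bigr]^{1/2},$


and the estimate (7.32), plus $2e^{u+v} \le e^{2u} + e^{2v}$, then yields (7.37).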




With these estimates, we can pass to the limit in (7.6)-(7.7), obtaining the 
following result. 


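Proposition 7.7. If $X$ is a real vector field on $\mathbb{R}^n$ whose coefficients are bounded
and uniformly Lipschitz, and if $f \in C_0^\infty(\mathbb{R}^n)$, then

(7.39) $e^{t(\Delta + X)} f(x) = E_x\Bigl(f(\omega(t))\, e^{(1/2)\int_0^t X(\omega(s)) \cdot d\omega(s) - (1/4)\int_0^t |X(\omega(s))|^2\, ds}\Bigr).$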


Now that the identity (7.39) is established for X and f such as described above, 
one can use limiting arguments to extend the identity to more general cases. Such 
extensions are left to the reader. 

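We now evaluate the stochastic integral $\int_0^t \omega(s)\, d\omega(s)$ in the case of one-dimensional
Brownian motion. One might anticipate that it should be $\omega(t)^2/2 - \omega(0)^2/2$.
However, guesses based on what should happen if $\omega$ had bounded variation
can be misleading, and the truth is a little stranger. Let us begin with


(7.40) $\omega(t)^2 - \omega(0)^2 = \sum_{j=0}^{k-1} \bigl[\omega(t_{j+1}) + \omega(t_j)\bigr]\bigl[\omega(t_{j+1}) - \omega(t_j)\bigr],$


where $t_j = (j/k)t$, as in (7.17). We also use $\theta_j(\omega) = \omega(t_{j+1}) - \omega(t_j)$ below.
Recalling that $\int_0^t \omega(s)\, d\omega(s)$ is the limit of $\sum_j \omega(t_j)[\omega(t_{j+1}) - \omega(t_j)]$, we write
(7.40) as


(7.41) $\omega(t)^2 - \omega(0)^2 = 2\sum_{j=0}^{k-1} \omega(t_j)\theta_j(\omega) + \sum_{j=0}^{k-1} \theta_j(\omega)^2.$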


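The next result is the key to the computation.

Lemma 7.8. Given $t > 0$,


(7.42) $\Theta_k(\omega) = \sum_{j=0}^{k-1} \bigl[\omega(\tfrac{(j+1)t}{k}) - \omega(\tfrac{jt}{k})\bigr]^2 \to 2t$ in $L^2(\mathfrak{P}_0, dW_x)$,


as $k \to \infty$.


Proof. We have


(7.43) $E_x\bigl(|\Theta_k - 2t|^2\bigr) = E_x\Bigl(\Bigl|\sum_{j=0}^{k-1} \bigl[\theta_j^2 - \tfrac{2t}{k}\bigr]\Bigr|^2\Bigr) = \sum_{j=0}^{k-1} E_x\Bigl(\bigl[\theta_j^2 - \tfrac{2t}{k}\bigr]^2\Bigr),$


the last identity by independence of the different $\theta_j$. Now we know that $E_x(\theta_j^2) = 2t/k$;
furthermore, generally $E_x\bigl([F - E_x(F)]^2\bigr) \le E_x(F^2)$, so it follows that


(7.44) $E_x\bigl(|\Theta_k - 2t|^2\bigr) \le \sum_{j=0}^{k-1} E_x(\theta_j^4) = \frac{12t^2}{k}.$


This proves the lemma.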


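Thus, as $k \to \infty$, the right side of (7.41) converges in $L^2(\mathfrak{P}_0, dW_x)$ to
$2\int_0^t \omega(s)\, d\omega(s) + 2t$. This gives the identity


(7.45) $\int_0^t \omega(s)\, d\omega(s) = \frac{1}{2}\bigl[\omega(t)^2 - \omega(0)^2 - 2t\bigr],$


for $W_x$-almost all $\omega$.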
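Lemma 7.8 and the identity (7.45) can be illustrated on a single simulated path. This is a minimal sketch, assuming a path started at $\omega(0) = 0$ with independent increments of variance $2\,\Delta t$; the function names and the grid size are illustrative choices only.

```python
import math
import random

def brownian_path(t, k, rng):
    """Sample omega(t_j), j = 0..k, with omega(0) = 0 and Var(increment) = 2*t/k."""
    sd = math.sqrt(2.0 * t / k)
    path = [0.0]
    for _ in range(k):
        path.append(path[-1] + rng.gauss(0.0, sd))
    return path

def quadratic_variation(path):
    """Theta_k of (7.42): the sum of squared increments."""
    return sum((b - a) ** 2 for a, b in zip(path, path[1:]))

def ito_sum(path):
    """Left-endpoint sum  sum_j omega(t_j) * [omega(t_{j+1}) - omega(t_j)]."""
    return sum(a * (b - a) for a, b in zip(path, path[1:]))

if __name__ == "__main__":
    t, k = 1.0, 20000
    path = brownian_path(t, k, random.Random(0))
    qv = quadratic_variation(path)
    print("Theta_k (expected near 2t):", qv)
    print("Ito sum:", ito_sum(path))
    print("(omega(t)^2 - Theta_k) / 2:", 0.5 * (path[-1] ** 2 - qv))
```

The last two printed values agree exactly, up to rounding, because (7.41) is an algebraic identity for each $k$; the probabilistic content is that $\Theta_k$ is close to $2t$, not $0$.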
More generally, for sufficiently smooth $f$, we can write


(7.46) $f(\omega(t)) - f(\omega(0)) = \sum_{j=0}^{k-1} \bigl[f(\omega(t_{j+1})) - f(\omega(t_j))\bigr]$


and use the expansion


(7.47) $f(\omega(t_{j+1})) - f(\omega(t_j)) = \theta_j(\omega) f'(\omega(t_j)) + \frac{1}{2}\theta_j(\omega)^2 f''(\omega(t_j)) + O(|\theta_j(\omega)|^3)$


to generalize (7.45) to Ito’s fundamental identity:


(7.48) $f(\omega(t)) - f(\omega(0)) = \int_0^t f'(\omega(s))\, d\omega(s) + \int_0^t f''(\omega(s))\, ds$


for one-dimensional Brownian motion. For $n$-dimensional Brownian motion and
functions of the form $f = f(t,x)$, this generalizes to


(7.49) $f(t, \omega(t)) - f(0, \omega(0)) = \int_0^t (\nabla_x f)(s, \omega(s)) \cdot d\omega(s) + \int_0^t (\Delta f)(s, \omega(s))\, ds + \int_0^t f_t(s, \omega(s))\, ds.$

Another way of writing this is


(7.50) $df(t, \omega(t)) = (\nabla_x f) \cdot d\omega + (\Delta f)\, dt + f_t\, dt.$




We remind the reader that our choice of $e^{t\Delta}$ rather than $e^{t\Delta/2}$ to define the
transition probabilities for Brownian paths leads to formulas that sometimes look
different from those arising from the latter convention, which for example would
replace $(\Delta f)\, dt$ by $(1/2)(\Delta f)\, dt$ in (7.50).
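As a consistency check on (7.48), offered only as an illustrative worked example, take $f(x) = x^2$ for one-dimensional Brownian motion. The general identity should then reduce to (7.45):

```latex
\begin{align*}
f(x) = x^2:\qquad \omega(t)^2 - \omega(0)^2
  &= \int_0^t 2\,\omega(s)\,d\omega(s) + \int_0^t 2\,ds \\
  &= 2\int_0^t \omega(s)\,d\omega(s) + 2t,
\end{align*}
so that
\[
  \int_0^t \omega(s)\,d\omega(s)
  = \tfrac12\bigl[\omega(t)^2 - \omega(0)^2 - 2t\bigr].
\]
```

Under the $e^{t\Delta/2}$ convention the correction term $2t$ would be replaced by $t$.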

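Note in particular that


$d\bigl(e^{\lambda\omega(t) - \lambda^2 t}\bigr) = \lambda e^{\lambda\omega(t) - \lambda^2 t}\, d\omega(t);$

in other words, we have a solution to the “stochastic differential equation”:

(7.51) $d\mathfrak{X} = \lambda \mathfrak{X}\, d\omega(t), \quad \mathfrak{X}(t) = e^{\lambda\omega(t) - \lambda^2 t},$


for $W_0$-almost all $\omega$. Recall from (4.16) that this is the martingale $\mathfrak{z}_t(\omega)$.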
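The martingale property implies in particular that $e^{\lambda\omega(t) - \lambda^2 t}$ has expectation 1 under $W_0$. A rough Monte Carlo check follows, assuming only that $\omega(t)$ is Gaussian with mean 0 and variance $2t$ under $W_0$; the function name and parameter values are illustrative.

```python
import math
import random

def exp_martingale_mean(lam, t, n_samples, seed=0):
    """Monte Carlo estimate of E_0[exp(lam*omega(t) - lam^2 * t)]."""
    rng = random.Random(seed)
    sd = math.sqrt(2.0 * t)  # omega(t) ~ N(0, 2t) in the e^{t*Laplacian} convention
    total = 0.0
    for _ in range(n_samples):
        w = rng.gauss(0.0, sd)
        total += math.exp(lam * w - lam * lam * t)
    return total / n_samples

if __name__ == "__main__":
    for lam in (0.25, 0.5, 1.0):
        print(lam, exp_martingale_mean(lam, 1.0, 20000))
```

The compensating exponent is $\lambda^2 t$ rather than $\lambda^2 t/2$ precisely because the variance of $\omega(t)$ is $2t$ here.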

We now discuss a dynamical theory of Brownian motion due to Langevin, 
whose purpose was to elucidate Einstein’s work on the motion of a Brownian 
particle. Langevin produced the following equation for the velocity of a small 
particle suspended in a liquid, undergoing the sort of random motion investigated 
by R. Brown: 


(7.52) $\frac{dv}{dt} = -\beta v + \omega'(t), \quad v(0) = v_0.$

Here, the term $-\beta v$ represents the frictional force, tending to slow down the
particle as it moves through the fluid. The term $\omega'(t)$, which contributes to the
force, is due to “white noise,” a random force whose statistical properties identify
it with the time derivative of $\omega$, which is defined, not classically, but through
Propositions 7.1 and 7.2. Thus we rewrite (7.52) as the stochastic differential
equation


(7.53) $dv = -\beta v\, dt + d\omega, \quad v(0) = v_0.$


As in the case of ODE, we have $d(e^{\beta t}v) = e^{\beta t}(dv + \beta v\, dt)$, so (7.50) yields
$d(e^{\beta t}v) = e^{\beta t}\, d\omega$, which integrates to


(7.54) $v(t) = v_0 e^{-\beta t} + \int_0^t e^{-\beta(t-s)}\, d\omega(s) = v_0 e^{-\beta t} + \omega(t) - \beta\int_0^t e^{-\beta(t-s)}\omega(s)\, ds.$


The actual path of such a particle is given by


(7.55) $x(t) = x_0 + \int_0^t v(s)\, ds.$


In the case $x_0 = 0$, $v_0 = 0$, we have


(7.56) $x(t) = \int_0^t \int_0^s e^{-\beta(s-r)}\, d\omega(r)\, ds = \frac{1}{\beta}\int_0^t \bigl[1 - e^{-\beta(t-s)}\bigr]\, d\omega(s).$


Via the identity in (7.54), we have


(7.57) $x(t) = \int_0^t e^{-\beta(t-s)}\omega(s)\, ds.$


Of course, the path $x(t)$ taken by such a particle is not the same as the “Brownian
path” $\omega(t)$ we have been studying, but it is approximated by $\omega(t)$ in the following
sense. It is observed experimentally that the frictional force component in (7.52)
acts to slow down a particle in a very short time ($\sim 10^{-8}$ sec.). In other words,
the dimensional quantity $\beta$ in (7.52) is, in terms of units humans use to measure
standard macroscopic quantities, “large.” Now (7.57) implies


(7.58) $\lim_{\beta \to \infty} \beta x_\beta(t) = \omega(t),$


where $x_\beta(t)$ denotes the path (7.57).
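The limit (7.58) can be seen in a crude discretization of (7.53) and (7.55). The sketch below uses an Euler-Maruyama step, which is a numerical stand-in and not a method from the text; it assumes $v_0 = x_0 = \omega(0) = 0$ and increments of variance $2\,\Delta t$, and the function name and step counts are illustrative.

```python
import math
import random

def langevin_path(beta, t, n_steps, seed=0):
    """Euler-Maruyama sketch for dv = -beta*v dt + d(omega), dx = v dt.

    Starts from v = x = omega = 0 and returns (x(t), v(t), omega(t)).
    """
    rng = random.Random(seed)
    dt = t / n_steps
    sd = math.sqrt(2.0 * dt)  # Var(d omega) = 2*dt
    x = v = w = 0.0
    for _ in range(n_steps):
        dw = rng.gauss(0.0, sd)
        x += v * dt                 # (7.55), left-endpoint rule
        v += -beta * v * dt + dw    # (7.53)
        w += dw
    return x, v, w

if __name__ == "__main__":
    for beta in (10.0, 100.0, 400.0):
        x, v, w = langevin_path(beta, 1.0, 20000, seed=1)
        print(beta, beta * x, w, beta * x - w)
```

In this discretization $\beta x(t) = \omega(t) - v(t)$ holds exactly, and $v(t)$ has standard deviation of order $\beta^{-1/2}$, which is the quantitative form of (7.58) that the printed differences reflect.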

There has been further work on the dynamics of Brownian motion, particu- 
larly by L. Ornstein and G. Uhlenbeck [UO]. See [Nel3] for more on this, and 
references to other work. 


Exercises 


1. If $g \in C^1([0,t])$, show that the integral of Proposition 7.1 is given by


$\int_0^t g(s)\, d\omega(s) = g(t)\omega(t) - g(0)\omega(0) - \int_0^t g'(s)\omega(s)\, ds.$


Show that this yields the second identity in (7.54) and the implication (7.56) ⇒ (7.57).
2. With $\theta_j$ as in (7.15), show that


$E_x\Bigl(\sum_{j=0}^{k-1} |\theta_j(\omega)|^3\Bigr) \to 0$, as $k \to \infty$.


(Hint: Use $2|\theta_j|^3 \le \varepsilon|\theta_j|^2 + \varepsilon^{-1}|\theta_j|^4$ and (7.44).)
3. Making use of Exercise 2, give a detailed proof of Ito’s formula (7.48). Assume $f \in C^2(\mathbb{R})$ and

$|D^\alpha f(x)| \le C_\varepsilon e^{\varepsilon|x|^2}, \quad \forall\, \varepsilon > 0,\ |\alpha| \le 2.$

More generally, establish (7.49).
Warning: The estimate of the remainder term in (7.47) is valid only when $|\omega(t_{j+1}) - \omega(t_j)|$
is bounded (say $\le K$). But the probability that $|\omega(t_{j+1}) - \omega(t_j)|$ is $\ge K$ is very
small.




4. Show that (7.42) implies that $W_x$-almost all paths $\omega$ have locally unbounded variation,
on any interval $[s,t] \subset [0, \infty)$.
5. If $\psi(t,\omega) = \int_0^t g(s, \omega(s)) \cdot d\omega(s)$ is a stochastic integral given by Proposition 7.2,
show that
$E_x(\psi(t, \cdot)) = 0.$
Show that $\psi(t, \cdot)$ is a martingale, that is, $E_x(\psi(t, \cdot)|\mathfrak{B}_s) = \psi(s, \cdot)$, for $s \le t$. Compare
Exercise 2 of §8.


8. Stochastic integrals, II 


In §7 we considered stochastic integrals of the form

(8.1) $h(t,\omega) = \int_0^t g(s, \omega(s)) \cdot d\omega(s),$

where $g$ is defined on $[0,\infty) \times \mathbb{R}^n$. This is a special case of integrals of the form

(8.2) $\psi(t,\omega) = \int_0^t \varphi(s,\omega) \cdot d\omega(s),$


where $\varphi$ is defined on $[0,\infty) \times \mathfrak{P}_0$. There are important examples of such $\varphi$ which
are not of the form $\varphi(s,\omega) = g(s, \omega(s))$, such as the function $h$ in (8.1), typically.
It is important to be able to handle more general integrals of the form (8.2), for
a certain class of functions $\varphi$ on $[0,\infty) \times \mathfrak{P}_0$ called “adapted,” which will be
defined below.

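To define (8.2), we extend the analysis in (7.17)-(7.19). Thus we consider


(8.3) $S_k(t,\omega) = \sum_{j=0}^{k-1} \varphi(t_j, \omega) \cdot [\omega(t_{j+1}) - \omega(t_j)] = \sum_{j=0}^{k-1} \varphi_j(\omega) \cdot \theta_j(\omega),$


where, as before, $\theta_j(\omega) = \omega(t_{j+1}) - \omega(t_j)$, $t_j = (j/k)t$. As in (7.18), we want
to compute


(8.4) $\|S_k(t, \cdot)\|^2_{L^2(\mathfrak{P}_0, dW_x)} = \sum_{j,\ell} E_x\bigl(\varphi_j \theta_j \varphi_\ell \theta_\ell\bigr).$


Following the analysis of (7.18), we want $\theta_\ell$ to be independent of the other factors
in the parentheses on the right side of (8.4) when $\ell > j$. Thus we demand of $\varphi$
that

(8.5) $\varphi(s, \cdot)$ is independent of $\omega(t + h) - \omega(t)$, $\forall\, t \ge s,\ h > 0$.


Granted this, we see that the terms in the sum in (8.4) vanish when $j \ne \ell$, and


(8.6) $\|S_k(t, \cdot)\|^2_{L^2(\mathfrak{P}_0, dW_x)} = \sum_j E_x\bigl(|\varphi_j|^2\bigr)\, E_x\bigl(|\theta_j|^2\bigr) = 2\sum_j E_x\bigl(|\varphi(t_j, \cdot)|^2\bigr)\, (t_{j+1} - t_j).$


If $\varphi \in C(\mathbb{R}^+, L^2(\mathfrak{P}_0, dW_x))$, this is a Riemann sum approximating


$2\int_0^t E_x\bigl(|\varphi(s, \cdot)|^2\bigr)\, ds = 2\|\varphi\|^2_{L^2([0,t] \times \mathfrak{P}_0)}.$

We use the following spaces:


(8.7) $C(I, R(Q)) = \{\varphi \in C(I, L^2(\mathfrak{P}_0, dW_x)) : \varphi(t) = Q_t\varphi(t),\ \forall\, t \in I\},$

$\qquad L^2(I, R(Q)) = \{\varphi \in L^2(I, L^2(\mathfrak{P}_0, dW_x)) : \varphi(t) = Q_t\varphi(t),\ \forall\, t \in I\},$


where $I = [0,T]$, and, as in §4, $Q_t\varphi = E_x(\varphi|\mathfrak{B}_t)$. Elements of these spaces
satisfy (8.5), by Corollary 4.4.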


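Proposition 8.1. Given $\varphi \in C(I, R(Q))$, the expression (8.3) converges as $k = 2^\nu \to \infty$,
in the space $C(I, R(Q))$, to a limit we denote (8.2). Furthermore,
$\psi = \mathfrak{I}(\varphi)$ extends uniquely to a linear map


(8.8) $\mathfrak{I} : L^2(I, R(Q)) \to C(I, R(Q)),$

satisfying

(8.9) $\|\mathfrak{I}(\varphi)(t, \cdot)\|_{L^2(\mathfrak{P}_0, dW_x)} = \sqrt{2}\, \|\varphi\|_{L^2([0,t] \times \mathfrak{P}_0,\, dt\, dW_x)}.$


Regarding continuity, note that


(8.10) $\|\mathfrak{I}(\varphi)(t + h, \cdot) - \mathfrak{I}(\varphi)(t, \cdot)\|_{L^2(\mathfrak{P}_0, dW_x)} = \sqrt{2}\, \|\varphi\|_{L^2([t, t+h] \times \mathfrak{P}_0,\, dt\, dW_x)}.$


We need to verify that $\mathfrak{I}(\varphi)(t, \cdot) \in R(Q_t)$. But clearly, each term $\varphi(t_j, \omega) \cdot [\omega(t_{j+1}) - \omega(t_j)]$
in (8.3) belongs to $R(Q_t)$ in this case, so we have the desired
result.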

We mention an approach to (8.8) just slightly different from that described
above. Define a simple function to be a function $\varphi(t,\omega)$ that is constant in $t$ for $t$
in intervals of the form $[\ell 2^{-\nu}, (\ell + 1)2^{-\nu})$, with values in $R(Q_s)$, $s = \ell 2^{-\nu}$, for
some $\nu \in \mathbb{Z}^+$. For a simple function $\varphi$, the stochastic integral has a form similar
to (8.3), namely,


(8.11) $\int_0^t \varphi(s,\omega) \cdot d\omega(s) = \sum_{j=0}^{\ell - 1} \varphi(t_j, \omega) \cdot [\omega(t_{j+1}) - \omega(t_j)] + \varphi(t_\ell, \omega) \cdot [\omega(t) - \omega(t_\ell)],$


where $t_j = j2^{-\nu}$ and $t \in [\ell 2^{-\nu}, (\ell + 1)2^{-\nu})$. An identity similar to (8.6), together
with the denseness of the set of simple functions in $L^2(I, R(Q))$, yields (8.8).
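There is the following generalization of Ito’s formula (7.49)-(7.50). Suppose


(8.12) $\mathfrak{X}(t) = \mathfrak{X}_0 + \int_{t_0}^t u(s,\omega)\, ds + \int_{t_0}^t v(s,\omega)\, d\omega(s),$


where $u, v \in L^2(I, R(Q))$. Then $\mathfrak{X} \in C(I, R(Q))$. We write

(8.13) $d\mathfrak{X} = u\, dt + v\, d\omega.$


We might assume $\mathfrak{X}$, $u$, and $\omega$ take values in $\mathbb{R}^n$ and $v$ is $n \times n$ matrix-valued.
More generally, let $\omega$ take values in $\mathbb{R}^n$, $\mathfrak{X}$ and $u$ in $\mathbb{R}^m$, and $v$ in $\operatorname{Hom}(\mathbb{R}^n, \mathbb{R}^m)$.

If $\mathfrak{Y}(t) = g(t, \mathfrak{X}(t))$, with $g(t,x)$ real-valued and smooth in its arguments,
then


(8.14) $d\mathfrak{Y}(t) = (\nabla_x g)(t, \mathfrak{X}(t)) \cdot d\mathfrak{X}(t) + \frac{1}{2}(D^2 g)(t, \mathfrak{X}(t))\bigl(d\mathfrak{X}(t), d\mathfrak{X}(t)\bigr) + g_t(t, \mathfrak{X}(t))\, dt,$


where $(D^2 g)(d\mathfrak{X}, d\mathfrak{X}) = \sum (\partial^2 g/\partial x_j \partial x_k)\, d\mathfrak{X}_j \cdot d\mathfrak{X}_k$ is computed, via (8.13), by
the rules


(8.15) $dt \cdot dt = dt \cdot d\omega_j = d\omega_j \cdot dt = 0, \quad d\omega_j \cdot d\omega_k = 2\delta_{jk}\, dt.$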
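There is also an integral formula for $g(t, \mathfrak{X}(t)) - g(t_0, \mathfrak{X}_0)$, parallel to (7.49):


(8.16) $g(t, \mathfrak{X}(t)) = g(t_0, \mathfrak{X}_0) + \int_{t_0}^t \Bigl(\frac{\partial^2 g}{\partial x_j \partial x_k}\Bigr) v_{j\ell} v_{k\ell}\, ds + \int_{t_0}^t g_t(s, \mathfrak{X}(s))\, ds + \int_{t_0}^t \frac{\partial g}{\partial x_j}\, (u_j\, ds + v_{j\ell}\, d\omega_\ell).$


Here, we sum over repeated indices. The formulas (7.49) and (7.50) cover the
special case $u = 0$, $v = I$. The proof of (8.16) is parallel to that of (7.49).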
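If we apply (8.14) to $g(x) = e^{\lambda x}$, $m = 1$, we obtain for


(8.17) $\mathfrak{Y}(t) = \exp\Bigl(\lambda \mathfrak{X}(t) - \lambda^2 \int_{t_0}^t |v(s,\omega)|^2\, ds\Bigr), \qquad \mathfrak{X}(t) = \int_{t_0}^t v(s,\omega) \cdot d\omega(s),$

the stochastic differential equation


(8.18) $d\mathfrak{Y} = \lambda \mathfrak{Y}\, v \cdot d\omega,$


generalizing the identity (7.51).
There is another important property that $\mathfrak{Y}(t)$, defined by (8.17), has in common
with $\mathfrak{z}_t(\omega) = e^{\lambda\omega(t) - \lambda^2 t}$.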


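Proposition 8.2. Given $v \in L^2(I, R(Q))$, with values in $\mathbb{R}^n$, the function $\mathfrak{Y}(t)$
defined by (8.17) is a supermartingale; that is, for $s \le t$,


(8.19) $E_x(\mathfrak{Y}(t)|\mathfrak{B}_s) \le \mathfrak{Y}(s)$, $W_x$-a.e. on $\mathfrak{P}_0$.


Proof. We treat the case $t_0 = 0$. First suppose $v_\nu$ is a simple function, constant
as a function of $t$ on intervals of the form $[\ell 2^{-\nu}, (\ell + 1)2^{-\nu})$, with values
in $R(Q_{\ell 2^{-\nu}})$, and $\mathfrak{Y}_\nu$ is given by (8.17), with $v = v_\nu$. We claim that $\mathfrak{Y}_\nu$ is a
martingale, that is,


(8.20) $E_x(\mathfrak{Y}_\nu(t)|\mathfrak{B}_s) = \mathfrak{Y}_\nu(s)$, for $s \le t$.


Suppose, for example, that $0 \le t < 2^{-\nu}$, so $v_\nu(s) = v_\nu(0)$, for $s \le t$. Now $v_\nu(0)$
is independent of $\omega(t) - \omega(s)$, so in this case


$E_x(\mathfrak{Y}_\nu(t)|\mathfrak{B}_s) = E_x\bigl(e^{\lambda v_\nu(0) \cdot [\omega(t) - x] - \lambda^2 t |v_\nu(0)|^2}\,\big|\,\mathfrak{B}_s\bigr)$
$\qquad = e^{\lambda v_\nu(0) \cdot [\omega(s) - x] - \lambda^2 s |v_\nu(0)|^2}\, E_x\bigl(e^{\lambda v_\nu(0) \cdot [\omega(t) - \omega(s)] - \lambda^2 (t - s)|v_\nu(0)|^2}\,\big|\,\mathfrak{B}_s\bigr),$


and the last conditional expectation is 1. A similar argument in the case $\ell 2^{-\nu} \le s \le t < (\ell + 1)2^{-\nu}$,
using (8.11), gives


$E_x(\mathfrak{Y}_\nu(t)|\mathfrak{B}_s) = \mathfrak{Y}_\nu(t_{\nu\ell})\, E_x\bigl(e^{\lambda v_{\nu\ell} \cdot [\omega(t) - \omega(t_{\nu\ell})] - \lambda^2 (t - t_{\nu\ell})|v_{\nu\ell}|^2}\,\big|\,\mathfrak{B}_s\bigr) = \mathfrak{Y}_\nu(s),$


where $t_{\nu\ell} = \ell 2^{-\nu}$, $v_{\nu\ell} = v_\nu(t_{\nu\ell})$. The identity (8.20), for general $s \le t$, follows
easily from this.

For general $v \in L^2(I, R(Q))$, we can take simple $v_\nu$ converging to $v$ in
the norm of this space, and then $\mathfrak{X}_\nu \to \mathfrak{X}$ in $C(I, R(Q))$, where
$\mathfrak{X}_\nu(t) = \int_0^t v_\nu(s,\omega) \cdot d\omega(s)$. Passing to a subsequence, we can assume (for fixed $s$, $t$)
that $\mathfrak{X}_\nu(s) \to \mathfrak{X}(s)$ and $\mathfrak{X}_\nu(t) \to \mathfrak{X}(t)$, $W_x$-a.e.; hence $\mathfrak{Y}_\nu(s) \to \mathfrak{Y}(s)$ and
$\mathfrak{Y}_\nu(t) \to \mathfrak{Y}(t)$, $W_x$-a.e. Then (8.19) follows, by Fatou’s lemma.

The case of general $t_0 \ge 0$ is easily obtained from this; one can extend $v(s,\omega)$
to be 0 for $0 \le s < t_0$.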


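Note in particular that $s = 0$ in (8.19) implies


(8.21) $E_x\Bigl(e^{\lambda \mathfrak{X}(t) - \lambda^2 \int_{t_0}^t |v(s,\omega)|^2\, ds}\Bigr) \le 1.$


Using Cauchy’s inequality, we deduce that


(8.22) $E_x\bigl(e^{(\lambda/2)\mathfrak{X}(t)}\bigr) \le \Bigl[E_x\Bigl(e^{\lambda^2 \int_{t_0}^t |v(s,\omega)|^2\, ds}\Bigr)\Bigr]^{1/2}.$


We get a similar estimate upon replacing $v(s,\omega)$ by $-v(s,\omega)$, which converts
$\mathfrak{X}(t)$ to $-\mathfrak{X}(t)$. Since $e^{|x|} \le e^{x} + e^{-x}$, we have (replacing $\lambda$ by $2\lambda$)


(8.23) $E_x\bigl(e^{\lambda|\mathfrak{X}(t)|}\bigr) \le 2\Bigl[E_x\Bigl(e^{4\lambda^2 \int_{t_0}^t |v(s,\cdot)|^2\, ds}\Bigr)\Bigr]^{1/2}.$


Compare with Lemma 7.4. Note that the convexity of the exponential function
implies


(8.24) $E_x\Bigl(e^{\int_{t_0}^t F(s,\omega)\, ds}\Bigr) \le \frac{1}{t - t_0}\int_{t_0}^t E_x\bigl(e^{(t - t_0)F(s,\cdot)}\bigr)\, ds.$


Therefore, (8.23) implies


(8.25) $E_x\bigl(e^{\lambda|\mathfrak{X}(t)|}\bigr) \le 2\Bigl[\frac{1}{t - t_0}\int_{t_0}^t E_x\bigl(e^{4\lambda^2 (t - t_0)|v(s,\cdot)|^2}\bigr)\, ds\Bigr]^{1/2} \le 2\max_{t_0 \le s \le t} \Bigl[E_x\bigl(e^{4\lambda^2 (t - t_0)|v(s,\cdot)|^2}\bigr)\Bigr]^{1/2}.$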
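If we expand $\mathfrak{Y}_\nu(t) = e^{\lambda \mathfrak{X}_\nu(t) - \lambda^2 \int_{t_0}^t |v_\nu(s,\omega)|^2\, ds}$ in powers of $\lambda$, the coefficient
of each $\lambda^j$ is a martingale. The coefficient of $\lambda^4$, for example, is


(8.26) $\frac{1}{24}|\mathfrak{X}_\nu(t)|^4 - \frac{1}{2}\mathfrak{X}_\nu(t)^2 \Bigl(\int_{t_0}^t |v_\nu(s,\omega)|^2\, ds\Bigr) + \frac{1}{2}\Bigl(\int_{t_0}^t |v_\nu(s,\omega)|^2\, ds\Bigr)^2.$


This has expectation zero; hence


(8.27) $\frac{1}{24} E_x\bigl(|\mathfrak{X}_\nu(t)|^4\bigr) \le \frac{1}{2} E_x\Bigl(\mathfrak{X}_\nu(t)^2 \int_{t_0}^t |v_\nu(s,\cdot)|^2\, ds\Bigr) \le \frac{1}{48} E_x\bigl(|\mathfrak{X}_\nu(t)|^4\bigr) + 48\, E_x\Bigl(\Bigl(\int_{t_0}^t |v_\nu(s,\cdot)|^2\, ds\Bigr)^2\Bigr),$

so

(8.28) $E_x\bigl(|\mathfrak{X}_\nu(t)|^4\bigr) \le 48^2\, E_x\Bigl(\Bigl(\int_{t_0}^t |v_\nu(s,\cdot)|^2\, ds\Bigr)^2\Bigr) \le (48|t - t_0|)^2\, \frac{1}{|t - t_0|}\int_{t_0}^t E_x\bigl(|v_\nu(s,\cdot)|^4\bigr)\, ds \le (48|t - t_0|)^2 \max_{t_0 \le s \le t} E_x\bigl(|v_\nu(s,\cdot)|^4\bigr),$


where the second inequality here uses convexity, as in (8.24). Again a use of
Fatou’s lemma yields for


(8.29) $\mathfrak{X}(t) = \int_{t_0}^t v(s,\omega) \cdot d\omega(s)$

the estimate

(8.30) $\|\mathfrak{X}(t)\|_{L^4(\mathfrak{P}_0)} \le (48|t - t_0|)^{1/2} \max_{t_0 \le s \le t} \|v(s,\cdot)\|_{L^4(\mathfrak{P}_0)}.$

Similarly we obtain, for $t_1 < t_2$,


(8.31) $\|\mathfrak{X}(t_1) - \mathfrak{X}(t_2)\|_{L^4(\mathfrak{P}_0)} \le C_1 |t_1 - t_2|^{1/2} \max_{t_1 \le s \le t_2} \|v(s,\cdot)\|_{L^4(\mathfrak{P}_0)},$


with $C_1 = \sqrt{48}$, when $\mathfrak{X}(t)$ is given by (8.29). If $\mathfrak{X}(t)$ is given more generally by
(8.12), we have


(8.32) $\|\mathfrak{X}(t_1) - \mathfrak{X}(t_2)\|_{L^4(\mathfrak{P}_0)} \le C_0 |t_1 - t_2| \max_{t_1 \le s \le t_2} \|u(s,\cdot)\|_{L^4(\mathfrak{P}_0)} + C_1 |t_1 - t_2|^{1/2} \max_{t_1 \le s \le t_2} \|v(s,\cdot)\|_{L^4(\mathfrak{P}_0)}.$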


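The martingale maximal inequality of Proposition 4.7 extends to submartingales,
but it is not obvious that it applies to the supermartingale $\mathfrak{Y}(t)$. However, it
does apply to $\mathfrak{Y}_\nu(t)$, so, for each $\nu \in \mathbb{Z}^+$, we have


(8.33) $W_x\Bigl(\Bigl\{\omega \in \mathfrak{P}_0 : \sup_{t \in I(t_0,t_1)} \Bigl[\mathfrak{X}_\nu(t) - \mathfrak{X}_\nu(t_0) - \lambda\int_{t_0}^t |v_\nu(s,\omega)|^2\, ds\Bigr] > \beta\Bigr\}\Bigr) \le e^{-\lambda\beta},$


where $I(t_0,t_1) = [t_0,t_1] \cap \mathbb{Q}$. It follows that


(8.34) $W_x\Bigl(\Bigl\{\omega \in \mathfrak{P}_0 : \sup_{t \in I(t_0,t_1)} |\mathfrak{X}_\nu(t) - \mathfrak{X}_\nu(t_0)| > \lambda\int_{t_0}^{t_1} |v_\nu(s,\omega)|^2\, ds + \beta\Bigr\}\Bigr) \le 2e^{-\lambda\beta}.$

Thus, if we have

(8.35) $\int_{t_0}^{t_1} |v_\nu(s,\omega)|^2\, ds \le \frac{\beta}{\lambda}$, for $\omega \in S$,


then


(8.36) $W_x\Bigl(S \cap \Bigl\{\omega \in \mathfrak{P}_0 : \sup_{t \in I(t_0,t_1)} |\mathfrak{X}_\nu(t) - \mathfrak{X}_\nu(t_0)| > 2\beta\Bigr\}\Bigr) \le 2e^{-\lambda\beta}.$

Now

(8.37) $W_x\Bigl(\Bigl\{\omega \in \mathfrak{P}_0 : \int_{t_0}^{t_1} |v_\nu(s,\omega)|^2\, ds > \frac{\beta}{\lambda}\Bigr\}\Bigr) \le \frac{\lambda}{\beta}\int_{t_0}^{t_1} E_x\bigl(|v_\nu(s,\cdot)|^2\bigr)\, ds.$


Taking $\beta = \delta$, $\lambda = 1/\delta^2$, we deduce that if


(8.38) $\int_{t_0}^{t_1} \|v_\nu(s,\cdot)\|^2_{L^2(\mathfrak{P}_0)}\, ds \le \delta^3\varepsilon,$


then


(8.39) $W_x\Bigl(\Bigl\{\omega \in \mathfrak{P}_0 : \sup_{t \in I(t_0,t_1)} |\mathfrak{X}_\nu(t) - \mathfrak{X}_\nu(t_0)| > 2\delta\Bigr\}\Bigr) \le \varepsilon + 2e^{-1/\delta}.$


Since $\mathfrak{X}_\nu(t)$ converges to $\mathfrak{X}(t)$ in measure, locally uniformly in $t$, we have


(8.40) $W_x\Bigl(\Bigl\{\omega \in \mathfrak{P}_0 : \sup_{t \in I(t_0,t_1)} |\mathfrak{X}(t) - \mathfrak{X}(t_0)| > 2\delta\Bigr\}\Bigr) \le \varepsilon + 2e^{-1/\delta}$


whenever

(8.41) $\int_{t_0}^{t_1} \|v(s,\cdot)\|^2_{L^2(\mathfrak{P}_0)}\, ds \le \delta^3\varepsilon.$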

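The estimate (8.40) enables us to establish the following important result.


Proposition 8.3. Let $I = [0,T]$. Given $v \in L^2(I, R(Q))$, so $\int_0^t v(s,\omega) \cdot d\omega(s) = \mathfrak{X}(t)$
belongs to $C(I, R(Q))$, you can define $\mathfrak{X}(t,\omega)$ so that $t \mapsto \mathfrak{X}(t,\omega)$ is continuous
in $t$, for $W_x$-a.e. $\omega$.


Proof. Start with any measurable function on $I \times \mathfrak{P}_0$ representing $\mathfrak{X}(t)$; call it
$\mathfrak{X}^\circ(t,\omega)$, so for each $t \in I$, $\mathfrak{X}^\circ(t,\cdot) = \mathfrak{X}(t)$, $W_x$-a.e. on $\mathfrak{P}_0$. Set $\mathfrak{X}(t,\omega) = \mathfrak{X}^\circ(t,\omega)$,
for $t \in I \cap \mathbb{Q}$. From (8.40)-(8.41) it follows that there is a set $N \subset \mathfrak{P}_0$
such that $W_x(N) = 0$ and $\sigma_\omega(t) = \mathfrak{X}(t,\omega)$ is uniformly continuous in $t \in I \cap \mathbb{Q}$
for each $\omega \in \mathfrak{P}_0 \setminus N$. Then, for $\omega \in \mathfrak{P}_0 \setminus N$, $t \in I \setminus \mathbb{Q}$, define $\mathfrak{X}(t,\omega)$ by continuity:


(8.42) $\mathfrak{X}(t,\omega) = \lim_{I \cap \mathbb{Q} \ni t_\nu \to t} \mathfrak{X}(t_\nu, \omega), \quad \omega \in \mathfrak{P}_0 \setminus N.$


If $\omega \in N$, define $\mathfrak{X}(t,\omega)$ arbitrarily.
To show that this works, it remains to check that, for each $t \in I$,

(8.43) $\mathfrak{X}(t,\cdot) = \mathfrak{X}(t)$, $W_x$-a.e. on $\mathfrak{P}_0$.


Indeed, since $\mathfrak{X}^\circ(t_\nu,\cdot) \to \mathfrak{X}(t)$ in $L^2$-norm, passing to a subsequence we have
$\mathfrak{X}^\circ(t_\nu,\cdot) \to \mathfrak{X}(t)$ $W_x$-a.e. Comparing with (8.42), we have (8.43).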


Exercises 


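1. Generalize (8.30) to show that $\mathfrak{X}(t) = \int_{t_0}^t v(s,\omega) \cdot d\omega(s)$ satisfies


$E_x\bigl(|\mathfrak{X}(t)|^{2k}\bigr) \le C_k |t - t_0|^{k-1} \int_{t_0}^t \|v(s,\cdot)\|^{2k}_{L^{2k}(\mathfrak{P}_0)}\, ds,$


for $k \in \mathbb{Z}^+$.
2. Given $\varphi \in L^2([0,\infty), R(Q))$, show that, for $t > s$,


$E_x\Bigl(\int_s^t \varphi(\tau,\omega) \cdot d\omega(\tau)\,\Big|\,\mathfrak{B}_s\Bigr) = 0.$


Deduce that the stochastic integral $\psi(t,\omega) = \int_0^t \varphi(s,\omega) \cdot d\omega(s)$ is a martingale, so
that, for $t > s$,


$E_x(\psi(t,\cdot)|\mathfrak{B}_s) = \psi(s,\cdot).$
3. Show that if $v(s,\omega)$ satisfies the hypotheses of Proposition 8.2, then the supermartingale
$\mathfrak{Y}(t)$ in (8.17) is a martingale if and only if


$E_x \mathfrak{Y}(t) = 1, \quad \forall\, t \ge 0.$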


9. Stochastic differential equations 
In this section we treat stochastic differential equations of the form

(9.1) $d\mathfrak{X} = b(t, \mathfrak{X})\, dt + \sigma(t, \mathfrak{X})\, d\omega, \quad \mathfrak{X}(t_0) = \mathfrak{X}_0.$


The function $\mathfrak{X}$ is an unknown function on $I \times \mathfrak{P}_0$, where $I = [t_0, T]$. We assume
$t_0 \ge 0$. As in the case of ordinary differential equations, we will use the Picard
iteration method, to obtain the solution $\mathfrak{X}$ as the limit of a sequence of approximate
solutions to (9.1), which we write as a stochastic integral equation:


(9.2) $\mathfrak{X}(t) = \mathfrak{X}_0 + \int_{t_0}^t b(s, \mathfrak{X}(s))\, ds + \int_{t_0}^t \sigma(s, \mathfrak{X}(s))\, d\omega(s) = \Phi\mathfrak{X}(t).$


The last identity defines the transformation $\Phi$, and we look for a fixed point of $\Phi$.
As usual, $\mathfrak{X}(t)$ is shorthand for $\mathfrak{X}(t,\omega)$. If $\omega$ is a Brownian path in $\mathbb{R}^n$, we can
let $\mathfrak{X}$ and $b(t,x)$ take values in $\mathbb{R}^m$ and let $\sigma(t,x)$ be an $m \times n$ matrix-valued
function.




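Let us assume that $\sigma(t,x)$ and $b(t,x)$ are continuous in their arguments and
satisfy

(9.3) $|b(t,x)| \le K_0(1 + |x|), \quad |b(t,x) - b(t,y)| \le L_0|x - y|,$
$\qquad |\sigma(t,x)| \le K_1(1 + |x|^2)^{1/2}, \quad |\sigma(t,x) - \sigma(t,y)| \le L_1|x - y|.$


We will use results of §8 to show that

(9.4) $\Phi : L^2(I, R(Q)) \to C(I, R(Q)),$

where, as in (8.7),

$C(I, R(Q)) = \{\varphi \in C(I, L^2(\mathfrak{P}_0, dW_0)) : \varphi(t) \in R(Q_t),\ \forall\, t \in I\},$


and $L^2(I, R(Q))$ is similarly defined. Note that $\mathfrak{X}(s)$ belongs to $R(Q_s)$ if
and only if $\mathfrak{X}(s)$ is (equal $W_0$-a.e. to) a $\mathfrak{B}_s$-measurable function on $\mathfrak{P}_0$, so if
$\mathfrak{X}(s) \in R(Q_s)$, then also $\sigma(s, \mathfrak{X}(s))$ and $b(s, \mathfrak{X}(s))$ belong to $R(Q_s)$. Thus
Proposition 8.1 applies to the second integral in (9.2), and if $\mathfrak{X}_0 \in R(Q_{t_0})$, we
have (9.4).

Applying (8.9) to estimate the second integral in (9.2), we have


(9.5) $\|\Phi\mathfrak{X}(t) - \mathfrak{X}_0\|^2_{L^2(\mathfrak{P}_0)} \le 2K_0^2 \Bigl(\int_{t_0}^t \bigl(1 + \|\mathfrak{X}(s)\|_{L^2(\mathfrak{P}_0)}\bigr)\, ds\Bigr)^2 + 4K_1^2 \int_{t_0}^t \bigl(1 + \|\mathfrak{X}(s)\|^2_{L^2(\mathfrak{P}_0)}\bigr)\, ds.$


Also (8.9) applies to an estimate of the second integral in


(9.6) $\Phi\mathfrak{X}(t) - \Phi\mathfrak{Y}(t) = \int_{t_0}^t \bigl[b(s, \mathfrak{X}(s)) - b(s, \mathfrak{Y}(s))\bigr]\, ds + \int_{t_0}^t \bigl[\sigma(s, \mathfrak{X}(s)) - \sigma(s, \mathfrak{Y}(s))\bigr]\, d\omega(s).$


We get


(9.7) $\|\Phi\mathfrak{X}(t) - \Phi\mathfrak{Y}(t)\|^2_{L^2(\mathfrak{P}_0)} \le 2L_0^2 \Bigl(\int_{t_0}^t \|\mathfrak{X}(s) - \mathfrak{Y}(s)\|_{L^2(\mathfrak{P}_0)}\, ds\Bigr)^2 + 4L_1^2 \int_{t_0}^t \|\mathfrak{X}(s) - \mathfrak{Y}(s)\|^2_{L^2(\mathfrak{P}_0)}\, ds.$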

To solve (9.2), we take $X_0(t, \omega) = X_0(\omega)$, the given initial value, and inductively
define $X_{j+1} = \Phi X_j$. Note that

(9.8)  $$ X_1(t, \omega) = X_0(\omega) + \int_{t_0}^t b(s, X_0(\omega))\,ds + \int_{t_0}^t \sigma(s, X_0(\omega))\,d\omega(s) $$

contains a stochastic integral of the form (7.14), provided $X_0(\omega)$ is constant. On
the other hand, the stochastic integral yielding $X_2(t, \omega)$ is usually not even of the
form (7.13), but rather of the more general form (8.2). The following estimate will
readily yield convergence of the sequence $X_j$.
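As a numerical illustration (not part of the argument), the iteration $X_{j+1} = \Phi X_j$ can be run on a single discretized Brownian path. The sketch below makes several assumptions that are not in the text: a scalar equation with the hypothetical coefficients $b(x) = -x$ and $\sigma(x) = 0.3\cos x$, left-endpoint sums in place of both integrals in (9.2), and increments of $\omega$ sampled with variance $2\,\Delta t$ (the normalization in which Brownian motion is generated by $\Delta$ rather than $\Delta/2$). The names `picard_iterates` and `euler_maruyama` are illustrative only.

```python
import numpy as np

def picard_iterates(b, sigma, x0, dt, dw, n_iter):
    """Return [X_0, X_1, ..., X_{n_iter}] sampled on the grid t_i = i*dt."""
    x = np.full(len(dw) + 1, x0, dtype=float)  # X_0(t) = x0 for all t
    iterates = [x]
    for _ in range(n_iter):
        # Left-endpoint sums for both integrals in (9.2), evaluated along X_j.
        increments = b(x[:-1]) * dt + sigma(x[:-1]) * dw
        x = x0 + np.concatenate(([0.0], np.cumsum(increments)))
        iterates.append(x)
    return iterates

def euler_maruyama(b, sigma, x0, dt, dw):
    """Step-by-step Euler-Maruyama solution on the same grid and path."""
    x = np.empty(len(dw) + 1)
    x[0] = x0
    for i in range(len(dw)):
        x[i + 1] = x[i] + b(x[i]) * dt + sigma(x[i]) * dw[i]
    return x

# Hypothetical Lipschitz coefficients, chosen only for illustration.
b = lambda x: -x
sigma = lambda x: 0.3 * np.cos(x)

rng = np.random.default_rng(0)
n_steps, T, x0 = 50, 1.0, 1.0
dt = T / n_steps
# Assumed normalization: increments of omega have variance 2*dt.
dw = rng.normal(0.0, np.sqrt(2.0 * dt), size=n_steps)

iterates = picard_iterates(b, sigma, x0, dt, dw, n_iter=n_steps)
# Sup-norm gap between successive iterates on this one path.
gaps = [np.max(np.abs(v - u)) for u, v in zip(iterates, iterates[1:])]
x_euler = euler_maruyama(b, sigma, x0, dt, dw)
```

In this discrete setting the $j$th iterate agrees with the Euler-Maruyama path at the grid points $t_0, \dots, t_j$, so `iterates[-1]` coincides with `x_euler` up to rounding; `gaps` records how quickly successive iterates approach each other on the sampled path.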


Lemma 9.1. For some $M = M(T) < \infty$, we have

(9.9)  $$ \|X_{j+1}(t) - X_j(t)\|_{L^2(\mathfrak{P}_0)}^2 \le \frac{(M|t - t_0|)^{j+1}}{(j+1)!}. $$


Proof. We establish this estimate inductively. For $j = 0$, we can use (9.5), with
$X = X_0$, and the $j = 0$ case of (9.9) follows. Assume that (9.9) holds for $j =
0, \dots, k-1$; we need to get it for $j = k$. To do this, apply (9.7) with $X = X_k$, $Y =
X_{k-1}$, to get

(9.10)  $$ \|X_{k+1}(t) - X_k(t)\|_{L^2(\mathfrak{P}_0)}^2 \le \frac{2L_0^2 M^k}{k!}\Bigl(\int_{t_0}^t |s - t_0|^{k/2}\,ds\Bigr)^2 + \frac{4L_1^2 M^k}{k!}\int_{t_0}^t |s - t_0|^k\,ds. $$

This is $\le (M|t - t_0|)^{k+1}/(k+1)!$ as long as $M$ is sufficiently large for (9.9) to
hold for $j = 0$ and also $M \ge 2L_0^2 \max(1, T) + 4L_1^2$.
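For completeness, the two integrals in (9.10) can be evaluated explicitly. The following is a sketch that reads the coefficients in (9.7) as $2L_0^2$ and $4L_1^2$ and writes $\tau = |t - t_0| \le T$ (the symbol $\tau$ is introduced here only for brevity).

```latex
\begin{aligned}
\frac{2L_0^2 M^k}{k!}\Bigl(\int_{t_0}^t |s-t_0|^{k/2}\,ds\Bigr)^2
  &= \frac{2L_0^2 M^k}{k!}\cdot\frac{4\,\tau^{k+2}}{(k+2)^2}
   \;\le\; 2L_0^2\,T\,\frac{M^k\,\tau^{k+1}}{(k+1)!},
   \qquad\text{since } (k+2)^2 \ge 4(k+1), \\
\frac{4L_1^2 M^k}{k!}\int_{t_0}^t |s-t_0|^{k}\,ds
  &= 4L_1^2\,\frac{M^k\,\tau^{k+1}}{(k+1)!}.
\end{aligned}
```

Adding the two lines gives the bound $(2L_0^2 T + 4L_1^2)\,M^k\tau^{k+1}/(k+1)!$, which is at most $(M\tau)^{k+1}/(k+1)!$ once $M \ge 2L_0^2 T + 4L_1^2$. Since $\sum_j (MT)^{(j+1)/2}/\sqrt{(j+1)!} < \infty$, the estimate (9.9) shows that $(X_j)$ is a Cauchy sequence in $C(I, L^2(\mathfrak{P}_0))$.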


These estimates immediately yield an existence theorem: 


Theorem 9.2. Given $0 \le t_0 < T < \infty$, $I = [t_0, T]$, if $b$ and $\sigma$ are continuous
on $I \times \mathbb{R}^m$ and satisfy the estimates (9.3), and if $X_0 \in R(Q_{t_0})$, then the equation
(9.2) has a unique solution $X \in C(I, R(Q))$.


Only the uniqueness remains to be demonstrated. But if $X$ and $Y$ are two such
solutions, we have $\Phi X = X$ and $\Phi Y = Y$, so (9.7) implies

$$ \|X(t) - Y(t)\|_{L^2(\mathfrak{P}_0)}^2 \le \text{right side of (9.7)}, $$

and a Gronwall argument implies $\|X(t) - Y(t)\|_{L^2} = 0$, for all $t \in I$.

Of course, the hypothesis that $b$ and $\sigma$ are continuous in $t$ can be weakened in
ways that are obvious from an examination of (9.4)-(9.7). Allowing $b$ and $\sigma$ to
be piecewise continuous in $t$, still satisfying (9.3), we can reduce (9.1) to the case
$t_0 = 0$, by setting $b(t, x) = 0$ and $\sigma(t, x) = 0$ for $0 \le t < t_0$.

If $X_0$ has higher integrability, so does the solution $X(t)$. To see this, in case
$X_0 \in L^4(\mathfrak{P}_0)$, we can exploit (8.26)-(8.30) to produce the following estimate,
parallel to (9.7):

|PX(t) — Y()IEaqp,) S 
t 4 
4 _— 
(9.11) sLi({ ||X(s) D(s)llzapo) 4s) 


t 
ea S48?) L4|t tol [ I|8(s) — D(s)Ilz4¢q3,) 4s- 
to 


Using this, assuming $X_0 \in L^4(\mathfrak{P}_0, dW_0)$, we can obtain the following analogue
of (9.9):

(9.12)  $$ \|X_{j+1}(t) - X_j(t)\|_{L^4(\mathfrak{P}_0)}^4 \le \frac{(M|t - t_0|^2)^{j+1}}{(j+1)!}, $$
for $M = M(T)$, on any interval $t \in [t_0, T]$. We have the following:


Proposition 9.3. Under the hypotheses of Theorem 9.2, if also $X_0 \in L^4(\mathfrak{P}_0,
dW_0)$, then $X \in C(I, L^4(\mathfrak{P}_0, dW_0))$.


More generally, one can establish that $X \in C(I, L^{2k}(\mathfrak{P}_0))$, provided $X_0 \in
L^{2k}(\mathfrak{P}_0)$, $k \ge 1$. The case $2k = 4$ enables us to prove part of the following
important result.


Proposition 9.4. The solution $X(t)$ to (9.2) given by Theorem 9.2 can be represented
as $X(t, \omega)$ such that, for $W_0$-a.e. $\omega \in \mathfrak{P}_0$, the map $t \mapsto X(t, \omega)$ is
continuous in $t$.


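Proof. First we assume $X_0 \in L^4(\mathfrak{P}_0, dW_0)$ and give a demonstration that is
somewhat parallel to that of Theorem 1.1. Given $\varepsilon > 0$, $\delta > 0$, and $s, t \in \mathbb{R}^+$
such that $|t - s| < \delta$, we estimate the probability that $|X(t) - X(s)| > \varepsilon$. We use
the estimate

(9.13)  $$ \|X(t) - X(s)\|_{L^4(\mathfrak{P}_0)}^4 \le C|t - s|^2, $$

$C = C(T)$, for $s, t \in [0, T]$, which follows (when $t > s$) from

(9.14)  $$ \|X(t) - X(s)\|_{L^4(\mathfrak{P}_0)}^4 \le C\Bigl(\int_s^t \|b(r, X(r))\|_{L^4}\,dr\Bigr)^4 + C(t - s)\int_s^t \|\sigma(r, X(r))\|_{L^4}^4\,dr, $$
together with the estimate $\|X(s)\|_{L^4} \le C(T)$. Consequently, given $s, t \in \mathbb{R}^+$,

(9.15)  $$ W_0\bigl(\{\omega \in \mathfrak{P}_0 : |X(t, \omega) - X(s, \omega)| > \varepsilon\}\bigr) \le \frac{C}{\varepsilon^4}|t - s|^2. $$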


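Now an argument parallel to that of Lemma 1.2 gives

(9.16)  $$ W_0\bigl(\{\omega \in \mathfrak{P}_0 : |X(t_1, \omega) - X(t_j, \omega)| > \varepsilon, \text{ for some } j = 2, \dots, \nu\}\bigr) \le 2\,r\Bigl(\frac{\varepsilon}{2}, \delta\Bigr), $$
when $\{t_1, \dots, t_\nu\}$ is any finite set of numbers in $\mathbb{Q}^+$ such that $0 \le t_1 < \cdots < t_\nu$
and $t_\nu - t_1 \le \delta$, where
(9.17)  $$ r(\varepsilon, \delta) = \min(1, C\delta^2\varepsilon^{-4}). $$

The function $r(\varepsilon, \delta)$ takes the place of $\rho(\varepsilon, \delta)$ in (1.23); as in (1.21), we have

(9.18)  $$ \frac{r(\varepsilon, \delta)}{\delta} \to 0, \quad \text{as } \delta \to 0, $$

for each $\varepsilon > 0$. From here, one shows just as in the proof of Theorem 1.1 that,
for some $Z \subset \mathfrak{P}_0$ such that $W_0(Z) = 0$, the map $t \mapsto X(t, \omega)$ is uniformly
continuous on $t \in \mathbb{Q}^+$, for each $\omega \in \mathfrak{P}_0 \setminus Z$. The rest of the proof of Proposition 9.4
can be carried out just like the proof of Proposition 8.3.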


We now give another demonstration of Proposition 9.4, not requiring $X_0$
to be in $L^4(\mathfrak{P}_0)$, but only in $L^2(\mathfrak{P}_0)$. In such a case, under the hypotheses,
and conclusions, of Theorem 9.2, we have $\sigma(t, X(t)) \in C(I, R(Q))$.
Hence Proposition 8.3 applies to the second integral in (9.2), so $U(t, \omega) =
\int_{t_0}^t \sigma(s, X(s))\,d\omega(s)$ can be represented as a continuous function of $t$, for $W_0$-a.e.
$\omega \in \mathfrak{P}_0$. Furthermore, we have $b(t, X(t)) \in C(I, L^2(\mathfrak{P}_0)) \subset C(I, L^1(\mathfrak{P}_0))$.
Thus, by Fubini's theorem, the first integral in (9.2) is absolutely integrable, hence
continuous in $t$, for $W_0$-a.e. $\omega$. This establishes the desired property for the left
side of (9.2).

We next investigate the dependence of the solution to (9.2) on the initial data
$X_0$, in a fashion roughly parallel to the method used in §6 of Chap. 1. Thus, let $Y$
solve

(9.19)  $$ Y(t) = Y_0 + \int_{t_0}^t b(s, Y(s))\,ds + \int_{t_0}^t \sigma(s, Y(s))\,d\omega(s). $$


Proposition 9.5. Assume that $b(t, x)$ and $\sigma(t, x)$ satisfy the hypotheses of
Theorem 9.2 and are also $C^1$ in $x$. If $X(t)$ and $Y(t)$ solve (9.2) and (9.19),
respectively, then

(9.20)  $$ \|X(t) - Y(t)\|_{L^2(\mathfrak{P}_0)} \le C(t, L_0, L_1)\,\|X_0 - Y_0\|_{L^2(\mathfrak{P}_0)}. $$


Proof. Consider $Z(t) = X(t) - Y(t)$, which satisfies the identity

(9.21)  $$ Z(t) = Z_0 + \int_{t_0}^t b'(s, X(s), Y(s))\,Z(s)\,ds + \int_{t_0}^t \sigma'(s, X(s), Y(s))\,Z(s)\,d\omega(s), $$
with $Z_0 = X_0 - Y_0$. Here

(9.22)  $$ b'(s, x, y) = \int_0^1 D_x b(s, ux + (1 - u)y)\,du, $$

so $b'(s, x, y)(x - y) = b(s, x) - b(s, y)$, and similarly

(9.23)  $$ \sigma'(s, x, y) = \int_0^1 D_x \sigma(s, ux + (1 - u)y)\,du. $$

We estimate the right side of (9.21) in $L^2(\mathfrak{P}_0)$. By (9.3),
$|b'(s, x, y)| \le L_0$, so

(9.24)  $$ \Bigl\|\int_{t_0}^t b'(s, X(s), Y(s))\,Z(s)\,ds\Bigr\|_{L^2} \le L_0\int_{t_0}^t \|Z(s)\|_{L^2}\,ds. $$

Since $|\sigma'(s, x, y)| \le L_1$ and $\sigma'(s, X(s), Y(s))Z(s) \in R(Q_s)$, we have

(9.25)  $$ \Bigl\|\int_{t_0}^t \sigma'(s, X(s), Y(s))\,Z(s)\,d\omega(s)\Bigr\|_{L^2}^2 \le L_1^2\int_{t_0}^t \|Z(s)\|_{L^2}^2\,ds. $$

Thus the identity (9.21) implies
(9.26)  $$ \|Z(t)\|_{L^2}^2 \le 3\|X_0 - Y_0\|_{L^2}^2 + 3\bigl[L_0^2(t - t_0) + L_1^2\bigr]\int_{t_0}^t \|Z(s)\|_{L^2}^2\,ds. $$

Now Gronwall's inequality applied to this estimate yields (9.20).
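To make the last step explicit, here is one way the Gronwall argument can be carried out. It is a sketch that uses only the shape of (9.26): $a = 3\|X_0 - Y_0\|_{L^2}^2$, and $K$ denotes any upper bound for the bracketed coefficient (with its factor 3) on $[t_0, T]$. The symbols $u$, $a$, and $K$ are introduced here and do not appear in the text.

```latex
u(t) := \|Z(t)\|_{L^2}^2, \qquad
u(t) \le a + K\int_{t_0}^t u(s)\,ds \quad (t_0 \le t \le T)
\;\Longrightarrow\;
u(t) \le a\,e^{K(t - t_0)} .
```

Taking square roots gives $\|X(t) - Y(t)\|_{L^2} \le \sqrt{3}\,e^{K(t - t_0)/2}\,\|X_0 - Y_0\|_{L^2}$, which is an estimate of the form (9.20).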


Note that (9.21) is a linear stochastic equation for $Z(t)$, of a form a little different
from (9.2), if $X(s)$ and $Y(s)$ are regarded as given. On the other hand, we
can regard $X$, $Y$, and $Z$ as solving together a system of stochastic equations, of
the same form as (9.2).

An important special case of (9.2) is the case $X_0 = x$, a given point of $\mathbb{R}^m$, so
let us look at $X^{x,s}(t)$, defined for $t \ge s$ as the solution to

(9.27)  $$ X^{x,s}(t) = x + \int_s^t b(r, X^{x,s}(r))\,dr + \int_s^t \sigma(r, X^{x,s}(r))\,d\omega(r). $$


In this case we have the following useful property, which is basically the Markov
property. Let $\mathfrak{B}_s^t$ denote the $\sigma$-algebra of subsets of $\mathfrak{P}_0$ generated by all sets of
the form

(9.28)  $$ \{\omega \in \mathfrak{P}_0 : \omega(t_1) - \omega(s_1) \in A\}, \quad s \le s_1 \le t_1 \le t, \quad A \subset \mathbb{R}^n \text{ Borel}, $$
plus all sets of $W_0$-measure zero.


Proposition 9.6. For any fixed $t \ge s$, the solution $X^{x,s}(t)$ to (9.27) is
$\mathfrak{B}_s^t$-measurable.


Proof. By the proof of Theorem 9.2, we have $X^{x,s}(t) = \lim_{k \to \infty} X_k(t)$, where
$X_0(t) = x$ and, for $k \ge 0$,

$$ X_{k+1}(t) = x + \int_s^t b(r, X_k(r))\,dr + \int_s^t \sigma(r, X_k(r))\,d\omega(r). $$

It follows inductively that each $X_k(t)$ is $\mathfrak{B}_s^t$-measurable, so the limit also has this
property.


The behavior of $X^{x,s}(t)$ will be important for the next section. We derive
another useful property here.


Proposition 9.7. For $s \le \tau \le t$, we have
(9.29)  $$ X^{x,s}(t, \omega) = X^{y,\tau}(t, \omega), \quad y = X^{x,s}(\tau, \omega), $$
for $W_0$-a.e. $\omega \in \mathfrak{P}_0$.


Proof. Let $Y(t)$ denote the right side of (9.29). Thus $Y(\tau) = X^{x,s}(\tau)$. The
stochastic equation satisfied by $X^{y,\tau}(t)$ then implies

$$ Y(t) = X^{x,s}(\tau) + \int_\tau^t b(r, Y(r))\,dr + \int_\tau^t \sigma(r, Y(r))\,d\omega(r). $$

Now (9.27) implies that $X^{x,s}(t)$ satisfies this same stochastic equation, for $t \ge \tau$.
The identity $Y(t) = X^{x,s}(t)$ a.e. on $\mathfrak{P}_0$ follows from the uniqueness part of
Theorem 9.2.


Exercises 
1. Show that the solution to
$$ dX = a(t)X(t)\,dt + b(t)X(t)\,d\omega(t), $$
in case $m = n = 1$, is given by

(9.30)  $$ X(t) = X(0)\exp\Bigl\{\int_0^t \bigl[a(s) - b(s)^2\bigr]\,ds + \int_0^t b(s)\,d\omega(s)\Bigr\} = X(0)\,e^{Z(t)}. $$

In this problem and the following one, $X(t)$ depends on $\omega$, but $a(t)$ and $b(t)$ do not
depend on $\omega$, nor do $f(t)$ and $g(t)$ below.




2. Show that the solution to
$$ dX(t) = [f(t) + a(t)X(t)]\,dt + [g(t) + b(t)X(t)]\,d\omega(t), $$

in case $m = n = 1$, is given by $X(t) = e^{Z(t)}Y(t)$, where $e^{Z(t)}$ is as in (9.30) and


Y(t) = X(0) + [ len f(s) - a(s)b(s)| ds + / “g(s)e2™ dus(s). 


0 


3. Consider the system
(9.31)  $$ dX(t) = [A(t)X(t) + f(t)]\,dt + g(t)\,d\omega(t), $$

where $A(t) \in \mathrm{End}(\mathbb{R}^m)$, $f(t) \in \mathbb{R}^m$, and $g(t) \in \mathrm{Hom}(\mathbb{R}^n, \mathbb{R}^m)$. Suppose $S(t, s)$ is
the solution operator to the linear $m \times m$ system of differential equations
$$ \frac{dy}{dt} = A(t)y, \quad S(t, t) = I, $$
as considered in Chap. 1, §5. Show that the solution to (9.31) is
$$ X(t) = S(t, 0)X(0) + \int_0^t S(t, s)f(s)\,ds + \int_0^t S(t, s)g(s)\,d\omega(s). $$
4. The following Langevin equation is more general than (7.52):
(9.32)  $$ x''(t) = -\nabla V(x(t)) - \beta x'(t) + \omega'(t). $$

Rewrite this as a first-order system of the form (9.1). Using Exercise 3, solve this equation
when $V(x)$ is the harmonic oscillator potential, $V(x) = ax^2$.
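A quick numerical sanity check of the formula in Exercise 1 is possible (it is not a substitute for the derivation the exercise asks for). The sketch below assumes constant coefficients $a$ and $b$, chosen arbitrarily, reads the exponent in (9.30) as $\int [a - b^2]\,ds + \int b\,d\omega$, and samples the increments of $\omega$ with variance $2\,\Delta t$, the normalization in which the generator of Brownian motion is $\Delta$. It compares an Euler-Maruyama approximation with the closed form along one sampled path.

```python
import numpy as np

# Illustrative constants (not from the text).
a, b, x0, T = 0.1, 0.2, 1.0, 1.0
n_steps = 20_000
dt = T / n_steps

rng = np.random.default_rng(1)
# Assumed normalization: increments of omega have variance 2*dt.
dw = rng.normal(0.0, np.sqrt(2.0 * dt), size=n_steps)

# Euler-Maruyama for dX = a X dt + b X d(omega): X_{n+1} = X_n (1 + a dt + b dw_n).
x_euler = x0 * np.prod(1.0 + a * dt + b * dw)

# Closed form for constant a, b: the exponent is (a - b^2) T + b * omega(T).
x_exact = x0 * np.exp((a - b**2) * T + b * dw.sum())

rel_err = abs(x_euler - x_exact) / x_exact
```

With this step size the relative discrepancy should be far smaller than $b^2 T / 2 = 0.02$, which is the size of the mismatch one would expect if the exponent contained $a - b^2/2$ instead of $a - b^2$ under this normalization.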


10. Application to equations of diffusion 


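Let $X^{x,s}(t)$ solve the stochastic equation

(10.1)  $$ X^{x,s}(t) = x + \int_s^t b(X^{x,s}(r))\,dr + \int_s^t \sigma(X^{x,s}(r))\,d\omega(r). $$

As in (9.2), $x$ and $b$ can take values in $\mathbb{R}^m$ and $\sigma$ values in $\mathrm{Hom}(\mathbb{R}^n, \mathbb{R}^m)$. We
want to study the transformations on functions on $\mathbb{R}^m$ defined by

(10.2)  $$ \Phi_s^t f(x) = E_0 f(X^{x,s}(t)), \quad 0 \le s \le t. $$
Clearly, $X^{x,s}(s) = x$, so

(10.3)  $$ \Phi_s^s f(x) = f(x). $$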


We assume $b(x)$ and $\sigma(x)$ are bounded and satisfy the Lipschitz conditions of
(9.3). For simplicity we have taken $b$ and $\sigma$ to be independent of $t$ in (10.1). We
claim this implies the following:

(10.4)  $$ \Phi_0^t f(x) = \Phi_s^{s+t} f(x), $$
for $s, t \ge 0$. In fact, it is clear that

(10.5)  $$ X^{x,s}(t + s, \omega) = X^{x,0}(t, \vartheta_s\omega), $$

where $\vartheta_s\omega(\tau) = \omega(\tau + s) - \omega(s)$, as in (4.11). The measure-preserving property
of the map $\vartheta_s : \mathfrak{P}_0 \to \mathfrak{P}_0$ then implies

$$ E_0 f(X^{x,0}(t, \vartheta_s\omega)) = E_0 f(X^{x,0}(t)) = \Phi_0^t f(x), $$
so we have established (10.4). Let us set
(10.6)  $$ P^t f(x) = \Phi_0^t f(x) = E_0 f(X^x(t)), $$

where for notational convenience we have set $X^x(t) = X^{x,0}(t)$.
We will study the action of $P^t$ on the Banach space $C_o(\mathbb{R}^m)$ of continuous
functions on $\mathbb{R}^m$ that vanish at infinity.


Proposition 10.1. For each $t \ge 0$,
(10.7)  $$ P^t : C_o(\mathbb{R}^m) \longrightarrow C_o(\mathbb{R}^m), $$
and $P^t$ forms a strongly continuous semigroup of operators on $C_o(\mathbb{R}^m)$.


Proof. If $f \in C_o(\mathbb{R}^m)$, then $f$ is uniformly continuous, that is, it has a modulus
of continuity:

(10.8)  $$ |f(x) - f(y)| \le \omega_f(|x - y|), $$

where $\omega_f(\delta)$ is a bounded, continuous function of $\delta$ such that $\omega_f(\delta) \to 0$ as
$\delta \to 0$. Then

(10.9)  $$ |P^t f(x) - P^t f(y)| \le E_0\,|f(X^x(t)) - f(X^y(t))| \le E_0\,\omega_f(|X^x(t) - X^y(t)|). $$

Now if $x$ is fixed and $y = x_\nu \to x$, then, for each $t \ge 0$, $X^x(t) - X^{x_\nu}(t) \to 0$
in $L^2(\mathfrak{P}_0)$, by Proposition 9.5. Hence $X^x(t) - X^{x_\nu}(t) \to 0$ in measure on $\mathfrak{P}_0$,
so the Lebesgue dominated convergence theorem implies that (10.9) tends to 0 as
$y \to x$. This shows that $P^t f \in C(\mathbb{R}^m)$ if $f \in C_o(\mathbb{R}^m)$.


To show that $P^t f(x)$ vanishes at infinity, for each $t \ge 0$, we note that, for
most $\omega \in \mathfrak{P}_0$ (in a sense that will be quantified below), $|X^x(t) - x| \le C\langle t\rangle$ if
$C$ is large, so if $f \in C_o(\mathbb{R}^m)$ and $|x|$ is large, then $f(X^x(t, \omega))$ is small for most
$\omega \in \mathfrak{P}_0$.
In fact, subtracting $x$ from both sides of (10.1) and estimating $L^2$-norms, we
have

(10.10)  $$ \|X^x(t) - x\|_{L^2(\mathfrak{P}_0)}^2 \le 2B^2t^2 + 2S^2t, \quad B = \sup|b|, \quad S = \sup|\sigma|. $$
Hence

(10.11)  $$ W_0\bigl(\{\omega \in \mathfrak{P}_0 : |X^x(t, \omega) - x| > \lambda\}\bigr) \le \frac{2B^2t^2 + 2S^2t}{\lambda^2}. $$

The mapping property (10.7) follows.
We next examine continuity in $t$. In fact, parallel to (10.9), we have

(10.12)  $$ |P^t f(x) - P^s f(x)| \le E_0\,\omega_f(|X^x(t) - X^x(s)|). $$

We know from §9 that $X^x(t) \in C(\mathbb{R}^+, L^2(\mathfrak{P}_0))$, and estimates from there readily
yield that the modulus of continuity can be taken to be independent of $x$. Then the
vanishing of (10.12), uniformly in $x$, as $s \to t$, follows as in the analysis of (10.9).


There remains the semigroup property, $P^sP^{t-s} = P^t$, for $0 \le s \le t$. By
(10.4), this is equivalent to $\Phi_0^s\Phi_s^t = \Phi_0^t$. To establish this, we will use the identity

(10.13)  $$ E_0\bigl(f(X^{x,s}(t)) \mid \mathfrak{B}_s\bigr) = E_0 f(X^{x,s}(t)) = \Phi_s^t f(x), $$

which is an immediate consequence of Proposition 9.6. If we replace $s$ by $\tau$ in
(10.13), and then replace $x$ by $X^{x,s}(\tau)$, with $s \le \tau \le t$, and use the identity

(10.14)  $$ X^{y,\tau}(t) = X^{x,s}(t), \quad y = X^{x,s}(\tau), $$
established in Proposition 9.7, we obtain
(10.15)  $$ E_0\bigl(f(X^{x,s}(t)) \mid \mathfrak{B}_\tau\bigr) = \Phi_\tau^t f(X^{x,s}(\tau)). $$
We thus have, for $s \le \tau \le t$,
(10.16)  $$ \begin{aligned} \Phi_s^\tau\Phi_\tau^t f(x) &= E_0\bigl(\Phi_\tau^t f(X^{x,s}(\tau)) \mid \mathfrak{B}_s\bigr) \\ &= E_0\bigl(E_0\bigl(f(X^{x,s}(t)) \mid \mathfrak{B}_\tau\bigr) \mid \mathfrak{B}_s\bigr) \\ &= E_0\bigl(f(X^{x,s}(t)) \mid \mathfrak{B}_s\bigr), \end{aligned} $$

and again using (10.14) we see that this is equal to the left side of (10.13), hence
to $\Phi_s^t f(x)$, as desired. This completes the proof of Proposition 10.1.
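The definition $P^t f(x) = E_0 f(X^x(t))$ in (10.6) lends itself to a Monte Carlo approximation. The following is a minimal sketch under stated assumptions: a scalar equation, Euler-Maruyama time stepping, and increments of $\omega$ with variance $2\,\Delta t$ (the normalization in which $b = 0$, $\sigma = 1$ gives $P^t = e^{t\Delta}$). The function name `semigroup_mc` and all parameter values are illustrative. It is checked on the case $b = 0$, $\sigma = 1$, $f(y) = e^{-y^2}$, where $e^{t\Delta}f(x) = (1 + 4t)^{-1/2}e^{-x^2/(1+4t)}$.

```python
import numpy as np

def semigroup_mc(f, b, sigma, x, t, n_paths=200_000, n_steps=50, seed=0):
    """Monte Carlo estimate of P^t f(x) = E f(X^x(t)) via Euler-Maruyama."""
    rng = np.random.default_rng(seed)
    dt = t / n_steps
    X = np.full(n_paths, float(x))
    for _ in range(n_steps):
        # Assumed normalization: increments of omega have variance 2*dt.
        dw = rng.normal(0.0, np.sqrt(2.0 * dt), size=n_paths)
        X = X + b(X) * dt + sigma(X) * dw
    return f(X).mean()

f = lambda y: np.exp(-y**2)            # a function vanishing at infinity
zero_drift = lambda y: np.zeros_like(y)
unit_sigma = lambda y: np.ones_like(y)

x, t = 0.5, 0.3
estimate = semigroup_mc(f, zero_drift, unit_sigma, x, t)
# Heat-semigroup value for this Gaussian f (generator Delta, not Delta/2).
exact = np.exp(-x**2 / (1 + 4 * t)) / np.sqrt(1 + 4 * t)
```

Because $0 \le f \le 1$, every such estimate automatically lies in $[0, 1]$, which reflects the contraction and positivity properties of $P^t$ at the level of the sample average.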


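We want to identify the infinitesimal generator of $P^t$. Assume now that $D^\alpha f$,
for $|\alpha| \le 2$, are bounded and continuous on $\mathbb{R}^m$. Then Ito's formula implies

(10.17)  $$ f(X^x(t)) = f(x) + \int_0^t \frac{\partial^2 f}{\partial x_j\,\partial x_k}\,\sigma_{j\ell}\sigma_{k\ell}\,dr + \int_0^t \frac{\partial f}{\partial x_j}\,(b_j\,dr + \sigma_{j\ell}\,d\omega_\ell), $$

using the summation convention. Let us apply $E_0$ to both sides. Now

(10.18)  $$ E_0\Bigl(\int_0^t \frac{\partial f}{\partial x_j}\,\sigma_{j\ell}\,d\omega_\ell\Bigr) = 0, $$

so we have

(10.19)  $$ P^t f(x) = f(x) + \int_0^t E_0\bigl(A_{jk}(X^x(r))\,\partial_j\partial_k f(X^x(r))\bigr)\,dr + \int_0^t E_0\bigl(b_j(X^x(r))\,\partial_j f(X^x(r))\bigr)\,dr, $$

where $A_{jk}$ in the first integral is given by
(10.20)  $$ A_{jk}(y) = \sum_\ell \sigma_{j\ell}(y)\sigma_{k\ell}(y), \quad y = X^x(r). $$

In matrix notation,
(10.21)  $$ A = \sigma\sigma^t. $$
We can take the $t$-derivative of the right side of (10.19), obtaining

(10.22)  $$ \frac{\partial}{\partial t}P^t f(x) = E_0\bigl(A_{jk}(X^x(t))\,\partial_j\partial_k f(X^x(t)) + b_j(X^x(t))\,\partial_j f(X^x(t))\bigr). $$

In particular,

(10.23)  $$ \frac{\partial}{\partial t}P^t f(x)\Big|_{t=0} = \sum_{j,k} A_{jk}(x)\,\partial_j\partial_k f(x) + \sum_j b_j(x)\,\partial_j f(x) = Lf(x), $$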
jk J 


where the last identity defines the second-order differential operator $L$, acting
on functions of $x$. This is known as Kolmogorov's diffusion equation. We have
shown that the infinitesimal generator of the semigroup $P^t$, acting on $C_o(\mathbb{R}^m)$, is
a closed extension of the operator

(10.24)  $$ L = \sum_{j,k} A_{jk}(x)\,\partial_j\partial_k + \sum_j b_j(x)\,\partial_j, $$

defined initially, let us say, on $C_0^2(\mathbb{R}^m)$.
It is clear from (10.6) that $\|P^t f\|_{L^\infty} \le \|f\|_{L^\infty}$ for each $f \in C_o(\mathbb{R}^m)$, so $P^t$
is a contraction semigroup on $C_o(\mathbb{R}^m)$. It is also clear that
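To see how the coefficients of $L$ are assembled from $\sigma$ and $b$, here is a small sketch that forms $A = \sigma\sigma^t$ as in (10.21) and applies $L$ to a test function using central finite differences. The function name `generator_apply`, the step size `h`, and the particular $\sigma$, $b$, and $f$ are assumptions made for illustration; note that no factor $1/2$ appears in front of the second-order term, in line with (10.24).

```python
import numpy as np

def generator_apply(f, b, sigma, x, h=1e-3):
    """Approximate Lf(x) = sum_jk A_jk d_j d_k f + sum_j b_j d_j f, A = sigma sigma^T."""
    x = np.asarray(x, dtype=float)
    m = x.size
    s = sigma(x)
    A = s @ s.T                      # (10.21)
    e = np.eye(m)
    # Central differences for the gradient.
    grad = np.array([(f(x + h * e[j]) - f(x - h * e[j])) / (2 * h)
                     for j in range(m)])
    # Central differences for the Hessian (mixed and pure second derivatives).
    hess = np.array([[(f(x + h * e[j] + h * e[k]) - f(x + h * e[j] - h * e[k])
                       - f(x - h * e[j] + h * e[k]) + f(x - h * e[j] - h * e[k]))
                      / (4 * h * h)
                      for k in range(m)] for j in range(m)])
    return float(np.sum(A * hess) + b(x) @ grad)

# Hypothetical coefficients on R^2 and a quadratic test function.
sigma = lambda x: np.array([[1.0, np.sin(x[1])], [0.0, 2.0]])
b = lambda x: np.array([-x[0], np.cos(x[0])])
f = lambda x: float(x @ x)           # f(x) = |x|^2

x = np.array([0.3, -0.7])
Lf = generator_apply(f, b, sigma, x)
# For f = |x|^2: Hessian = 2I and gradient = 2x, so Lf = 2 tr(A) + 2 b.x.
A = sigma(x) @ sigma(x).T
expected = 2.0 * np.trace(A) + 2.0 * float(b(x) @ x)
```

For a quadratic $f$ the finite differences are exact up to rounding, so `Lf` and `expected` should agree closely; for general $f$ the approximation error is of order $h^2$.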


(10.25)  $$ f \ge 0 \Longrightarrow P^t f \ge 0 \text{ on } \mathbb{R}^m, $$
that is, $P^t$ is "positivity preserving." For given $x \in \mathbb{R}^m$, $t \ge 0$, $f \mapsto P^t f(x)$ is a
positive linear functional on $C_o(\mathbb{R}^m)$. Hence there is a uniquely defined positive
Borel measure $\mu_{x,t}$ on $\mathbb{R}^m$, of mass $\le 1$, such that

(10.26)  $$ P^t f(x) = \int f(y)\,d\mu_{x,t}(y). $$

In fact, by the construction (10.6),

(10.27)  $$ \mu_{x,t} = (F_{x,t})_* W_0, $$

where $F_{x,t}(\omega) = X^x(t, \omega)$, and (10.27) means $\mu_{x,t}(U) = W_0(F_{x,t}^{-1}(U))$ for a
Borel set $U \subset \mathbb{R}^m$. This implies that, for each $x, t$, $\mu_{x,t}$ is a probability measure
on $\mathbb{R}^m$, since $|X^x(t)|$ is finite for $W_0$-a.e. $\omega \in \mathfrak{P}_0$.
We will use the notation

(10.28)  $$ P(s, x, t, U) = \mu_{x,t-s}(U), \quad 0 \le s \le t, \quad U \subset \mathbb{R}^m \text{ Borel}. $$

We can identify $P(s, x, t, U)$ with the probability that $X^{x,s}(t)$ is in $U$. We can
rewrite (10.26) as

(10.29)  $$ P^t f(x) = \int f(y)\,P(0, x, t, dy) $$
or
(10.30)  $$ \Phi_s^t f(x) = \int f(y)\,P(s, x, t, dy). $$

The semigroup property on $P^t$ implies

(10.31)  $$ P(s, x, t, U) = \int P(s, x, \tau, dy)\,P(\tau, y, t, U), \quad 0 \le s \le \tau \le t, $$

which is known as the Chapman-Kolmogorov equation.
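The Chapman-Kolmogorov equation can be checked numerically in the simplest case. The sketch below assumes $m = 1$, $b = 0$, $\sigma = 1$, so that $P(s, x, t, dy)$ has the Gaussian density $(4\pi(t-s))^{-1/2}e^{-(x-y)^2/4(t-s)}$ associated with $e^{t\Delta}$; the function name `heat_kernel`, the grid, and the sample points are illustrative choices, and the integral over $y$ is replaced by a Riemann sum.

```python
import numpy as np

def heat_kernel(t, x, y):
    """Density of the transition measure for e^{t Delta} in one dimension."""
    return np.exp(-(x - y) ** 2 / (4.0 * t)) / np.sqrt(4.0 * np.pi * t)

s, tau, t = 0.0, 0.4, 1.0      # intermediate time tau between s and t
x, z = 0.3, -0.8               # start point and end point
y = np.linspace(-30.0, 30.0, 60_001)
dy = y[1] - y[0]

# Left side: density of P(s, x, t, .) at z.
lhs = heat_kernel(t - s, x, z)
# Right side: integrate over the intermediate point y.
rhs = np.sum(heat_kernel(tau - s, x, y) * heat_kernel(t - tau, y, z)) * dy
```

The two numbers should agree to many digits, since the integrand is smooth and decays rapidly, which makes the Riemann sum very accurate.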

Let us denote by $\mathcal{L}$ the extension of (10.24) that is the infinitesimal generator
of $P^t$. If $V$ is a bounded, continuous function on $\mathbb{R}^m$, then $\mathcal{L} - V$ generates a
semigroup on $C_o(\mathbb{R}^m)$, and an application of the Trotter product formula similar
to that done in §2 yields

(10.32)  $$ e^{t(\mathcal{L} - V)} f(x) = E_0\Bigl(f(X^x(t))\,e^{-\int_0^t V(X^x(s))\,ds}\Bigr). $$

This furnishes an existence result for weak solutions to the initial-value
problem

(10.33)  $$ \frac{\partial u}{\partial t} = \sum A_{jk}(x)\,\partial_j\partial_k u + \sum b_j(x)\,\partial_j u - Vu, \quad u(0) = f \in C_o(\mathbb{R}^m), $$

under the hypotheses that $V$ is bounded and continuous, the coefficients $b_j$ are
bounded and uniformly Lipschitz, and $A_{jk}$ has the form (10.20), with $\sigma_{j\ell}$ bounded
and uniformly Lipschitz. As for the last property, we record the following fact:


Proposition 10.2. If $A(x)$ is a $C^2$ positive-semidefinite, matrix-valued function
on $\mathbb{R}^m$ with $D^\alpha A(x)$ bounded on $\mathbb{R}^m$ for $|\alpha| \le 2$, then there exists a
bounded, uniformly Lipschitz, matrix-valued function $\sigma(x)$ on $\mathbb{R}^m$ such that

$$ A(x) = \sigma(x)\sigma(x)^t. $$

This result is quite easy to prove in the elliptic case, that is, when for certain
$\lambda_j \in (0, \infty)$,
(10.34)  $$ \lambda_0|\xi|^2 \le \sum A_{jk}(x)\,\xi_j\xi_k \le \lambda_1|\xi|^2, $$

but a careful argument is required if $A(x)$ is allowed to degenerate. See the exercises
for more on this.
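In the elliptic case, one natural choice of $\sigma$ (an assumption made here; the construction is not spelled out in the text) is the symmetric positive square root $\sigma(x) = A(x)^{1/2}$, which depends smoothly on $A$ on the set of positive-definite matrices. A minimal sketch of the pointwise computation, with the illustrative name `symmetric_sqrt`:

```python
import numpy as np

def symmetric_sqrt(A):
    """Symmetric positive-semidefinite square root via eigendecomposition."""
    A = np.asarray(A, dtype=float)
    eigvals, eigvecs = np.linalg.eigh((A + A.T) / 2.0)
    eigvals = np.clip(eigvals, 0.0, None)   # guard against tiny negative rounding
    # Scale each eigenvector column by sqrt(eigenvalue), then rotate back.
    return (eigvecs * np.sqrt(eigvals)) @ eigvecs.T

# A sample positive-definite matrix (illustrative values).
A = np.array([[2.0, 0.5], [0.5, 1.0]])
sig = symmetric_sqrt(A)
```

Here `sig @ sig.T` reproduces `A` up to rounding. When $A(x)$ is allowed to degenerate, the square root is no longer smooth in general, which is why the Lipschitz property in Proposition 10.2 requires the more careful argument mentioned above.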

If $A_{jk}(x)$ has bounded, continuous derivatives of order $\le 2$, we can form the
formal adjoint of (10.24):

(10.35)  $$ L^t f = \sum \partial_j\partial_k\bigl(A_{jk}(x) f\bigr) - \sum \partial_j\bigl(b_j(x) f\bigr) = \tilde{L}f - \tilde{V}f, $$

where $\tilde{L}$ has the same second-order derivatives as $L$, though perhaps a different
first-order part, and $\tilde{V}(x) = -\sum \partial_j\partial_k A_{jk}(x) + \sum \partial_j b_j(x)$. Thus $\tilde{L}$ has an extension,
which we denote as $\tilde{\mathcal{L}}$, generating a contraction semigroup on $C_o(\mathbb{R}^m)$, with
the positivity-preserving property. Furthermore, $\tilde{\mathcal{L}} - \tilde{V}$ generates a semigroup on
$C_o(\mathbb{R}^m)$, and there is a formula for $e^{t(\tilde{\mathcal{L}} - \tilde{V})} f$ parallel to (10.32). Thus we obtain
a weak solution to the initial-value problem

(10.36)  $$ \frac{\partial u}{\partial t} = \sum \partial_j\partial_k\bigl(A_{jk}(x)u\bigr) - \sum \partial_j\bigl(b_j(x)u\bigr), \quad u(0) = f \in C_o(\mathbb{R}^m), $$

provided that $A_{jk}(x)$ satisfies the conditions of Proposition 10.2, and that each $b_j$
is bounded, with bounded, continuous first derivatives. Equation (10.36) is called
the Fokker-Planck equation.

To continue, we shall make a further simplifying hypothesis, namely that the
ellipticity condition (10.34) hold. We will also assume $A_{jk}(x)$ and $b_j(x)$ are
$C^\infty$, and that $D^\alpha A_{jk}(x)$ and $D^\alpha b_j(x)$ are bounded for all $\alpha$. In such a case,
$(g_{jk}) = (A_{jk})^{-1}$ defines a Riemannian metric on $\mathbb{R}^m$, and if $\Delta_g$ denotes its
Laplace operator, we have

(10.37)  $$ Lf = \Delta_g f + Xf, $$

for some smooth vector field $X = \sum \xi_j(x)\,\partial_j$, such that $D^\alpha\xi_j(x)$ is bounded for
$|\alpha| \le 1$. Note that if we use the inner product

(10.38)  $$ (f, g) = \int f(x)g(x)\,dV(x), $$

where $dV$ is the Riemannian volume element determined by the Riemannian
metric $g_{jk}$, then this puts the same topology on $L^2(\mathbb{R}^m)$ as the inner product
$\int f(x)g(x)\,dx$. We prefer the inner product (10.38), since $\Delta_g$ is then self-adjoint.
Now consider the closed operator $\mathcal{L}_2$ on $L^2(\mathbb{R}^m)$ defined by

(10.39)  $$ \mathcal{L}_2 f = Lf \quad \text{on } \mathcal{D}(\mathcal{L}_2) = H^2(\mathbb{R}^m). $$

It follows from results in Chap. 6, §2, that $\mathcal{L}_2$ generates a strongly continuous
semigroup $e^{t\mathcal{L}_2}$ on $L^2(\mathbb{R}^m)$. To relate this semigroup to the semigroup $P^t = e^{t\mathcal{L}}$
on $C_o(\mathbb{R}^m)$ described above, we claim that

(10.40)  $$ e^{t\mathcal{L}_2} f = e^{t\mathcal{L}} f, \quad \text{for } f \in C_0^\infty(\mathbb{R}^m). $$

To see this, let $u_0(t, x)$ and $u_1(t, x)$ denote the left and right sides, respectively.
These are both weak solutions to $\partial_t u_j = Lu_j$, for which one has regularity
results. Also, estimates discussed in §2 of Chap. 6 imply that $u_0(t, x)$ vanishes as
$|x| \to \infty$, locally uniformly in $t \in [0, \infty)$. Thus the maximum principle applies
to $u_0(t, x) - u_1(t, x)$, and we have (10.40). From here a simple limiting argument
yields

(10.41)  $$ e^{t\mathcal{L}_2} f = e^{t\mathcal{L}} f, \quad \text{for } f \in C_o(\mathbb{R}^m) \cap L^2(\mathbb{R}^m). $$


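Now the dual semigroup $(e^{t\mathcal{L}_2})^*$ is a strongly continuous semigroup on
$L^2(\mathbb{R}^m)$, with infinitesimal generator $\mathcal{L}_2^t$ defined by

(10.42)  $$ \mathcal{L}_2^t f = L^t f \quad \text{on } \mathcal{D}(\mathcal{L}_2^t) = H^2(\mathbb{R}^m), $$

where $L^t$ is given by (10.35). An argument parallel to that used to establish (10.41)
shows that

(10.43)  $$ (e^{t\mathcal{L}_2})^* f = e^{t\mathcal{L}_2^t} f = e^{t(\tilde{\mathcal{L}} - \tilde{V})} f, \quad \text{for } f \in C_o(\mathbb{R}^m) \cap L^2(\mathbb{R}^m). $$
On the other hand, $(P^t)^* = (e^{t\mathcal{L}})^*$ is a weak*-continuous semigroup of operators
on $\mathfrak{M}(\mathbb{R}^m)$, the space of finite Borel measures on $\mathbb{R}^m$; it is not strongly
continuous. Using (10.43), we see that

(10.44)  $$ (f, e^{t\mathcal{L}_2} g) = (e^{t(\tilde{\mathcal{L}} - \tilde{V})} f, g), \quad \text{for } f, g \in C_0^\infty(\mathbb{R}^m), $$

and bringing in (10.40) we have
(10.45)  $$ (e^{t\mathcal{L}})^* f = e^{t(\tilde{\mathcal{L}} - \tilde{V})} f, $$

for $f \in C_0^\infty(\mathbb{R}^m)$, hence for $f \in C_o(\mathbb{R}^m) \cap L^1(\mathbb{R}^m)$. From here one can deduce
that $(e^{t\mathcal{L}})^*$ preserves $L^1(\mathbb{R}^m)$ and acts as a strongly continuous semigroup on
this space.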

Let us return to the family of measures $P(s, x, t, \cdot)$. Under our current hypotheses,
regularity results for parabolic PDE imply that, for $s < t$, there is a smooth
function $p(s, x, t, y)$ such that

(10.46)  $$ P(s, x, t, U) = \int_U p(s, x, t, y)\,dy. $$
We have
(10.47)  $$ \Phi_s^t f(x) = \int f(y)\,p(s, x, t, y)\,dy, \quad s < t, $$
and
(10.48)  $$ (\Phi_s^t)^* f(y) = \int f(x)\,p(s, x, t, y)\,dx, \quad s < t. $$

Furthermore, we have for $p(s, x, t, y)$ the "backward" Kolmogorov equation

(10.49)  $$ \frac{\partial p}{\partial s} = -\sum_{j,k} A_{jk}(x)\,\frac{\partial^2 p}{\partial x_j\,\partial x_k} - \sum_j b_j(x)\,\frac{\partial p}{\partial x_j}, $$

and the Fokker-Planck equation

(10.50)  $$ \frac{\partial p}{\partial t} = \sum_{j,k} \frac{\partial^2}{\partial y_j\,\partial y_k}\bigl(A_{jk}(y)p\bigr) - \sum_j \frac{\partial}{\partial y_j}\bigl(b_j(y)p\bigr). $$



While we have restricted attention to the smooth elliptic case for the last set
of results, it is also interesting to relax the regularity required on the coefficients
as much as possible, and to let the coefficients depend on $t$, and also to allow
degeneracy. See [Fdln] and [StV] for more on this. Exercise 5 below illustrates
the natural occurrence of degenerate $L$.

We mention that, working with (10.32), we can obtain the solution to

(10.51)  $$ \frac{\partial u}{\partial t} = Lu, \ \text{for } t \ge 0,\ x \in \Omega, \qquad u(t, x) = 0, \ \text{for } x \in \partial\Omega, \qquad u(0, x) = f(x), $$

by considering a sequence $V_\nu \to \infty$ on $\mathbb{R}^m \setminus \Omega$, as in the analysis in §3, when $\Omega$
is an open domain in $\mathbb{R}^m$, with smooth boundary, or at least with the regularity
property used in Proposition 3.3. In analogy with (3.8), we get

(10.52)  $$ u(t, x) = E_0\bigl(f(X^x(t))\,\psi_\Omega(X^x, t)\bigr), $$
where

(10.53)  $$ \psi_\Omega(X^x, t) = \begin{cases} 1 & \text{if } X^x([0, t]) \subset \Omega, \\ 0 & \text{otherwise.} \end{cases} $$

The proof can be carried out along the same lines as in the proof of
Proposition 3.3, provided $\mathcal{L}_2$ (defined in (10.39)) is self-adjoint. Otherwise a
different approach is required. Also, when $\mathcal{L}_2$ is self-adjoint, the analysis leading
to Proposition 3.5 extends to (10.51), for any open $\Omega \subset \mathbb{R}^m$, with no boundary
regularity required. For other approaches to these matters, and also to the
Dirichlet problem for $Lu = f$ on $\Omega$, in both the elliptic and degenerate cases, see
[Fdln] and [Fr].

We end this section with a look at a special case of (10.1), namely when $\sigma = I$,
so we solve

(10.54)  $$ X^x(t) = x + \omega(t) + \int_0^t b(X^x(r))\,dr. $$

Assume as before that $b$ is bounded and uniformly Lipschitz. Then the analysis of
(10.6) done above implies

(10.55)  $$ e^{t(\Delta + X)} f(x) = E_0 f(X^x(t)), \quad X = \sum b_j(x)\,\partial_j. $$

On the other hand, in §7 we derived the formula

(10.56)  $$ e^{t(\Delta + X)} f(x) = E_x\bigl(f(\omega(t))\,e^{Z(t)}\bigr), $$

where

$$ Z(t) = \frac{1}{2}\int_0^t b(\omega(s)) \cdot d\omega(s) - \frac{1}{4}\int_0^t |b(\omega(s))|^2\,ds. $$

We conclude that the right-hand sides of (10.55) and (10.56) coincide. We can
restate this identity as follows. Given $x \in \mathbb{R}^n$, we have a map

(10.57)  $$ \Xi^x : \mathfrak{P}_0 \to \mathfrak{P}_0, \quad \Xi^x(\omega)(t) = X^x(t, \omega). $$

Then Wiener measure $W_0$ on $\mathfrak{P}_0$ gives rise to a measure $\Xi^x_* W_0$ on $\mathfrak{P}_0$, by
(10.58)  $$ \Xi^x_* W_0(S) = W_0\bigl((\Xi^x)^{-1}(S)\bigr). $$

For example, if $0 \le t_1 < \cdots < t_k$,

(10.59)  $$ \int_{\mathfrak{P}_0} F(\omega(t_1), \dots, \omega(t_k))\,d\Xi^x_* W_0 = E_0 F(X^x(t_1), \dots, X^x(t_k)). $$

Thus the identity of (10.55) and (10.56) can be written as
(10.60)  $$ \int f(\omega(t))\,d\Xi^x_* W_0 = \int f(\omega(t))\,e^{Z(t)}\,dW_x. $$

This is a special case of the following result of Cameron-Martin and Girsanov:


Proposition 10.3. Given $t \in (0, \infty)$, $\Xi^x_* W_0\big|_{\mathfrak{B}_t}$ is absolutely continuous with
respect to $W_x\big|_{\mathfrak{B}_t}$, with Radon-Nikodym derivative

(10.61)  $$ \frac{d\,\Xi^x_* W_0}{dW_x} = e^{Z(t)}. $$

Note that by taking $f_\nu \nearrow 1$ in (10.56), we have $E_x(e^{Z(t)}) = 1$, so the supermartingale
$e^{Z(t)}$ is actually a martingale in this case.
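The identity of (10.55) and (10.56), together with $E_x(e^{Z(t)}) = 1$, can be illustrated by simulation. The sketch below makes several assumptions not in the text: $n = 1$, the bounded Lipschitz drift $b = \tanh$, the test function $f = \cos$, Euler time stepping, and increments of $\omega$ with variance $2\,\Delta t$ (the normalization matching the factors $1/2$ and $1/4$ in $Z(t)$). The name `girsanov_check` is illustrative.

```python
import numpy as np

def girsanov_check(f, b, x, t, n_paths=100_000, n_steps=100, seed=0):
    """Compare E f(X^x(t)) with E_x[f(omega(t)) exp(Z(t))] by Monte Carlo."""
    rng = np.random.default_rng(seed)
    dt = t / n_steps
    X = np.full(n_paths, float(x))   # Euler scheme for (10.54)
    w = np.full(n_paths, float(x))   # Brownian path started at x
    Z = np.zeros(n_paths)            # running value of Z along w
    for _ in range(n_steps):
        # Assumed normalization: increments of omega have variance 2*dt.
        dw = rng.normal(0.0, np.sqrt(2.0 * dt), size=n_paths)
        bw = b(w)                    # left-endpoint (Ito) evaluation
        Z += 0.5 * bw * dw - 0.25 * bw**2 * dt
        X += b(X) * dt + dw
        w += dw
    weights = np.exp(Z)
    return f(X).mean(), (f(w) * weights).mean(), weights.mean()

lhs, rhs, mean_weight = girsanov_check(np.cos, np.tanh, x=0.2, t=1.0)
```

Here `lhs` estimates the right side of (10.55), `rhs` the right side of (10.56), and `mean_weight` estimates $E_x(e^{Z(t)})$. With Gaussian increments of variance $2\,\Delta t$, the discrete weight $\exp(\tfrac12 b\,\Delta\omega - \tfrac14 b^2\Delta t)$ is exactly the likelihood ratio between one Euler step with drift and one without, so the three quantities differ from their targets only by sampling error.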

To prove the proposition, it suffices to show that, for $0 \le t_1 < \cdots < t_k \le t$,
and a sufficiently large class of continuous functions $f_j$,

(10.62)  $$ E_0\bigl(f_1(X^x(t_1)) \cdots f_k(X^x(t_k))\bigr) = E_x\bigl(f_1(\omega(t_1)) \cdots f_k(\omega(t_k))\,e^{Z(t)}\bigr). $$

We will get this by extending (10.55) and (10.56) to formulas for the solution
operators to time-dependent equations of the form

(10.63)  $$ \frac{\partial u}{\partial t} = (\Delta + X)u - V(t, x)u, \quad u(0) = f. $$

Only the coefficient $V(t, x)$ depends on $t$; $X$ does not. Parallel to (2.16), we can
extend (10.55) to

(10.64)  $$ u(t) = E_0\Bigl(f(X^x(t))\,e^{-\int_0^t V(s, X^x(s))\,ds}\Bigr), $$
and we can extend (10.56) to
(10.65)  $$ u(t) = E_x\Bigl(e^{Z(t)} f(\omega(t))\,e^{-\int_0^t V(s, \omega(s))\,ds}\Bigr). $$

Now we can pick $V(s, x)$ to be highly peaked, as a function of $s$, near $s =
t_1, \dots, t_k$ in such a way as to get

(10.66)  $$ e^{-\int_0^t V(s, \omega(s))\,ds} \approx e^{-V_1(\omega(t_1))} \cdots e^{-V_k(\omega(t_k))}. $$

Thus having the identity of (10.64) and (10.65) for a sufficiently large class of
functions $V(s, x)$ can be seen to yield (10.62). We leave the final details to the
reader.

For further material on the Cameron-Martin-Girsanov formula (10.61), see
[Fr], [Kal], [McK], and [Øk].


Exercises 
1. As an alternative derivation of (10.13), namely,

$$ E_0\bigl(f(X^{x,s}(t)) \mid \mathfrak{B}_s\bigr) = P^{t-s} f(x), $$

via the Markov property, show that in light of the identity (10.5), it follows by applying
(4.12) to $E_0\bigl(f(X^x(t - s, \vartheta_s\omega)) \mid \mathfrak{B}_s\bigr)$.
2. Under the hypotheses of Proposition 10.1, show that, for $\lambda > 0$,

$$ E_0\bigl(e^{\lambda|X^x(t) - x|}\bigr) \le 2e^{\lambda^2 S^2 t + \lambda B t}. $$

(Hint: If $Z(t)$ denotes the last integral in (10.1), use (8.23) to estimate the quantity
$E_0\bigl(e^{\lambda|Z(t)|}\bigr)$.) Using this estimate in place of (10.10), get as strong a bound as you can
on the behavior of $P^t f(x)$, for fixed $t \in \mathbb{R}^+$, as $|x| \to \infty$, given $f \in C_0(\mathbb{R}^m)$, that is,
$f$ continuous with compact support.

3. Granted the hypotheses under which the identity (e*“)* = ef-V) on the space 
C.(R™) N L1(IR™) was established in (10.45), show that if P(t) denotes te)" 
restricted to L'(R™), then P(t) = P(t)* : L°(R™) > L®(R™) is given by the 
same formula as (10.6): 


$$ P(t) f(x) = E_0 f(X^x(t)), \quad f \in L^\infty(\mathbb{R}^m). $$

Show that
$$ P(s, x, t, U) = P(t - s)\chi_U(x). $$



4. Assume $A(x)$ is real-valued, $A \in C^2(\mathbb{R}^m)$, and $A(x) \ge 0$ for all $x$. Show that

$$ |\nabla A(x)|^2 \le 4A(x)\,\sup\Bigl\{|D^2A(y)| : |x - y| < \frac{2A(x)}{|\nabla A(x)|}\Bigr\}. $$

Use this to show that $\sqrt{A(x)}$ is uniformly Lipschitz on $\mathbb{R}^m$, establishing the scalar case
of Proposition 10.2. (Hint: Reduce to the case $m = 1$; show that if $A'(c) > 0$, then $A'$
must change by at least $A'(c)/2$ on an interval of length $\le 2A(c)/A'(c)$, to prevent $A$
from changing sign. Use the mean-value theorem to deduce $|A''(\zeta)| \ge |A'(c)|^2/4A(c)$
for some $\zeta$ in this interval.) For the general case of Proposition 10.2, see [Fdln], p. 189.

5. Suppose (10.1) is the system arising in Exercise 4 of §9, for $X = (x, v)$. Show that the
generator $L$ for $P^t$ is given by

(10.67)  $$ L = \frac{\partial^2}{\partial v^2} - \bigl(\beta v + V'(x)\bigr)\frac{\partial}{\partial v} + v\frac{\partial}{\partial x}. $$

6. Using methods produced in Chap. 8, §6, to derive Mehler's formula, compute the integral
kernel for $e^{tL}$ when $L$ is given by (10.67), with $V(x) = ax^2$.
Remark: This integral kernel is smooth for $t > 0$, reflecting the hypoellipticity of $\partial_t - L$.
This is a special case of a general phenomenon analyzed in [Ho]. A discussion of this
work can also be found in Chap. 15 of [T3].


11. Diffusion on Riemannian manifolds 


We extend the construction of Wiener measure done in §1 from the setting of path
space on $\mathbb{R}^n$ to that of path space on a family of complete Riemannian manifolds.
Thus, let $M$ be a complete Riemannian manifold, with Laplace-Beltrami operator
$\Delta$.

We begin with some preliminaries on the heat semigroup. As shown in §2
of Chapter 8, $\Delta$ is essentially self-adjoint on $C_0^\infty(M)$. Let $\Delta$ also denote the
self-adjoint extension. Then the heat semigroup $\{e^{t\Delta} : t \ge 0\}$ is a strongly
continuous semigroup on $L^2(M)$. As seen in Chapter 6, §2, this heat semigroup
has the positivity preserving property

(11.1)  $$ g \ge 0 \Longrightarrow e^{t\Delta} g \ge 0. $$
It also follows from the construction there that, for $g \in C_0^\infty(M)$,
(11.2)  $$ 0 \le g \le 1 \Longrightarrow e^{t\Delta} g \le 1. $$

Taking such $g$, we have, for $f \in C_0^\infty(M)$,

(11.3)  $$ f \ge 0 \Longrightarrow \int g(x)\,e^{t\Delta}f(x)\,dV(x) = \int \bigl(e^{t\Delta}g(x)\bigr) f(x)\,dV(x) \le \int f(x)\,dV(x), $$

and a limiting argument gives
(11.4)  $$ f \ge 0 \Longrightarrow \int_M e^{t\Delta}f(x)\,dV(x) \le \int_M f(x)\,dV(x). $$


We also have

(11.5)  $$ e^{t\Delta} f(x) = \int_M h(t, x, y)\,f(y)\,dV(y), $$

for $t > 0$, with $h(t, x, y)$ smooth, satisfying estimates such as (2.30) of Chapter
6, i.e.,


2 2 
(116)  O<A(t,2,y) < Crr(e, d)a(y, 9)(1 + ee) Pal ae 


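with $\mathrm{dist}(x, y) = \rho + 2\delta$, $k > n/4$. Furthermore, as shown in Proposition 2.4 of
Chapter 6, we have, for $f \in C_0^\infty(M)$,

(11.7)  $$ f \ge 0 \Longrightarrow \int e^{t\Delta}f(x)\,dV(x) = \int f(x)\,dV(x), $$

improving (11.4), if $M$ is complete and, for some $p \in M$, $a < \infty$,
(11.8)  $$ \mathrm{Vol}\,B_s(p) \le C_p\,e^{as^2}, \quad \forall\, s < \infty. $$
Note that (11.4) implies $e^{t\Delta}$ extends uniquely to
(11.9)  $$ e^{t\Delta} : L^1(M) \longrightarrow L^1(M), \quad \|e^{t\Delta}f\|_{L^1} \le \|f\|_{L^1}. $$
Hence, by duality and interpolation,
(11.10)  $$ e^{t\Delta} : L^p(M) \longrightarrow L^p(M), \quad 1 \le p \le \infty, \quad \|e^{t\Delta}f\|_{L^p} \le \|f\|_{L^p}. $$
Furthermore, as long as (11.7) holds,
(11.11)  $$ e^{t\Delta}1 = 1, \quad \forall\, t \ge 0. $$
The following is an important complement to (11.10).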


Proposition 11.1. Let $M$ be a complete Riemannian manifold. Then, for $p \in
[1, \infty)$, $\{e^{t\Delta} : t \ge 0\}$ is a strongly continuous semigroup on $L^p(M)$.


Proof. To start, we take $f, g \in C_0^\infty(M)$ and write

(11.12)  $$ (e^{t\Delta}f - f, g) = \int_0^t (e^{s\Delta}\Delta f, g)\,ds. $$

In this setting, $(e^{s\Delta}\Delta f, g)$ is continuous in $s \ge 0$, and we have the pointwise
bound

(11.13)  $$ |(e^{s\Delta}\Delta f, g)| \le \|\Delta f\|_{L^p}\|g\|_{L^{p'}}, \quad \text{for } 1 \le p \le \infty. $$
Hence

(11.14)  $$ |(e^{t\Delta}f - f, g)| \le t\,\|\Delta f\|_{L^p}\|g\|_{L^{p'}}, \quad 1 \le p \le \infty, $$

so

(11.15)  $$ \|e^{t\Delta}f - f\|_{L^p} \le t\,\|\Delta f\|_{L^p}, \quad 1 \le p \le \infty, $$

for all $f \in C_0^\infty(M)$. If $1 \le p < \infty$, $C_0^\infty(M)$ is dense in $L^p(M)$, and the uniform
operator bound (11.10) yields the asserted strong continuity on $L^p(M)$.


We also get a result valid for $p = \infty$. To state it, set $C_*(M) = C(M)$ if $M$ is
compact, and

(11.16)  $$ C_*(M) = \{f \in C(\hat{M}) : f(\infty) = 0\} $$

if $M$ is not compact, where $\hat{M}$ is the one point compactification of $M$. (Set $\hat{M} =
M$ if $M$ is compact.)


Proposition 11.2. In the setting of Proposition 11.1, we have for $u(t) = e^{t\Delta}f$
that

(11.17)  $$ f \in C_*(M) \Longrightarrow u \in C([0, \infty), L^\infty(M)). $$

Proof. The estimate (11.15) implies that if $f \in C_0^\infty(M)$, then $u$ is continuous at
$t = 0$, with values in $L^\infty(M)$. Since $C_0^\infty(M)$ is dense in $C_*(M)$ in $L^\infty$-norm
and $e^{t\Delta}$ has operator norm $\le 1$, we get

(11.18)  $$ f \in C_*(M) \Longrightarrow \|e^{t\Delta}f - f\|_{L^\infty} \to 0 \ \text{as } t \searrow 0. $$

To proceed, we take $t > 0$. We have, for $h > 0$, $f \in C_*(M)$,

(11.19)  $$ \|e^{(t+h)\Delta}f - e^{t\Delta}f\|_{L^\infty} = \|e^{t\Delta}(e^{h\Delta}f - f)\|_{L^\infty} \le \|e^{h\Delta}f - f\|_{L^\infty} \to 0 $$

as $h \searrow 0$. If also $h < t$,

(11.20)  $$ \|e^{t\Delta}f - e^{(t-h)\Delta}f\|_{L^\infty} = \|e^{(t-h)\Delta}(e^{h\Delta}f - f)\|_{L^\infty} \le \|e^{h\Delta}f - f\|_{L^\infty} \to 0. $$

This proves (11.17).


REMARK. If $M$ is compact, it is elementary that
(11.21)  $$ f \in C(M) \Longrightarrow u \in C([0, \infty), C(M)). $$

When $M$ is complete but not compact, desired improvements on Proposition
11.2 take more work. We prove one such result here. For simplicity, we assume
$M$ has $C^\infty$ bounded geometry. That is to say, there exists $r_0 > 0$ such that for
each $p \in M$ the exponential map $\mathrm{Exp}_p : T_pM \to M$ gives a diffeomorphism
of $B_{r_0}(0) \subset T_pM$ onto its image, and the pull back of the metric tensor on $M$
to $B_{r_0}(0)$ forms a $C^\infty$ bounded family of metric tensors, as $p$ varies over $M$. In
such a case, we can take $\delta = r_0/2$ and $\kappa(x, \delta)$ independent of $x$ in (11.6), giving


2 
(11.22) OS A(tx,y) <C(1+ eR tp?)k) eH, 
Also we have, not only (11.8), but actually an exponential volume bound,

(11.23)  $$ \mathrm{Vol}\,B_s(p) \le Ce^{Ks}, \quad \forall\, s < \infty, $$

with $C$ and $K$ independent of $p \in M$.


Proposition 11.3. Assume $M$ has $C^\infty$ bounded geometry. Then

(11.24)  $$ e^{t\Delta} : C_*(M) \longrightarrow C_*(M), \quad \text{strongly continuous semigroup.} $$
Proof. To get $e^{t\Delta} : C_*(M) \to C_*(M)$, it suffices to show that

(11.25)  $$ e^{t\Delta} : C_0^\infty(M) \longrightarrow C_*(M). $$

This is a straightforward consequence of (11.22). The strong continuity in (11.24)
then follows from (11.17).


Corollary 11.4. In the setting of Proposition 11.3,
(11.26)  $$ e^{t\Delta} : C(\hat{M}) \longrightarrow C(\hat{M}), \quad \text{strongly continuous semigroup.} $$

Proof. Given (11.23), we have $e^{t\Delta}1 = 1$, by (11.11), and (11.26) follows from
this plus (11.25).


Wiener measure. Parallel to §1, we pick $x_0 \in M$ and construct Wiener measure
$W_{x_0}$ as a probability measure on the "path space"

(11.27)  $$ \mathfrak{P} = \prod_{t \in \mathbb{Q}^+} \hat{M}, $$

where (as stated below (11.16)) $\hat{M}$ is the one point compactification of $M$ ($\hat{M} =
M$ if $M$ is compact). Then $\mathfrak{P}$ is a compact metrizable space. In order to construct
$W_{x_0}$, we construct a positive linear functional $E_{x_0} : C(\mathfrak{P}) \to \mathbb{R}$ on the space
$C(\mathfrak{P})$ of real valued continuous functions on $\mathfrak{P}$, satisfying $E_{x_0}(1) = 1$.

One motivation is that, given $0 < t_1 < \cdots < t_k$ (rational) and given Borel sets
$E_j \subset M$, we want to arrange that the probability that a path, starting at $x_0$ at time
$t = 0$, lies in $E_j$ at time $t_j$, for each $j \in \{1, \dots, k\}$ is

(11.28)  $$ \int_{E_1} \cdots \int_{E_k} h(t_k - t_{k-1}, x_k, x_{k-1}) \cdots h(t_1, x_1, x_0)\,dV(x_k) \cdots dV(x_1). $$

To implement this, we define E,, on the space C* consisting of continuous func- 
tions on $B of the form 


(11.29) p(w) = F(w(t1),...w(th)), ti<-+++ <tr, 


where Ff is continuous on ins M and t; € QT, to be 
(11.30) 


Exo (¥) = foo fi r(tr,cr,co)ite = trtast1) Nt = thas te) 


F(a1,...,0~) dV (ax) +++ dV (a1). 


If $\varphi(\omega)$ in (11.29) actually depends on $\omega(t_\nu)$ for some proper subset $\{t_\nu\}$ of
$\{t_1, \dots, t_k\}$, there arises a formula for $E_{x_0}(\varphi)$ with a different appearance from
(11.30). The fact that these two expressions are equal follows from the identity

(11.31) $\int_M h(t, x, y)\, h(s, y, z)\, dV(y) = h(t + s, x, z).$


From this it follows that $E_{x_0} : C^\# \to \mathbb{R}$ is well defined. It is also a positive linear
functional, satisfying $E_{x_0}(1) = 1$, this last identity thanks to (11.11).
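
As an illustrative check of the identity (11.31) in the simplest case, the following sketch takes M to be the real line with the Euclidean heat kernel $(4\pi t)^{-1/2} e^{-(x-y)^2/4t}$ and compares the two sides by quadrature. The choice of M, the function names, and the quadrature parameters are assumptions made for this example only; the text treats general complete M.

```python
import math

def heat_kernel(t, x, y):
    """Euclidean heat kernel on R: (4*pi*t)^(-1/2) * exp(-(x-y)^2 / (4t))."""
    return math.exp(-(x - y) ** 2 / (4.0 * t)) / math.sqrt(4.0 * math.pi * t)

def compose(t, s, x, z, half_width=30.0, steps=20000):
    """Trapezoid approximation of the integral over y of h(t,x,y) * h(s,y,z)."""
    dy = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        y = -half_width + i * dy
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * heat_kernel(t, x, y) * heat_kernel(s, y, z)
    return total * dy

# Left and right sides of (11.31) for one sample choice of (t, s, x, z).
print(compose(0.3, 0.7, 0.5, -1.2), heat_kernel(1.0, 0.5, -1.2))
```

The two printed numbers should agree to many digits, which is the consistency property that makes (11.30) independent of how the times are listed.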

Now, by the Stone-Weierstrass theorem, $C^\#$ is dense in $C(\mathfrak{P})$. It follows that,
for each $x_0 \in M$, $E_{x_0}$ has a unique continuous linear extension to $C(\mathfrak{P})$,
preserving these properties. We therefore have:


Theorem 11.5. Assume M is a complete Riemannian manifold, with heat semigroup
$e^{t\Delta}$, and that (11.7) holds (yielding (11.11)). Then, given $x_0 \in M$, there is
a unique probability measure $W_{x_0}$ on $\mathfrak{P}$ such that (11.30) is given by

(11.32) $E_{x_0}(\varphi) = \int_{\mathfrak{P}} \varphi(\omega)\, dW_{x_0}(\omega),$

for each $\varphi$ of the form (11.29) with F continuous on $\prod_1^k \hat M$. In such a case,
(11.30) and (11.32) then hold for $\varphi$ as in (11.29), for each bounded Borel function
F, and also for each positive Borel function F on $\prod_1^k \hat M$.
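Stochastic continuity. Let us set

(11.33) $X_t : \mathfrak{P} \to \hat M, \quad X_t(\omega) = \omega(t), \quad \text{for } t \in \mathbb{Q}^+.$

We want to investigate when $X_t$ and $X_s$ are “close,” for |s - t| small. To do this,
we define $\mathcal{M}_{x_0}(\mathfrak{P}, \hat M)$ to be the set of equivalence classes of Borel measurable
maps from $\mathfrak{P}$ to $\hat M$, with

(11.34) $\varphi \sim \psi$ provided $\varphi(\omega) = \psi(\omega)$ for $W_{x_0}$-a.e. $\omega$.

This is a metric space, with distance function

(11.35) $D_{x_0}(\varphi, \psi) = \int_{\mathfrak{P}} \tilde d(\varphi(\omega), \psi(\omega))\, dW_{x_0}(\omega),$

where we set

(11.36) $\tilde d(x, y) = \rho(d(x, y)), \quad \rho(r) = 1 - e^{-r}.$

If M is not compact, $\tilde d$ is discontinuous at $(\infty, \infty)$, but it is a Borel function on
$\hat M \times \hat M$. Now, if $s, t \in \mathbb{Q}^+$ and $0 < s < t$, then

(11.37) $D_{x_0}(X_t, X_s) = \int_{\mathfrak{P}} \tilde d(\omega(t), \omega(s))\, dW_{x_0}(\omega)$
$\qquad = \iint h(s, x_1, x_0)\, h(t - s, x_2, x_1)\, \tilde d(x_1, x_2)\, dV(x_1)\, dV(x_2)$
$\qquad = \int_M h(s, x_1, x_0)\, e^{(t-s)\Delta} \tilde d_{x_1}(x_1)\, dV(x_1),$

where we set

(11.38) $\tilde d_x(y) = \tilde d(x, y).$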


Hence

(11.39) $D_{x_0}(X_t, X_s) \le \sup_{x_1} e^{(t-s)\Delta} \tilde d_{x_1}(x_1).$

This yields the following result.

Proposition 11.6. Take M as in Theorem 11.5, and assume $s, t \in \mathbb{Q}^+$. Then

(11.40) $D_{x_0}(X_t, X_s) \le \vartheta(|s - t|),$

where, for t > 0,

(11.41) $\vartheta(t) = \sup_{x \in M} e^{t\Delta} \tilde d_x(x).$


We combine this with the following elementary result. 


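Lemma 11.7. The space $\mathcal{M}_{x_0}(\mathfrak{P}, \hat M)$ with distance function (11.35) is a
complete metric space.


Proof. If $(\varphi_j)$ is Cauchy in $\mathcal{M}_{x_0}(\mathfrak{P}, \hat M)$, it has a subsequence $(\psi_j)$ such that
$D_{x_0}(\psi_j, \psi_{j+1}) \le 4^{-j}$.
Then $\sum_{j \ge 0} \tilde d(\psi_j(\omega), \psi_{j+1}(\omega)) < \infty$ for $W_{x_0}$-a.e. $\omega$. Hence $(\psi_j)$ converges
pointwise a.e., say $\psi_j \to \psi$. The dominated convergence theorem implies
$D_{x_0}(\psi_j, \psi) \to 0$. Hence $D_{x_0}(\varphi_j, \psi) \to 0$.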


With this in hand, we have the following. 
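Proposition 11.8. Take M as in Theorem 11.5. Assume in addition that

(11.42) $\vartheta(t) \to 0$ as $t \searrow 0$,

for $\vartheta(t)$ defined as in (11.41). Then the map $t \mapsto X_t$ defined above for $t \in \mathbb{Q}^+$
has a unique continuous extension to

(11.43) $\mathbb{R}^+ \longrightarrow \mathcal{M}_{x_0}(\mathfrak{P}, \hat M), \quad t \mapsto X_t.$

One has (11.40) for all $s, t \in \mathbb{R}^+$.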


Note that if $M = \mathbb{R}^n$, then (11.41) yields

(11.44) $\vartheta(t) \le (4\pi t)^{-n/2} \int_{\mathbb{R}^n} e^{-|x|^2/4t}\, |x|\, dx = C t^{1/2}.$
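
To make the bound (11.44) concrete, here is a small numerical sketch for n = 1. It assumes the bounded distance in (11.36) is built from $\rho(r) = 1 - e^{-r}$ (our reading of that formula), so that the quantity in (11.41) is the Gaussian average of $1 - e^{-|y|}$, and it compares this with the first absolute moment, whose exact value in one dimension is $2\sqrt{t/\pi}$. The function names and quadrature parameters are hypothetical.

```python
import math

def heat_average(g, t, half_width=40.0, steps=40000):
    """Trapezoid rule for the integral over R of (4*pi*t)^(-1/2) exp(-y^2/(4t)) g(|y|) dy."""
    dy = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        y = -half_width + i * dy
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * math.exp(-y * y / (4.0 * t)) * g(abs(y))
    return total * dy / math.sqrt(4.0 * math.pi * t)

def theta(t):
    """Gaussian average of rho(|y|) with rho(r) = 1 - exp(-r) (assumed form of (11.36))."""
    return heat_average(lambda r: 1.0 - math.exp(-r), t)

def first_moment(t):
    """Gaussian average of |y|; the closed form in one dimension is 2*sqrt(t/pi)."""
    return heat_average(lambda r: r, t)

for t in (0.01, 0.1, 1.0):
    print(t, theta(t), first_moment(t), 2.0 * math.sqrt(t / math.pi))
```

Since $1 - e^{-r} \le r$ for $r \ge 0$, the first column of values never exceeds the second, and the second scales like $t^{1/2}$.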


Here is a natural extension. 


Proposition 11.9. Assume the Riemannian manifold M has $C^\infty$ bounded geometry.
Then $\vartheta(t)$, defined by (11.41), satisfies

(11.45) $\vartheta(t) \le C(M)\, t^{1/2}.$

Recall that the notion of M having $C^\infty$ bounded geometry was defined right
below (11.21).

Note that (11.44) is much stronger than what is needed to verify (11.45) in the
case $M = \mathbb{R}^n$. More generally we have, for $p \in [1, \infty)$,

(11.46) $(4\pi t)^{-n/2} \int_{\mathbb{R}^n} e^{-|x|^2/4t}\, |x|^p\, dx = C t^{p/2}.$
This motivates the following. For $p \in [1, \infty)$, set

(11.47) $D^p_{x_0}(\varphi, \psi) = \Bigl[ \int_{\mathfrak{P}} d(\varphi(\omega), \psi(\omega))^p\, dW_{x_0}(\omega) \Bigr]^{1/p}.$

Note that here we use d(x, y) rather than $\tilde d(x, y)$. In place of (11.37) we have, for
$s, t \in \mathbb{Q}^+$, s < t,

(11.48) $D^p_{x_0}(X_t, X_s)^p = \int_{\mathfrak{P}} d(\omega(t), \omega(s))^p\, dW_{x_0}(\omega) = \int_M h(s, x_1, x_0)\, e^{(t-s)\Delta} d^p_{x_1}(x_1)\, dV(x_1),$

hence

(11.49) $D^p_{x_0}(X_t, X_s) \le \Bigl[ \sup_x e^{(t-s)\Delta} d^p_x(x) \Bigr]^{1/p},$

where

(11.50) $d^p_x(y) = d(x, y)^p, \quad e^{t\Delta} d^p_x(x) = \int_M h(t, x, y)\, d(x, y)^p\, dV(y).$

This establishes the following counterpart of Proposition 11.6.

Proposition 11.10. In the setting of Proposition 11.6, for $s, t \in \mathbb{Q}^+$, $1 \le p < \infty$,

(11.51) $D^p_{x_0}(X_t, X_s) \le \vartheta_p(|s - t|),$

where, for t > 0,

(11.52) $\vartheta_p(t) = \Bigl[ \sup_{x \in M} e^{t\Delta} d^p_x(x) \Bigr]^{1/p}.$


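To proceed, we investigate the t-dependence, in various function spaces, of

(11.53) $F_t(\omega) = f(X_t(\omega)),$

given $f : M \to \mathbb{R}$. As a first estimate of this sort, we assume f is Lipschitz on M
and use

(11.54) $|F_t(\omega) - F_s(\omega)| \le \operatorname{Lip}(f)\, d(X_t(\omega), X_s(\omega)).$

Taking the pth power and integrating, we have

(11.55) $\int_{\mathfrak{P}} |F_t(\omega) - F_s(\omega)|^p\, dW_{x_0}(\omega) \le \operatorname{Lip}(f)^p \int_{\mathfrak{P}} d(X_t(\omega), X_s(\omega))^p\, dW_{x_0}(\omega),$

hence

(11.56) $\|F_t - F_s\|_{L^p(\mathfrak{P}, W_{x_0})} \le \operatorname{Lip}(f)\, D^p_{x_0}(X_t, X_s),$

for $p \in [1, \infty)$. With this, we can establish the following.


Proposition 11.11. Assume M has the property that $\vartheta_p(t) \to 0$ as $t \searrow 0$. Take
$f \in C(\hat M)$, and define $F_t$ by (11.53). Then $F_t$ is a continuous function of
$t \in [0, \infty)$ with values in $L^p(\mathfrak{P}, W_{x_0})$.


Proof. Given $\varepsilon > 0$, write $f = f_0 + f_1$, with $f_0 \in \operatorname{Lip}(M)$, $\sup |f_1| \le \varepsilon$. Then

(11.57) $\|F_t - F_s\|_{L^p(\mathfrak{P}, W_{x_0})} \le \operatorname{Lip}(f_0)\, \vartheta_p(|s - t|) + 2 \sup |f_1|.$

Hence

(11.58) $\limsup_{t \to s} \|F_t - F_s\|_{L^p(\mathfrak{P}, W_{x_0})} \le 2\varepsilon, \quad \forall\, \varepsilon > 0,$

yielding the result.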


Feynman-Kac formula. We extend some of the results of §2 to the manifold
setting. Thus, consider

(11.59) $\frac{\partial u}{\partial t} = \Delta u - V u, \quad u(0) = f,$

with

(11.60) $f, V \in C(\hat M).$

We assume M satisfies the hypotheses of Theorem 11.5 and Proposition 11.10,
and that, for all $p \in [1, \infty)$, $\vartheta_p(t) \to 0$ as $t \searrow 0$, so Proposition 11.11 holds. As
in §2, we use the Trotter product formula to write

(11.61) $e^{t(\Delta - V)} f = \lim_{k \to \infty} \bigl( e^{(t/k)\Delta} e^{-(t/k)V} \bigr)^k f,$

with

(11.62) $\bigl( e^{(t/k)\Delta} e^{-(t/k)V} \bigr)^k f(x) = \int_{\mathfrak{P}} \varphi_k(\omega)\, dW_x(\omega) = E_x(\varphi_k),$

where

(11.63) $\varphi_k(\omega) = f(X_t(\omega))\, e^{-S_k(\omega)},$

and

(11.64) $S_k = \frac{t}{k} \sum_{j=1}^{k} F_{jt/k}, \quad F_\tau = V(X_\tau).$

Proposition 11.11 applies, to give

(11.65) $F_\tau$ continuous in $\tau$, with values in $L^p(\mathfrak{P}, W_x)$.

It follows that

(11.66) $S_k \longrightarrow \int_0^t V(X_\tau)\, d\tau$ in $L^p(\mathfrak{P}, W_x)$ norm, $\forall\, p < \infty$.

We also have pointwise bounds:

(11.67) $\sup_{\mathfrak{P}} |S_k| \le t \sup_M |V|.$

We hence have

(11.68) $\varphi_k \longrightarrow f(X_t)\, e^{-\int_0^t V(X_\tau)\, d\tau}$ in $L^q(\mathfrak{P}, W_x)$ norm, $\forall\, q < \infty$.

This gives the following conclusion.


Proposition 11.12. Assume M satisfies the hypotheses of Theorem 11.5 and
Proposition 11.10 and that $\vartheta_p(t) \to 0$ as $t \searrow 0$, for all $p \in [1, \infty)$. Assume
f and V satisfy (11.60). Then, for each $x \in M$, t > 0,

(11.69) $e^{t(\Delta - V)} f(x) = E_x \Bigl( f(X_t)\, e^{-\int_0^t V(X_\tau)\, d\tau} \Bigr).$
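
As a quick sanity check on the Feynman-Kac formula (11.69), consider the special case of a constant potential $V \equiv c$ (an assumption made only for this illustration). The Riemann sums in the Trotter approximation are then exactly $ct$, and the formula should reduce to the commuting-operator identity $e^{t(\Delta - c)} = e^{-ct} e^{t\Delta}$. Using $E_x(f(X_t)) = e^{t\Delta} f(x)$, which is (11.30) with a single time, one gets:

```latex
V \equiv c \;\Longrightarrow\;
E_x\Bigl( f(X_t)\, e^{-\int_0^t c\, d\tau} \Bigr)
  = e^{-ct}\, E_x\bigl( f(X_t) \bigr)
  = e^{-ct}\, e^{t\Delta} f(x)
  = e^{t(\Delta - c)} f(x).
```

The last equality holds because multiplication by the constant c commutes with $\Delta$.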


Diffusion with drift. We turn to semigroups arising as solution operators to

(11.70) $\frac{\partial u}{\partial t} = \Delta u + X u,$

where $\Delta$ is the Laplace-Beltrami operator on a complete Riemannian manifold M
and X is a real vector field on M. We start with the assumption that X is bounded
and continuous, say $\|X\| \le C$. In such a case, if $g \in C_0^\infty(M)$,

(11.71) $\|Xg\|_{L^2}^2 \le C^2 \|\nabla g\|_{L^2}^2 = C^2 (-\Delta g, g) \le C^2 \|\Delta g\|_{L^2} \|g\|_{L^2}.$

Since completeness implies $C_0^\infty(M)$ is dense in $D(\Delta)$, we have (11.71) for all
$g \in D(\Delta)$, hence, for $f \in L^2(M)$, t > 0,

(11.72) $\|X e^{t\Delta} f\|_{L^2}^2 \le C^2 \|\Delta e^{t\Delta} f\|_{L^2} \|e^{t\Delta} f\|_{L^2} \le C^2 t^{-1} \|f\|_{L^2}^2.$

By the result of Exercise 10 in §9 of Appendix A (Outline of Functional Analysis),
we have:


Proposition 11.13. If M is complete and X is a real vector field that is continuous
and bounded, then $D(X) \supset D(\Delta)$ and $\Delta + X$, with domain $D(\Delta)$, generates
a strongly continuous semigroup on $L^2(M)$.

To proceed, we augment our hypotheses on X:

(11.73) X is $C^1$, with a bound $\|\nabla X\| \le L$.


Hence X generates a global flow $F^t$ by $C^1$ diffeomorphisms on M, giving rise to
a group of operators,

(11.74) $e^{tX} f(x) = f(F^t x).$

Note that, for $g \in C_0^\infty(M)$,

(11.75) $v(t, x) = e^{tX} g(x) \Longrightarrow \partial_t v = X v,$

hence

(11.76) $\frac{d}{dt} \int_M g(F^t x)\, dV(x) = \int_M X v(t, x)\, dV(x) = -\int_M (\operatorname{div} X)(x)\, g(F^t x)\, dV(x),$

with div X defined as in Chapter 2, §2. In addition, for $a \in \mathbb{R}$,

(11.77) $\frac{d}{dt}\, e^{at} \int_M g(F^t x)\, dV(x) = e^{at} \int_M (a - \operatorname{div} X)\, g(F^t x)\, dV(x),$

hence, for $g \in C_0^\infty(M)$, $g \ge 0$, $t \ge 0$,

(11.78) $a - \operatorname{div} X \le 0 \Longrightarrow e^{at} \int_M g(F^t x)\, dV \le \int_M g(x)\, dV.$

A limiting argument gives this for all non-negative $g \in L^1(M)$. Taking
$f \in L^p(M)$, $g = |f|^p$, we have:


Proposition 11.14. Assume X is a bounded vector field on M satisfying (11.73).
Pick $a \in \mathbb{R}$ such that

(11.79) $a - \operatorname{div} X \le 0.$

Then, for $p \in [1, \infty)$,

(11.80) $\|e^{tX} f\|_{L^p} \le e^{-(a/p)t} \|f\|_{L^p}, \quad \forall\, t \ge 0,$

and $B = X + (a/p)I$ generates a strongly continuous contraction semigroup on
$L^p(M)$.


We can now apply the Trotter product formula, as presented in Appendix A to 
this chapter, to establish the following. 


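Proposition 11.15. Under the hypotheses of Propositions 11.13-11.14, we have
that $\Delta + X + (a/2)I$ generates a contraction semigroup on $L^2(M)$ and, for $t \ge 0$,

(11.81) $e^{t(\Delta + X)} f = \lim_{k \to \infty} \bigl( e^{(t/k)\Delta} e^{(t/k)X} \bigr)^k f$, in $L^2$-norm,

for all $f \in L^2(M)$.


Note that, for $f \in L^2(M)$, $t \ge 0$,
$0 \le f \le 1 \Rightarrow 0 \le e^{(t/k)\Delta} f \le 1$ and $0 \le e^{(t/k)X} f \le 1$, so

(11.82) $0 \le f \le 1 \Longrightarrow 0 \le e^{t(\Delta + X)} f \le 1.$

Let us denote the operator $\bigl( e^{(t/k)\Delta} e^{(t/k)X} \bigr)^k$ on the right side of (11.81) by $P_k(t)$.
We have, for $p \in [1, \infty)$,

(11.83) $f \in L^1(M) \cap L^\infty(M) \Longrightarrow \|e^{t\Delta} f\|_{L^p} \le \|f\|_{L^p}, \ \|e^{tX} f\|_{L^p} \le e^{-(a/p)t} \|f\|_{L^p} \Longrightarrow \|P_k(t) f\|_{L^p} \le e^{-(a/p)t} \|f\|_{L^p},$

hence

(11.84) $\|e^{t(\Delta + X)} f\|_{L^p} \le e^{-(a/p)t} \|f\|_{L^p},$

for $f \in L^1(M) \cap L^\infty(M)$. Since $L^1(M) \cap L^\infty(M)$ is dense in $L^p(M)$, (11.84)
extends to all $f \in L^p(M)$, for each $p \in [1, \infty)$. This leads to the following result,
extending Proposition 11.1.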
Proposition 11.16. Let M be a complete Riemannian manifold and X a bounded
vector field satisfying (11.73). Then $\{e^{t(\Delta + X)} : t \ge 0\}$ is a strongly continuous
semigroup on $L^p(M)$, for each $p \in [1, \infty)$.


Proof. To start, we take $f, g \in C_0^\infty(M)$ and write

(11.85) $(e^{t(\Delta + X)} f - f, g) = \int_0^t (e^{s(\Delta + X)} (\Delta + X) f, g)\, ds.$

The integrand is continuous in s, and we have the pointwise bound

(11.86) $|(e^{s(\Delta + X)} (\Delta + X) f, g)| \le e^{-(a/p)s} \|(\Delta + X) f\|_{L^p} \|g\|_{L^{p'}}$, for $1 \le p < \infty$.

Hence, for $t \in (0, 1]$,

(11.87) $|(e^{t(\Delta + X)} f - f, g)| \le C t\, \|(\Delta + X) f\|_{L^p} \|g\|_{L^{p'}}, \quad 1 \le p < \infty,$

so

(11.88) $\|e^{t(\Delta + X)} f - f\|_{L^p} \le C t\, \|(\Delta + X) f\|_{L^p}, \quad 1 \le p < \infty,$

for all $f \in C_0^\infty(M)$. If $1 \le p < \infty$, $C_0^\infty(M)$ is dense in $L^p(M)$, and the uniform
operator bound (11.84) yields the desired strong continuity on $L^p(M)$.


It is also useful to consider the adjoint semigroup:

(11.89) $P(t) = e^{t(\Delta + X)} \Longrightarrow P(t)^* = e^{t(\Delta - X - V)}, \quad V = \operatorname{div} X.$

Compare (2.22) of Chapter 2. It follows from Proposition 11.16 and material of
§9 in the Outline of Functional Analysis appendix (cf. Exercise 9) that $P(t)^*$ is a
strongly continuous semigroup on $L^p(M)$ for $p \in (1, \infty)$. We complement this:


Proposition 11.17. The family $\{P(t)^* : t \ge 0\}$ maps $L^1(M) \cap L^2(M)$ to itself
and extends uniquely to a strongly continuous contraction semigroup on $L^1(M)$.


Proof. We have from (11.80) that 

(11.90) lle“AY Flaw < eT Fle, 

for p € (1,00). In addition, 

(11.91) Je U(-X—V) F(z) =f fe dV (a 
M 


by an argument parallel to (11.76), which also yields (11.90) for p = 1. Taking 
p = 2, we have as in Proposition 11.15 that 


k 
(11.92) et(A-X-V) p — dim (clt/Aelt/W-X-V)) f, 


in L?-norm, for all f € L?(M). Denote the operator on the right side of (11.92) 
by P(t). We have, for p € [1, 00), 


f € L'(M)nL®(M) 
(11.93) = |e fllze < [[fllze, lleO*— fllze < eI Fl ze 
= ||P fllze < e~@?™ | fle. 
Hence (11.92) gives 
(11.94) PO fila» < eI Fle, 
for f € L1(M)N L®(M), p € [1, 00). Since L1(M) M L°(M) is dense in 
L?(M), (11.94) extends to all f € L?(M), for each p € [1, 00), and in particular 


for p = 1. Hence P(t)* extends to a contraction semigroup on L'(/). Strong 
continuity follows as in Proposition 11.16. 


By duality, we have a natural extension of P(t) to

(11.95) $e^{t(\Delta + X)} : L^\infty(M) \longrightarrow L^\infty(M),$

consistent with (11.82). Arguing as in Proposition 11.2 also leads to the following.


Proposition 11.18. In the setting of Proposition 11.16, we have for
$u(t) = e^{t(\Delta + X)} f$ that


(11.96) f €C.(M) = u € C((0, 00), B°(M)). 
We next establish the following extension of (11.7) and (11.11).

Proposition 11.19. Assume the complete Riemannian manifold M has the property
that (11.7) holds for $f \in C_0^\infty(M)$ (hence (11.11) holds). Take X as in
Proposition 11.16. Then, for $t \ge 0$,

(11.97) $f \in L^1(M) \Longrightarrow \int_M P(t)^* f\, dV = \int_M f\, dV,$

and

(11.98) $e^{t(\Delta + X)} 1 = 1.$


Proof. If f € L'(M) 1 L?(M), then, under our hypotheses, 
(11.99) [ Phwsav = f tay, 
M M 
for each k. Having Proposition 11.17, we also have 
(11.100) Pr apse 2) S int sion, 
for all f € DI1(M). This gives (11.97), and (11.98) follows by duality. 
Heat kernel estimates on $e^{t(\Delta + Y)}$. Here we establish Gaussian upper bounds on
the integral kernel of $e^{tL}$, with

(11.101) $L = \Delta + Y,$

where Y is a first order differential operator, of the form Yf = Xf + Vf, X a real
vector field, V a real valued function on M (or, as in (11.89), Y = -X - V). As
usual, M is a complete Riemannian manifold. In addition, we make the hypothesis
that

(11.102) M has $C^\infty$ bounded geometry,

i.e., (for some $\delta > 0$), in exponential coordinates on $B_\delta(0) \subset T_p M$, we have $C^\infty$
bounds on the metric tensor and its inverse, and we also assume

(11.103) the coefficients of Y are $C^\infty$ bounded on $B_\delta(0)$,

independently of $p \in M$. As for the estimate (11.6), we use wave equation techniques,
such as done in [CGT], but in the present setting we do not have self
adjointness of L, so further work is required.

Regarding the spectral behavior of L, the analysis done above implies L generates
a strongly continuous semigroup $e^{tL}$, with $D(L) = H^2(M)$. Adding a
constant to L, we can arrange that 0 is in the resolvent set of L, so

(11.104) $L^{-1} : L^2(M) \longrightarrow H^2(M).$


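The solution operator to the wave equation we work with is W(t), defined
initially on $C_0^\infty(M) \oplus C_0^\infty(M)$ by

(11.105) $W(t) \begin{pmatrix} f \\ g \end{pmatrix} = \begin{pmatrix} u(t) \\ v(t) \end{pmatrix},$

solving

(11.106) $\frac{\partial}{\partial t} \begin{pmatrix} u \\ v \end{pmatrix} = \begin{pmatrix} v \\ Lu \end{pmatrix}, \quad u(0) = f, \quad v(0) = g.$

Note that (11.106) implies

(11.107) $\partial_t^2 u = Lu, \quad u(0) = f, \quad \partial_t u(0) = g,$
$\qquad\ \ \partial_t^2 v = Lv, \quad v(0) = g, \quad \partial_t v(0) = Lf.$

The fact that $u(t), v(t) \in C_0^\infty(M)$ for all t follows by finite propagation speed,
plus the hypothesis that M is complete. Under the hypotheses (11.102)-(11.103),
energy estimates of the sort used in Chapter 6 imply that W(t) extends to a
strongly continuous group of operators

(11.108) $W(t) : H^1(M) \oplus L^2(M) \longrightarrow H^1(M) \oplus L^2(M).$

We have an operator norm bound

(11.109) $\|W(t)\|_{\mathcal{L}(H^1 \oplus L^2)} \le C e^{M|t|}.$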


Now W(t) is a 2 × 2 matrix of operators,

(11.110) $W(t) = \begin{pmatrix} C_1(t) & S_{01}(t) \\ S_{10}(t) & C_2(t) \end{pmatrix},$

with $C_j(t)$ even in t and $S_{jk}(t)$ odd in t, so

(11.111) $\tfrac{1}{2} \bigl[ W(t) + W(-t) \bigr] = \begin{pmatrix} C_1(t) & 0 \\ 0 & C_2(t) \end{pmatrix},$

with

(11.112) $C_1(t) : H^1(M) \longrightarrow H^1(M), \quad C_2(t) : L^2(M) \longrightarrow L^2(M).$

In both cases $C_j(t) f = u(t)$ satisfies

(11.113) $\partial_t^2 u = Lu, \quad u(0) = f, \quad \partial_t u(0) = 0.$

We have that $C_2(t)$ is the unique continuous linear extension of $C_1(t)$ from
$H^1(M)$ to $L^2(M)$, and drop the notational distinction, obtaining

(11.114) $C(t) : H^1(M) \longrightarrow H^1(M), \quad L^2(M) \longrightarrow L^2(M),$

satisfying

(11.115) $\|C(t)\|_{\mathcal{L}(H^1)}, \ \|C(t)\|_{\mathcal{L}(L^2)} \le C e^{M|t|}.$

Note that

(11.116) $L = \Delta \Longrightarrow C(t) = \cos t\sqrt{-\Delta}.$

Note that the identity W(t)W(s) = W(t + s) implies

(11.117) $C(t + s) = C(t)C(s) + S_{01}(t) S_{10}(s).$

Similarly

(11.118) $C(t - s) = C(t)C(s) - S_{01}(t) S_{10}(s),$

hence

(11.119) $C(t + s) + C(t - s) = 2 C(t) C(s).$


This identity will be useful below. 
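
The identity (11.119) does not rely on self-adjointness, and a finite-dimensional toy model makes this visible. In the sketch below, L is replaced by a non-symmetric 2 × 2 matrix (a hypothetical stand-in, not an operator from the text), and C(t) is the power series $\sum_k t^{2k} L^k/(2k)!$, which solves the matrix version of (11.113). All names are ours.

```python
def mat_mul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def mat_add(a, b):
    """Entrywise sum of two 2x2 matrices."""
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]

def cosine_family(L, t, terms=40):
    """C(t) = sum_k t^(2k) L^k / (2k)!, so that C'' = L C, C(0) = I, C'(0) = 0."""
    result = [[1.0, 0.0], [0.0, 1.0]]
    term = [[1.0, 0.0], [0.0, 1.0]]
    for k in range(1, terms):
        scale = t * t / ((2 * k - 1) * (2 * k))
        term = [[scale * v for v in row] for row in mat_mul(term, L)]
        result = mat_add(result, term)
    return result

def identity_defect(L, t, s):
    """Largest entry of C(t+s) + C(t-s) - 2 C(t) C(s)."""
    lhs = mat_add(cosine_family(L, t + s), cosine_family(L, t - s))
    prod = mat_mul(cosine_family(L, t), cosine_family(L, s))
    return max(abs(lhs[i][j] - 2.0 * prod[i][j])
               for i in range(2) for j in range(2))

L_model = [[-1.0, 0.5], [0.0, -2.0]]  # hypothetical non-symmetric generator
print(identity_defect(L_model, 0.7, 0.4))
```

The printed defect should be at the level of floating-point rounding.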
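We now define a functional calculus. Assume

(11.120) $\varphi(\lambda) = \varphi(-\lambda)$, and $|\hat\varphi(t)| \le C e^{-K|t|}, \quad K > M.$

Then we set

(11.121) $\varphi(\sqrt{-L}) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \hat\varphi(t)\, C(t)\, dt,$

a strongly convergent integral in view of the operator estimates (11.115).

Here are some basic properties of such a functional calculus. First, if $\varphi$ satisfies
(11.120),

(11.122) $\varphi_s(\lambda) = (\cos s\lambda)\, \varphi(\lambda)$
$\qquad \Longrightarrow \hat\varphi_s(t) = \tfrac{1}{2} \bigl[ \hat\varphi(t + s) + \hat\varphi(t - s) \bigr]$
$\qquad \Longrightarrow \varphi_s(\sqrt{-L}) = \frac{1}{2\sqrt{2\pi}} \int_{-\infty}^{\infty} \hat\varphi(t) \bigl[ C(t - s) + C(t + s) \bigr]\, dt$
$\qquad\qquad = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \hat\varphi(t)\, C(s) C(t)\, dt$
$\qquad\qquad = C(s)\, \varphi(\sqrt{-L}),$

the second-to-last identity by (11.119). If $\psi(\lambda)$ is even and satisfies a similar
estimate, we have

(11.123) $(\psi\varphi)(\lambda) = \psi(\lambda)\varphi(\lambda) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \hat\psi(t) \cos \lambda t\, dt\ \varphi(\lambda) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \hat\psi(t)\, \varphi_t(\lambda)\, dt,$

and hence

(11.124) $(\psi\varphi)(\sqrt{-L}) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \hat\psi(t)\, \varphi_t(\sqrt{-L})\, dt = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \hat\psi(t)\, C(t)\, dt\ \varphi(\sqrt{-L}) = \psi(\sqrt{-L})\, \varphi(\sqrt{-L}).$

For a related identity, suppose $\varphi(\lambda)$ satisfies (11.120) and also

(11.125) $|\partial_t^2 \hat\varphi(t)| \le C e^{-K|t|}.$

Then

(11.126) $\psi(\lambda) = \lambda^2 \varphi(\lambda) \Longrightarrow \hat\psi(t) = -\partial_t^2 \hat\varphi(t) \Longrightarrow \psi(\sqrt{-L}) = -L\, \varphi(\sqrt{-L}),$

since C(t)f = u(t) solves (11.113).
We prepare to tackle the heat semigroup, by considering, for s > 0,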


(11.127) $E_s(\lambda) = e^{-s\lambda^2},$

which satisfies

(11.128) $\hat E_s(t) = \frac{1}{\sqrt{2s}}\, e^{-t^2/4s},$

and gives rise to the family of operators

(11.129) $E_s(\sqrt{-L}) = \frac{1}{\sqrt{4\pi s}} \int_{-\infty}^{\infty} e^{-t^2/4s}\, C(t)\, dt.$

Noting that

(11.130) $\partial_s E_s(\lambda) = -\lambda^2 E_s(\lambda),$

we have from (11.126) that

(11.131) $\frac{\partial}{\partial s} E_s(\sqrt{-L}) = L\, E_s(\sqrt{-L}).$

Meanwhile, (11.129) directly gives

(11.132) $E_s(\sqrt{-L}) f \to f$ in $L^2$-norm, as $s \searrow 0$,

for each $f \in L^2(M)$. Hence $E_s(\sqrt{-L}) = e^{sL}$. We have the identity

(11.133) $e^{sL} f = \frac{1}{\sqrt{4\pi s}} \int_{-\infty}^{\infty} e^{-t^2/4s}\, C(t) f\, dt,$

for s > 0, $f \in L^2(M)$.
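
In the self-adjoint model case, where C(t) acts on a spectral component as multiplication by $\cos t\lambda$ and $e^{sL}$ as multiplication by $e^{-s\lambda^2}$, the representation (11.133) reduces to a classical Gaussian integral. The sketch below checks that scalar identity by quadrature; the function name and parameters are hypothetical, and this scalar reduction is an illustration rather than the general non-self-adjoint statement.

```python
import math

def gaussian_cosine_average(s, lam, half_width=40.0, steps=80000):
    """Trapezoid rule for (4*pi*s)^(-1/2) * integral over R of exp(-t^2/(4s)) cos(t*lam) dt."""
    dt = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        t = -half_width + i * dt
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * math.exp(-t * t / (4.0 * s)) * math.cos(t * lam)
    return total * dt / math.sqrt(4.0 * math.pi * s)

# Compare with exp(-s * lam^2) for a sample pair (s, lam).
print(gaussian_cosine_average(0.5, 1.3), math.exp(-0.5 * 1.3 ** 2))
```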
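We are prepared for integral kernel estimates. To start, suppose $\varphi(\lambda)$ satisfies
(11.120) and let $U_1$ and $U_2$ be disjoint open sets, satisfying

(11.134) $x_j \in U_j \Longrightarrow d(x_1, x_2) \ge \rho.$

Consequently, thanks first to finite propagation speed and then to (11.109),

(11.135) $f \in L^2(U_1) \Longrightarrow \varphi(\sqrt{-L}) f \big|_{U_2} = \frac{1}{\sqrt{2\pi}} \int_{|t| \ge \rho} \hat\varphi(t)\, C(t) f \big|_{U_2}\, dt$
$\qquad \Longrightarrow \|\varphi(\sqrt{-L}) f\|_{L^2(U_2)} \le C \int_{|t| \ge \rho} |\hat\varphi(t)|\, e^{M|t|}\, dt\ \|f\|_{L^2}.$

Here the hypothesis $f \in L^2(U_1)$ means $f \in L^2(M)$ and $\operatorname{supp} f \subset U_1$. If also
$\lambda^{2m} \varphi(\lambda)$ satisfies (11.120), then

(11.136) $f \in L^2(U_1) \Longrightarrow \|L^m \varphi(\sqrt{-L}) f\|_{L^2(U_2)} \le C \int_{|t| \ge \rho} |\partial_t^{2m} \hat\varphi(t)|\, e^{M|t|}\, dt\ \|f\|_{L^2}.$

The integrals that arise for heat kernel estimates are

(11.137) $\int_\rho^\infty \hat E_s(t)\, e^{Mt}\, dt = \frac{C}{\sqrt{s}} \int_\rho^\infty e^{-t^2/4s}\, e^{Mt}\, dt,$

and, more generally,

(11.138) $\int_\rho^\infty |\partial_t^{2m} \hat E_s(t)|\, e^{Mt}\, dt = I_m(M, s, \rho),$

so, when (11.134) holds,

(11.139) $f \in L^2(U_1) \Longrightarrow \|L^m e^{sL} f\|_{L^2(U_2)} \le C\, I_m(M, s, \rho)\, \|f\|_{L^2}.$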
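To proceed, take $x, y \in M$ and say

(11.140) $d(x, y) = \rho + 2\delta,$

with $\delta$ as in (11.103). Take $U_1 = B_\delta(y)$, $U_2 = B_\delta(x)$. With n = dim M, pick
k > n/2, and write

(11.141) $\delta_y = L^k f_y + g_y, \quad f_y, g_y \in L^2(B_\delta(y)),$

with $L^2$-norms $\le C_\delta$. Thus,

(11.142) $e^{sL} \delta_y = L^k e^{sL} f_y + e^{sL} g_y.$

By elliptic regularity and Sobolev embedding we have (with the same $C_\delta$)

(11.143) $|e^{sL} \delta_y(x)| \le C_\delta \bigl[ \|L^k e^{sL} \delta_y\|_{L^2(B_\delta(x))} + \|e^{sL} \delta_y\|_{L^2(B_\delta(x))} \bigr],$

followed by

(11.144) $\|L^k e^{sL} \delta_y\|_{L^2(B_\delta(x))} \le \|L^{2k} e^{sL} f_y\|_{L^2(B_\delta(x))} + \|L^k e^{sL} g_y\|_{L^2(B_\delta(x))} \le C_\delta I_{2k}(M, s, \rho) + C_\delta I_k(M, s, \rho),$

and an analogous estimate on $\|e^{sL} \delta_y\|_{L^2(B_\delta(x))}$, yielding

(11.145) $|e^{sL} \delta_y(x)| \le C_\delta^2 \sum_{j \le 2k} I_j(M, s, \rho).$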


Having this heat kernel estimate, we can now complement Propositions 11.18-11.19
with the following extension of Proposition 11.3 and Corollary 11.4.


Proposition 11.20. Assume M has $C^\infty$ bounded geometry and that (11.103)
holds, with Y = X. Then

(11.146) $e^{t(\Delta + X)} : C_*(M) \longrightarrow C_*(M)$

and

(11.147) $e^{t(\Delta + X)} : C(\hat M) \longrightarrow C(\hat M)$

are strongly continuous semigroups.


Proof. Given that (11.95) is a contraction and given (11.96) and (11.98), it suffices
to show that

(11.148) $e^{t(\Delta + X)} : C_0^\infty(M) \longrightarrow C_*(M).$

This follows directly from (11.145), in the same fashion that (11.25) follows from
(11.22).


Wiener measure for diffusion with drift. Having Propositions 11.18, 11.19, and
11.20, we are in a position to extend the construction of Wiener measure on
$\mathfrak{P} = \prod_{t \in \mathbb{Q}^+} \hat M$ done in (11.27)-(11.31) in the setting of the diffusion semigroup $e^{t\Delta}$.
In the expanded setting of $e^{t(\Delta + X)}$, we use the same formulas (11.28)-(11.30),
this time with h(t, x, y) given by

(11.149) $e^{t(\Delta + X)} f(x) = \int_M h(t, x, y)\, f(y)\, dV(y).$

Again (11.30) produces a positive linear functional $E_{x_0} : C^\# \to \mathbb{R}$, satisfying
$E_{x_0}(1) = 1$, and we have the following variant of Theorem 11.5.


Proposition 11.21. Assume M is a complete Riemannian manifold for which
(11.7) holds and that X satisfies the conditions of Proposition 11.19. Then the
conclusion of Theorem 11.5 holds, with h(t, x, y) as in (11.149).


Having the heat kernel estimate (11.145), we can also extend the results on
stochastic continuity given in Propositions 11.6-11.11.


Relativistic diffusion. Results we have derived on diffusion so far in this chapter
have been decidedly non-relativistic. A Brownian path x(t) as described in §1 is
nowhere Lipschitz, so does not even have finite velocity, much less velocity below light
speed. However, there is a relativistic theory of diffusion. We present a simple
case here.

To start, we recall from §7 that the Langevin theory of Brownian motion actually
has the velocity $v(t) = x'(t)$ satisfy a stochastic equation,

(11.150) $dv = d\omega - \beta v\, dt,$

where $\beta$ is a friction coefficient. In case $\beta = 0$, we have $v(t) = \omega(t)$. In this model, the
velocity is at least finite, but it has no upper bound. A further wrinkle is needed to
model relativistic diffusion.

To see how it works, we recall from §18 of Chapter 1 how a path x in
Minkowski space is parametrized by proper time $\tau$, yielding the 4-velocity


(11.151) u= = = (1,2), 
satisfying 

(11.152) (u,u) =—1, 
i.e.,

(11.153) - : 


0 de TPR 
Then u is a path in the one-sheeted hyperboloid

(11.154) $\mathcal{H} = \{(t, x) \in \mathbb{R}^4 : -t^2 + |x|^2 = -1,\ t > 0\}.$

Given the metric tensor induced from the Lorentz metric on $\mathbb{R}^4$, $\mathcal{H}$ is 3D hyperbolic
space, of constant sectional curvature -1. In case of zero friction, we
therefore take $u(\tau)$ to be given by the hyperbolic space analogue of the Wiener
process, so

(11.155) $u(\tau) = X_\tau \in L^2(\mathfrak{P}, W_{u_0}),$

depending continuously on $\tau$, where

(11.156) $\mathfrak{P} = \prod_{\tau \in \mathbb{Q}^+} \hat{\mathcal{H}},$

and $W_{u_0}$ is the Wiener measure on $\mathfrak{P}$ constructed in Theorem 11.5, with $e^{\tau\Delta}$
the heat semigroup generated by the Laplace-Beltrami operator $\Delta$ on $\mathcal{H}$, and $x_0$
replaced by $u_0$. More generally, we can scale the rate of diffusion and replace $e^{\tau\Delta}$
by $e^{\sigma\tau\Delta}$ for some constant $\sigma > 0$. As seen in Chapter 8, §5, on 3D hyperbolic
space the heat kernel is given by

(11.157) $h(\tau, x, y) = (4\pi\tau)^{-3/2}\, e^{-\tau}\, \frac{r}{\sinh r}\, e^{-r^2/4\tau}, \quad r = d(x, y).$
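
A quick numerical sketch can confirm that a kernel of this type has total mass one, as a heat kernel on a space satisfying (11.11) should. The code assumes the standard closed form $(4\pi\tau)^{-3/2} e^{-\tau} (r/\sinh r)\, e^{-r^2/4\tau}$ for 3D hyperbolic space of curvature -1 (our reading of (11.157)) and the volume element $4\pi \sinh^2 r\, dr$ in geodesic polar coordinates. Function names and quadrature parameters are hypothetical.

```python
import math

def hyperbolic_heat_kernel(tau, r):
    """(4*pi*tau)^(-3/2) e^(-tau) (r / sinh r) e^(-r^2/(4 tau)); r / sinh r -> 1 at r = 0."""
    ratio = 1.0 if r == 0.0 else r / math.sinh(r)
    return ((4.0 * math.pi * tau) ** -1.5 * math.exp(-tau) * ratio
            * math.exp(-r * r / (4.0 * tau)))

def total_mass(tau, r_max=60.0, steps=60000):
    """Trapezoid rule for the integral of h * 4*pi*sinh(r)^2 over 0 <= r <= r_max."""
    dr = r_max / steps
    total = 0.0
    for i in range(steps + 1):
        r = i * dr
        weight = 0.5 if i in (0, steps) else 1.0
        total += (weight * hyperbolic_heat_kernel(tau, r)
                  * 4.0 * math.pi * math.sinh(r) ** 2)
    return total * dr

for tau in (0.5, 1.0, 2.0):
    print(tau, total_mass(tau))
```

The factor $e^{-\tau}$ is exactly what compensates the exponential volume growth of hyperbolic space in this computation.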


With the random 4-velocity so specified, the actual random path $x(\tau)$ in
Minkowski space is given by

(11.158) $x(\tau) = x(0) + \int_0^\tau u(\sigma)\, d\sigma,$

where x(0) is the initial position.

We now consider relativistic diffusion with friction. A straightforward analogue
of the stochastic differential equation (11.150) will not work, since the
4-velocity u is not running over a linear space. For inspiration, we look to §10,
Exercise 5, which shows that the equation (11.150) is associated to a diffusion
equation $\partial f/\partial t = Lf$, where, for n = 1,

(11.159) $L = \frac{\partial^2}{\partial v^2} - \beta v \frac{\partial}{\partial v}.$

For Brownian motion on $\mathbb{R}^n$, we have

(11.160) $L = \Delta - \beta v \cdot \nabla,$

where $\Delta = \sum (\partial/\partial v_j)^2$ is the Laplace operator on $\mathbb{R}^n$ and $-\beta v \cdot \nabla = X$ is a
vector field on $\mathbb{R}^n$ generating a radial flow in toward the origin (given $\beta > 0$). For
relativistic diffusion, we bring in a diffusion process with drift on $\mathcal{H}$, replacing
the heat semigroup $e^{\tau\Delta}$ by a semigroup of the form

(11.161) $e^{\tau L}, \quad L = \Delta + X,$

where $\Delta$ is the Laplace-Beltrami operator on $\mathcal{H}$ and X is a vector field on $\mathcal{H}$.
In analogy with (11.160), we pick a distinguished point $u_1 \in \mathcal{H}$ (perhaps $u_1 =
(1, 0, 0, 0)$, for which the classical component of the velocity vanishes) and take
X to generate a flow toward $u_1$. A natural analogue of the vector field in (11.160)
is

(11.162) $X = -\beta \operatorname{grad} V, \quad V(u) = v(d(u_1, u)),$


for example with 


(11.163) v(r) 


In such a case, $\{e^{\tau L} : \tau \ge 0\}$ is a strongly continuous, positivity preserving
semigroup on $L^p(\mathcal{H})$, for $p \in [1, \infty)$, also acting on $C(\hat{\mathcal{H}})$, and satisfying
$e^{\tau L} 1 = 1$.


More closely parallel to (11.160) would be $v(r) = r^2$, which goes beyond the
results developed here on diffusion with drift. For a discussion of that case, as it
relates to relativistic diffusion, see [Hab].

Relativistic diffusion has also been studied on non-flat spacetimes. See in par- 
ticular [FJ] for a treatment on Schwarzschild spacetimes. 


A. The Trotter product formula 


It is often of use to analyze the solution operator to an evolution equation of the
form

$\frac{\partial u}{\partial t} = Au + Bu$

in terms of the solution operators $e^{tA}$ and $e^{tB}$, which individually might have
fairly simple behavior. The case where A is the Laplace operator and B is multiplication
by a function is used in §2 to establish the Feynman-Kac formula, as a
consequence of Proposition A.4 below.

The following result, known as the Trotter product formula, was established 
in [Tro]. 


Theorem A.1. Let A and B generate contraction semigroups $e^{tA}$ and $e^{tB}$, on a
Banach space X. If $\overline{A + B}$ is the generator of a contraction semigroup R(t), then

(A.1) $R(t) f = \lim_{n \to \infty} \bigl( e^{(t/n)A} e^{(t/n)B} \bigr)^n f,$

for all $f \in X$.


Here, $\overline{A + B}$ denotes the closure of A + B. A simplified proof in the case where
A + B itself is the generator of R(t) is given in an appendix to [Nel2]. We will
give that proof.


Proposition A.2. Assume that A, B, and A + B generate contraction semigroups
P(t), Q(t), and R(t) on X, respectively, where $D(A + B) = D(A) \cap D(B)$. Then
(A.1) holds for all $f \in X$.


Proof. It suffices to prove (A.1) for $f \in D = D(A + B)$. In such a case, we have

(A.2) $P(h)Q(h) f - f = h(A + B) f + o(h),$

since $P(h)Q(h) f - f = (P(h) f - f) + P(h)(Q(h) f - f)$. Also, $R(h) f - f =
h(A + B) f + o(h)$, so

$P(h)Q(h) f - R(h) f = o(h)$ in X, for $f \in D$.

Since A + B is a closed operator, D is a Banach space in the norm $\|f\|_D =
\|(A + B) f\| + \|f\|$. For each $f \in D$, $\{h^{-1}(P(h)Q(h) - R(h)) f : h > 0\}$ is a bounded set
in X. By the uniform boundedness principle, there is a constant C such that

$\frac{1}{h} \|P(h)Q(h) f - R(h) f\| \le C \|f\|_D,$

for all h > 0 and $f \in D$. In other words, $\{h^{-1}(P(h)Q(h) - R(h)) : h > 0\}$ is
bounded in $\mathcal{L}(D, X)$, and the family tends strongly to 0 as $h \to 0$. Consequently,

$\frac{1}{h} \|P(h)Q(h) f - R(h) f\| \longrightarrow 0$

uniformly for f in a compact subset of D.
Now, with $t \ge 0$ fixed, for any $f \in D$, $\{R(s) f : 0 \le s \le t\}$ is a compact
subset of D, so

(A.3) $\|(P(h)Q(h) - R(h)) R(s) f\| = o(h)$

uniformly for $0 \le s \le t$. Set h = t/n. We need to show that $(P(h)Q(h))^n f -
R(hn) f \to 0$, as $n \to \infty$. Indeed, adding and subtracting terms of the form
$(P(h)Q(h))^j R(hn - hj)$, and using $\|P(h)Q(h)\| \le 1$, we have

(A.4) $\|(P(h)Q(h))^n f - R(hn) f\| \le \|(P(h)Q(h) - R(h)) R(h(n-1)) f\|$
$\qquad + \|(P(h)Q(h) - R(h)) R(h(n-2)) f\|$
$\qquad + \cdots + \|(P(h)Q(h) - R(h)) f\|.$

This is a sum of n terms that are uniformly o(t/n), by (A.3), so the proof is done.
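
For bounded operators the convergence in (A.1) can be watched directly. The following sketch uses two non-commuting symmetric negative-definite 2 × 2 matrices (hypothetical choices, so that both semigroups are contractions in the Euclidean norm), computes the product $(e^{(t/n)A} e^{(t/n)B})^n$, and compares it with $e^{t(A+B)}$. The matrix exponential is a plain Taylor series, adequate only for small matrices like these; all names are ours.

```python
def mat_mul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def mat_add(a, b):
    """Entrywise sum of two 2x2 matrices."""
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]

def mat_scale(a, c):
    """Scalar multiple c * a."""
    return [[c * v for v in row] for row in a]

def mat_exp(m, terms=40):
    """Taylor series for exp(m); fine for the small matrices used here."""
    result = [[1.0, 0.0], [0.0, 1.0]]
    term = [[1.0, 0.0], [0.0, 1.0]]
    for k in range(1, terms):
        term = mat_scale(mat_mul(term, m), 1.0 / k)
        result = mat_add(result, term)
    return result

def trotter_error(A, B, t, n):
    """Largest entry of (exp(tA/n) exp(tB/n))^n - exp(t(A+B))."""
    step = mat_mul(mat_exp(mat_scale(A, t / n)), mat_exp(mat_scale(B, t / n)))
    product = [[1.0, 0.0], [0.0, 1.0]]
    for _ in range(n):
        product = mat_mul(product, step)
    exact = mat_exp(mat_scale(mat_add(A, B), t))
    return max(abs(product[i][j] - exact[i][j])
               for i in range(2) for j in range(2))

A = [[-1.0, 0.0], [0.0, -3.0]]   # hypothetical generator
B = [[-2.0, 1.0], [1.0, -2.0]]   # hypothetical generator, does not commute with A
for n in (10, 100, 1000):
    print(n, trotter_error(A, B, 1.0, n))
```

The printed errors should shrink roughly in proportion to 1/n, in line with the bound stated for bounded operators in Exercise 3.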


Note that the proof of Proposition A.2 used the contractivity of P(t) and of
Q(t), but not that of R(t). On the other hand, the contractivity of R(t) follows
from (A.1). Furthermore, the hypothesis that P(t) and Q(t) are contraction semigroups
can be generalized to $\|P(t)\| \le e^{at}$, $\|Q(t)\| \le e^{bt}$. If C = A + B
generates a semigroup R(t), we conclude that $\|R(t)\| \le e^{(a+b)t}$.

We also note that only certain properties of S(h) = P(h)Q(h) play a role in
the proof of Proposition A.2. We use

(A.5) $S(h) f - f = hCf + o(h), \quad f \in D = D(C),$

where C is the generator of the semigroup R(h), to get

(A.6) $S(h) f - R(h) f = o(h), \quad f \in D.$


As above, we have $h^{-1} \|S(h) f - R(h) f\| \le C \|f\|_D$ in this case, and consequently
$h^{-1} \|S(h) f - R(h) f\| \to 0$ uniformly for f in a compact subset of D,
such as $\{R(s) f : 0 \le s \le t\}$. Thus we have analogues of (A.3) and (A.4), with
P(h)Q(h) everywhere replaced by S(h), proving the following.


Proposition A.3. Let S(t) be a strongly continuous, operator-valued function of
$t \in [0, \infty)$, such that the strong derivative $S'(0) f = Cf$ exists, for $f \in D =
D(C)$, where C generates a semigroup on a Banach space X. Assume $\|S(t)\| \le 1$
or, more generally, $\|S(t)\| \le e^{ct}$. Then, for all $f \in X$,

(A.7) $e^{tC} f = \lim_{n \to \infty} S(n^{-1} t)^n f.$

This result was established in [Chf], in the more general case where $S'(0)$ has
closure C, generating a semigroup.

Proposition A.2 applies to the following important family of examples. Let
$X = L^p(\mathbb{R}^n)$, $1 \le p < \infty$, or let $X = C_o(\mathbb{R}^n)$, the space of continuous functions
vanishing at infinity. Let $A = \Delta$, the Laplace operator, and $B = -M_V$, that
is, $Bf(x) = -V(x) f(x)$. If V is bounded and continuous on $\mathbb{R}^n$, then B is
bounded on X, so $\Delta - V$, with domain $D(\Delta)$, generates a semigroup, as shown
in Proposition 9.12 of Appendix A. Thus Proposition A.2 applies, and we have
the following:


Proposition A.4. If $X = L^p(\mathbb{R}^n)$, $1 \le p < \infty$, or $X = C_o(\mathbb{R}^n)$, and if V is
bounded and continuous on $\mathbb{R}^n$, then, for all $f \in X$,

(A.8) $e^{t(\Delta - V)} f = \lim_{n \to \infty} \bigl( e^{(t/n)\Delta} e^{-(t/n)V} \bigr)^n f.$

This is the result used in §2. If $X = L^p(\mathbb{R}^n)$, $p < \infty$, we can in fact take
$V \in L^\infty(\mathbb{R}^n)$. See the exercises for other extensions of this proposition.

It will be useful to extend Proposition A.2 to solution operators for time-dependent
evolution equations:

(A.9) $\frac{\partial u}{\partial t} = Au + B(t)u, \quad u(0) = f.$

We will restrict attention to the special case that A generates a contraction
semigroup and B(t) is a continuous family of bounded operators on a Banach
space X. The solution operator S(t, s) to (A.9), satisfying S(t, s)u(s) = u(t),
can be constructed via the integral equation

(A.10) $u(t) = e^{tA} f + \int_0^t e^{(t-s)A} B(s) u(s)\, ds,$

parallel to the proof of Proposition 9.12 in Appendix A on functional analysis. We
have the following result.


Proposition A.5. If A generates a contraction semigroup and B(t) is a continuous
family of bounded operators on X, then the solution operator to (A.9) satisfies

(A.11) $S(t, 0) f = \lim_{n \to \infty} \bigl( e^{(t/n)A} e^{(t/n)B((n-1)t/n)} \bigr) \cdots \bigl( e^{(t/n)A} e^{(t/n)B(0)} \bigr) f,$

for each $f \in X$.


There are n factors in parentheses on the right side of (A.11), the jth from the
right being $e^{(t/n)A} e^{(t/n)B((j-1)t/n)}$.

The proof has two parts. First, in close parallel to the derivation of (A.4), we
have, for any $f \in D(A)$, that the difference between the right side of (A.11) and

(A.12) $e^{(t/n)(A + B((n-1)t/n))} \cdots e^{(t/n)(A + B(0))} f$

has norm $\le n \cdot o(1/n)$, tending to zero as $n \to \infty$, for t in any bounded interval
[0, T]. Second, we must compare (A.12) with S(t, 0)f. Now, for any fixed t > 0,
define v(s) on $0 \le s \le t$ by

(A.13) $\frac{\partial v}{\partial s} = Av + B\Bigl( \frac{j-1}{n}\, t \Bigr) v, \quad \frac{j-1}{n}\, t \le s < \frac{j}{n}\, t, \quad v(0) = f.$

Thus (A.12) is equal to v(t). Now we can write

(A.14) $\frac{\partial v}{\partial s} = Av + B(s)v + R(s)v, \quad v(0) = f,$

where, for n large enough, $\|R(s)\| \le \varepsilon$, for $0 \le s \le t$. Thus

(A.15) $v(t) = S(t, 0) f + \int_0^t S(t, s) R(s) v(s)\, ds,$

and the last term in (A.15) is small. This establishes (A.11).
Thus we have the following extension of Proposition A.4. Denote by $BC(\mathbb{R}^n)$
the space of bounded, continuous functions on $\mathbb{R}^n$, with the sup norm.


Proposition A.6. If $X = L^p(\mathbb{R}^n)$, $1 \le p < \infty$, or $X = C_o(\mathbb{R}^n)$, and if V(t)
belongs to $C([0, \infty), BC(\mathbb{R}^n))$, then the solution operator S(t, 0) to

$\frac{\partial u}{\partial t} = \Delta u - V(t) u$

satisfies

(A.16) $S(t, 0) f = \lim_{n \to \infty} \bigl( e^{(t/n)\Delta} e^{-(t/n)V((n-1)t/n)} \bigr) \cdots \bigl( e^{(t/n)\Delta} e^{-(t/n)V(0)} \bigr) f,$

for all $f \in X$.


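To end this appendix, we give an alternative proof of the Trotter product formula
when $Au = \Delta u$ and $Bu(x) = V(x)u(x)$, which, while valid for a more
restricted class of functions V(x) than the proof of Proposition A.4, has some
desirable features. Here, we define $v_k = \bigl( e^{(1/n)\Delta} e^{-(1/n)V} \bigr)^k f$ and set

(A.17) $v(t) = e^{s\Delta} e^{-sV} v_k$, for $t = \frac{k}{n} + s$, $0 \le s \le \frac{1}{n}$.

We use Duhamel's principle to compare v(t) with $u(t) = e^{t(\Delta - V)} f$. Note that
$v(t) \to v_{k+1}$ as $t \nearrow (k+1)/n$, and for $k/n < t < (k+1)/n$,

(A.18) $\frac{\partial v}{\partial t} = \Delta v - e^{s\Delta} V e^{-sV} v_k = (\Delta - V) v + [V, e^{s\Delta}]\, e^{-sV} v_k.$

Thus, by Duhamel's principle,

(A.19) $v(t) = e^{t(\Delta - V)} f + \int_0^t e^{(t-s)(\Delta - V)} R(s)\, ds,$

where

(A.20) $R(s) = [V, e^{\sigma\Delta}]\, e^{-\sigma V} v_k$, for $s = \frac{k}{n} + \sigma$, $0 \le \sigma < \frac{1}{n}$.

We can write $[V, e^{\sigma\Delta}] = [V, e^{\sigma\Delta} - 1]$, and hence

(A.21) $R(s) = V (e^{\sigma\Delta} - 1)\, e^{-\sigma V} v_k - (e^{\sigma\Delta} - 1)\, V e^{-\sigma V} v_k.$

Now, as long as

(A.22) $D(\Delta - V) = D(\Delta) = H^2(\mathbb{R}^n),$

we have, for $0 \le \gamma \le 1$,

(A.23) $\|e^{t(\Delta - V)}\|_{\mathcal{L}(H^{-2\gamma}, L^2)} \le C(T)\, t^{-\gamma},$

for $0 < t \le T$. Thus, if we take $\gamma \in (0, 1)$ and $t \in (0, T]$, we have for

(A.24) $F(t) = \int_0^t e^{(t-s)(\Delta - V)} R(s)\, ds,$

the estimate

(A.25) $\|F(t)\|_{L^2} \le C \int_0^t (t - s)^{-\gamma}\, \|R(s)\|_{H^{-2\gamma}}\, ds.$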


We can estimate || R(s)||;;-27 using (A.21), together with the estimate 


(A.26) |e“ ,2Co%, O<7S1 


_ 1 || p¢ea,zr-ay 


Since o € [0,1/n] in (A.21), we have 


I|R(s)|lq-27 < Cn (VIF llzz, 


(A.27) anaes 
eV) = ([IVllecei2y + [Vl etl le. 


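Thus, estimating $v(t) - u(t)$ at $t = 1$, we have

(A.28)    $\bigl\| \bigl(e^{(1/n)\Delta} e^{-(1/n)V}\bigr)^n f - e^{(\Delta - V)} f \bigr\|_{L^2} \le C \varphi(V) \|f\|_{L^2} \cdot n^{-\gamma},$

for $0 < \gamma < 1$, provided multiplication by $V$ is a bounded operator on $H^{2\gamma}(\mathbb{R}^n)$.
Note that this holds if $D^\alpha V \in L^\infty(\mathbb{R}^n)$ for $|\alpha| \le 2$, and

(A.29)    $\|V\|_{\mathcal{L}(H^{2\gamma})} \le C \sup_{|\alpha| \le 2} \|D^\alpha V\|_{L^\infty}.$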


One can similarly establish the estimate 


(A.30) (elev) "5 etla—v) j| 


pp = CMeMV IM fllze 2. 


Exercises 


1. Looking at Exercises 2-4 of §2, Chap. 8, extend Proposition A.4 to any $V$, continuous
on $\mathbb{R}^n$, such that Re $V(x)$ is bounded from below and $|\text{Im}\, V(x)|$ is bounded.

(Hint: First apply those exercises directly to the case where V is smooth, real-valued, 
and bounded from below.) 

2. Let $H = L^2(\mathbb{R})$, $Af = df/dx$, $Bf = ixf(x)$, so $e^{tA} f(x) = f(x+t)$, $e^{tB} f(x) = e^{itx} f(x)$.
Show that Theorem A.1 applies to this case, but not Proposition A.2. Compute both sides of

$e^{pA + qB} f = \lim_{n\to\infty} \bigl(e^{(p/n)A} e^{(q/n)B}\bigr)^n f,$

and verify this identity directly.
Compare with the discussion of the Heisenberg group, in §14 of Chap. 7.

3. Suppose $A$ and $B$ are bounded operators. Show that

$\bigl\| e^{t(A+B)} - \bigl(e^{(t/n)A} e^{(t/n)B}\bigr)^n \bigr\| \le \frac{C_t}{n}$

and that

$\bigl\| e^{t(A+B)} - \bigl(e^{(t/2n)A} e^{(t/n)B} e^{(t/2n)A}\bigr)^n \bigr\| \le \frac{C_t}{n^2}.$

(Hint: Use the power series expansions for $e^{(t/n)A}$, and so forth.)
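The two rates in Exercise 3 can be observed numerically before proving them. The following is a minimal sketch in plain Python, assuming two illustrative noncommuting 2x2 matrices `A` and `B` (not taken from the text) and a truncated power series for the exponential; all helper names are hypothetical. Doubling $n$ should roughly halve the error of the first product and quarter the error of the symmetrized one.

```python
# Illustrative check of the O(1/n) and O(1/n^2) product-formula rates.
# A, B and every helper name here are assumptions made for this sketch.

def mat_mul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]

def mat_add(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def mat_scale(c, X):
    return [[c * x for x in row] for row in X]

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def mat_exp(X, terms=40):
    """exp(X) from its power series; adequate for matrices of small norm."""
    result = identity(len(X))
    term = identity(len(X))
    for k in range(1, terms):
        term = mat_scale(1.0 / k, mat_mul(term, X))
        result = mat_add(result, term)
    return result

def mat_pow(X, n):
    result = identity(len(X))
    for _ in range(n):
        result = mat_mul(result, X)
    return result

def max_norm(X):
    return max(abs(x) for row in X for x in row)

def lie_error(A, B, t, n):
    """Entrywise sup norm of e^{t(A+B)} - (e^{(t/n)A} e^{(t/n)B})^n."""
    step = mat_mul(mat_exp(mat_scale(t / n, A)), mat_exp(mat_scale(t / n, B)))
    exact = mat_exp(mat_scale(t, mat_add(A, B)))
    return max_norm(mat_add(mat_pow(step, n), mat_scale(-1.0, exact)))

def strang_error(A, B, t, n):
    """Same error for the symmetrized product e^{(t/2n)A} e^{(t/n)B} e^{(t/2n)A}."""
    half = mat_exp(mat_scale(t / (2 * n), A))
    step = mat_mul(mat_mul(half, mat_exp(mat_scale(t / n, B))), half)
    exact = mat_exp(mat_scale(t, mat_add(A, B)))
    return max_norm(mat_add(mat_pow(step, n), mat_scale(-1.0, exact)))

A = [[0.0, 1.0], [0.0, 0.0]]
B = [[0.0, 0.0], [1.0, 0.0]]

for n in (50, 100, 200):
    print(n, lie_error(A, B, 1.0, n), strang_error(A, B, 1.0, n))
```

The entrywise sup norm is used only for convenience; any matrix norm exhibits the same rates, since all norms on a finite-dimensional space are equivalent.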






The $\bar\partial$-Neumann Problem


Introduction 
Here we study a boundary problem arising in the theory of functions of several
complex variables. A function $u$ on an open domain $\Omega \subset \mathbb{C}^n$ is holomorphic if
$\bar\partial u = 0$, where

(0.1)    $\bar\partial u = \sum_j \frac{\partial u}{\partial \bar z_j}\, d\bar z_j,$

(0.2)    $\frac{\partial u}{\partial \bar z_j} = \frac{1}{2}\Bigl(\frac{\partial u}{\partial x_j} + i\, \frac{\partial u}{\partial y_j}\Bigr).$

In the study of complex function theory on $\Omega$, one is led to consider the equation

(0.3)    $\bar\partial u = f,$

with $f = \sum f_j\, d\bar z_j$. More generally, one studies (0.3) as an equation for a $(0,q)$-form
$u$, given a $(0,q+1)$-form $f$; definitions of these terms are given in §1. One
is led to a study of a boundary problem for the second-order operator

(0.4)    $\Box = \bar\partial\, \bar\partial^* + \bar\partial^* \bar\partial,$

the $\bar\partial$-Neumann boundary problem, which will also be specified in §1. While the
operator $\Box$ is elliptic, the boundary condition does not satisfy the regularity condition
dealt with in Chap. 5. The solution to this boundary problem by J. J. Kohn
[K1] thus marked an important milestone in the theory of linear PDE, as well as a
significant advance in complex function theory.

The way that (0.3) leads to the $\bar\partial$-Neumann problem is somewhat parallel to
the way the deRham complex leads to the boundary problems for the Hodge
Laplacian discussed in §9 of Chap. 5. Appendix A to this chapter puts the deRham
complex in a general context. Though appendices to chapters in this book are
almost always put at the end of their chapters, we put this one at the beginning,
since its role is to link the previous material on the Hodge Laplacian, particularly
with absolute boundary conditions, to the material of this chapter.

© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
M. E. Taylor, Partial Differential Equations II, Applied Mathematical Sciences 116,
https://doi.org/10.1007/978-3-031-33700-0_12

In §1 we introduce the $\bar\partial$-complex and the $\bar\partial$-Neumann problem, and show that
it is not regular. §§2-4 are devoted to establishing replacements for the regular
elliptic estimates established in Chap. 5, for $(0,1)$-forms, under a "geometrical"
hypothesis on $\Omega$, namely that it be "strongly pseudoconvex." This notion is defined
in §2, in the course of establishing an estimate of C. B. Morrey. In §3 we show
how this leads to a "1/2-estimate," to wit, an estimate of the form

(0.5)    $Q(u,u) \ge C \|u\|^2_{H^{1/2}},$

in a situation where a regular elliptic boundary problem would yield an estimate
on $\|u\|^2_{H^1}$. We then define the Friedrichs extension $\mathcal{L}$ of $\Box$, and show that it has
compact resolvent. In §4 we produce higher-order a priori estimates, of the form

(0.6)    $\|u\|_{H^{k+1}} \le C \|\mathcal{L}u\|_{H^k},$

assuming $u \in \mathcal{D}(\mathcal{L})$ is smooth on $\overline{\Omega}$. In §5 we establish the associated regularity
theorem, that $\mathcal{L}^{-1} : H^k(\Omega, \Lambda^{0,1}) \to H^{k+1}(\Omega, \Lambda^{0,1})$. Following [KN], we use the
method of elliptic regularization to accomplish this.

In §6 we apply the results established in §§2-5 to solve (0.3), when $f$ is a $(0,1)$-form
satisfying $\bar\partial f = 0$ and smooth on $\overline{\Omega}$ (assumed to be strongly pseudoconvex).
We obtain a solution $u \in C^\infty(\overline{\Omega})$ under these hypotheses. As a consequence, we
show that such $\Omega$ is a "domain of holomorphy"; that is, there exist holomorphic
functions on $\Omega$ that cannot be extended beyond any point of $\partial\Omega$.

In §7 we derive a formula for the orthogonal projection $B$ of $L^2(\Omega)$ onto the
subspace $\mathcal{H}(\Omega)$ of $L^2$-holomorphic functions on $\Omega$, in terms of $\mathcal{L}^{-1}$ acting on
$(0,1)$-forms, and we establish some consequences. We consider Toeplitz operators,
of the form $T_f = BM_f$, on $\mathcal{H}(\Omega)$, for $f \in C(\overline{\Omega})$. We show that $T_f$ is
Fredholm if $f|_{\partial\Omega}$ is invertible, and we briefly discuss the problem of computing
the index of $T_f$; this index problem is related to index problems considered in
Chap. 10.

In §§2-6 we concentrate on $(0,1)$-forms, making use of this theory to study
$(0,0)$-forms in §7. In §8 we study the $\bar\partial$-Neumann problem on $(0,q)$-forms for
general $q \ge 1$. The main point is to extend Morrey's inequality. Once this is done,
it is routine to extend the arguments of §§3-6. We also have in §8 an extension of
results of §7 regarding the compactness of commutators of $M_f$, for $f \in C(\overline{\Omega})$,
with certain projections, namely the orthogonal projections onto the positive,
negative, and zero spectral subspaces of the relevant closed extension of $\bar\partial + \bar\partial^*$ on
$\bigoplus_{q \ge 0} L^2(\Omega, \Lambda^{0,q})$.


In §9 we discuss a method that provides an alternative to the sort of energy
estimates done in §§2-5, namely reduction of the problem to an equation on the
boundary, of the form

(0.7)    $\Box^+ g = f,$

where $\Box^+$ is a first-order pseudodifferential operator on $\partial\Omega$. The operator $\Box^+$ is
not elliptic; its principal symbol is $\ge 0$ and vanishes to second order on a ray
bundle over $\partial\Omega$. We show that this operator is hypoelliptic when $\Omega$ is strongly
pseudoconvex. In fact, we do this via energy estimates that are not completely
different from those arising in earlier sections, though alternative approaches to
the analysis of $\Box^+$ are mentioned.

Up through §9, our attention is confined to the $\bar\partial$-complex on domains in $\mathbb{C}^n$.
In §10 we analyze the $\bar\partial$-complex on a strongly pseudoconvex, complex manifold;
in fact, we consider manifolds with an almost complex structure, satisfying an
integrability condition, that can be stated as

(0.8)    $\bar\partial^2 = 0.$

By doing the estimates in this context, one is able to prove the Newlander-Nirenberg
theorem, that an integrable, almost complex structure actually is a
complex structure, admitting local holomorphic coordinates. (In fact, all this was
done by Kohn in [K1].)

At the end of this chapter are two additional appendices. Appendix B gives
some complementary results on the Levi form, introduced in §2 in the course of
deriving Morrey's inequality and defining strong pseudoconvexity. Appendix C
derives a result on the Neumann operator $\mathcal{N}$ for the Dirichlet problem (for the
Laplace operator), useful for the analysis in §9. Namely, we specify the principal
symbol of $\mathcal{N} + \sqrt{-\Delta_X} \in OPS^0(\partial\Omega)$, where $\Delta_X$ is the Laplace operator on
$X = \partial\Omega$, in terms of the second fundamental form of $\partial\Omega \subset \overline{\Omega}$.

Other methods have been applied to the $\bar\partial$-complex. We mention particularly
the method of weighted $L^2$-estimates, such as done in [AV, Ho1], and [Ho3].
These methods also apply directly to general pseudoconvex domains (i.e., one
can omit the "strong"), though they ignore detailed boundary behavior. Another
approach to the $\bar\partial$-equation is given in [EMM]. There has also been considerable
work on the $\bar\partial$-Neumann problem on various classes of weakly pseudoconvex
domains, including particularly [Cat, Chr, FeK, K2], and [NRSW].

There is a very different approach to the $\bar\partial$-equation, making use of explicit
integral kernels; see the survey article of [Kh], or [HP1].

As another offshoot of the study of the $\bar\partial$-complex, we mention the study of
"CR manifolds" (of which the boundary of a complex domain is the simplest
example); surveys of this are given in [Bog] and [Tai]. Also, there are studies of
general overdetermined systems; see [Sp] for a survey.


A. Elliptic complexes

We give a brief discussion of a setting in which the study of boundary problems
for the Hodge Laplacian in §9 of Chap. 5 can be included. The Hodge Laplacian
arises from the deRham complex, a sequence of maps

(A.1)    $\cdots \xrightarrow{\ d\ } \Lambda^k(M) \xrightarrow{\ d\ } \Lambda^{k+1}(M) \xrightarrow{\ d\ } \cdots,$

where $d$ is the exterior derivative. Key properties are that $d^2 = 0$ and that the
symbol sequence is exact, for each nonzero $\xi \in T_x^*$:

(A.2)    $\cdots \xrightarrow{\ \wedge\xi\ } \Lambda^k T_x^* \xrightarrow{\ \wedge\xi\ } \Lambda^{k+1} T_x^* \xrightarrow{\ \wedge\xi\ } \cdots.$

This implies that $d^*d + dd^*$ is elliptic.

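More generally, consider a sequence of first-order differential operators
between sections of vector bundles $F_j \to \overline{M}$. For notational simplicity, we
will use script (e.g., $\mathcal{F}_j$) to denote spaces of smooth sections of various vector
bundles. Suppose we have maps

(A.3)    $\vartheta : \mathcal{F}_\ell \to \mathcal{F}_{\ell+1}, \quad \vartheta^2 = 0.$

Suppose the symbol sequence is exact; we are said to have an elliptic complex.
Set

(A.4)    $E_0 = \bigoplus_j F_{2j}, \quad E_1 = \bigoplus_j F_{2j+1}, \quad E = E_0 \oplus E_1,$

so we also have spaces of sections, $\mathcal{E}_0$, $\mathcal{E}_1$, and $\mathcal{E}$. Using metrics on $F_j$ and $\overline{M}$ to
define $\vartheta^* : \mathcal{F}_\ell \to \mathcal{F}_{\ell-1}$, we have

(A.5)    $D_0 : \mathcal{E}_0 \to \mathcal{E}_1, \quad D_1 : \mathcal{E}_1 \to \mathcal{E}_0, \quad D_j = \vartheta + \vartheta^*,$

and we fit these together to form $D : \mathcal{E} \to \mathcal{E}$. Since $\vartheta^2 = 0$, we have

(A.6)    $D^*D = D^2 = \vartheta\vartheta^* + \vartheta^*\vartheta : \mathcal{F}_\ell \to \mathcal{F}_\ell.$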

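Now the general Green formula implies

(A.7)    $(D^*Du, v) = (Du, Dv) + \beta(u,v),$

with boundary term

(A.8)    $\beta(u,v) = \frac{1}{i} \int_{\partial M} \bigl[ \langle \sigma_{D_1}(x,\nu) D_0 u_0, v_0 \rangle + \langle \sigma_{D_0}(x,\nu) D_1 u_1, v_1 \rangle \bigr]\, dS,$

where we write $u \in \mathcal{E}$ as $u = u_0 + u_1$, $u_j \in \mathcal{E}_j$. For the sake of definiteness, let
us take

(A.9)    $u \in \mathcal{F}_\ell, \quad v \in \mathcal{F}_\ell,$

for a fixed $\ell$. Then

(A.10)    $\beta(u,v) = \frac{1}{i} \int_{\partial M} \langle (\vartheta + \vartheta^*)u, \sigma_{(\vartheta + \vartheta^*)}(x,\nu) v \rangle\, dS = \frac{1}{i} \int_{\partial M} \bigl[ \langle \vartheta u, \sigma_\vartheta(x,\nu) v \rangle + \langle \vartheta^* u, \sigma_{\vartheta^*}(x,\nu) v \rangle \bigr]\, dS.$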


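We rewrite this in two different ways, parallel to (9.1) and (9.2) of Chap. 5,
respectively, namely

(A.11)    $\beta(u,v) = \frac{1}{i} \int_{\partial M} \bigl[ \langle \vartheta u, \sigma_\vartheta(x,\nu) v \rangle + \langle \sigma_\vartheta(x,\nu) \vartheta^* u, v \rangle \bigr]\, dS$

and

(A.12)    $\beta(u,v) = \frac{1}{i} \int_{\partial M} \bigl[ \langle \sigma_{\vartheta^*}(x,\nu) \vartheta u, v \rangle + \langle \vartheta^* u, \sigma_{\vartheta^*}(x,\nu) v \rangle \bigr]\, dS.$

Thus there arise two boundary problems for $D^*D$ on $\mathcal{F}_\ell$, the generalization of the
"relative" boundary condition (9.4) of Chap. 5:

(A.13)    $\sigma_\vartheta(x,\nu) u = 0, \quad \sigma_\vartheta(x,\nu) \vartheta^* u = 0 \quad \text{on } \partial M,$

and the generalization of the "absolute" boundary condition (9.5) of Chap. 5:

(A.14)    $\sigma_{\vartheta^*}(x,\nu) u = 0, \quad \sigma_{\vartheta^*}(x,\nu) \vartheta u = 0 \quad \text{on } \partial M.$

In each case we have $\beta(u,v) = 0$ provided $u$ and $v$ satisfy the boundary condition.
We remark that the "absolute" boundary condition (A.14) is often called the
"abstract Neumann boundary condition."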

Define Sobolev spaces $H^j_b(M, F_\ell)$ in analogy with (9.11) of Chap. 5, with
$b = R$ or $A$; namely, $u \in H^1(M, F_\ell)$ belongs to $H^1_b(M, F_\ell)$ if and only if the
zero-order boundary condition in (A.13) (for $b = R$) or (A.14) (for $b = A$) is
satisfied, and $u \in H^2(M, F_\ell)$ belongs to $H^2_b(M, F_\ell)$ if and only if both boundary
conditions, in either (A.13) or (A.14), are satisfied.


Lemma A.1. Given $u \in H^1_b(M, F_\ell)$ and $D^*Du \in L^2(M, F_\ell)$, then $\beta(u,v) = 0$
for all $v \in H^1_b(M, F_\ell)$ if and only if all the appropriate boundary data for $u$
vanish (e.g., $\sigma_\vartheta(x,\nu)\vartheta^* u = 0$ on $\partial M$, in case $b = R$).


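Proof. We need to establish the "only if" part. Take the case $b = R$. To start the
argument, pick $\sigma \in C^\infty(\overline{M}, \text{Hom}(F_{\ell-1}, F_\ell))$ such that $\sigma(x) = \sigma_\vartheta(x,\nu)$ for
$x \in \partial M$. Then, for any $w \in \mathcal{F}_{\ell-1}$, we have $v = \sigma w \in H^1_R(M, F_\ell)$, and hence,
for any $u \in H^1_R(M, F_\ell)$,

(A.15)    $\beta(u,v) = \frac{1}{i} \int_{\partial M} \langle \sigma_\vartheta(x,\nu) \vartheta^* u, \sigma_\vartheta(x,\nu) w \rangle\, dS = \frac{1}{i} \int_{\partial M} \langle \sigma_\vartheta(x,\nu)^* \sigma_\vartheta(x,\nu) (\vartheta^* u), w \rangle\, dS.$

This vanishes for all $w \in \mathcal{F}_{\ell-1}$ if and only if $\sigma_\vartheta(x,\nu)^* \sigma_\vartheta(x,\nu)(\vartheta^* u) = 0$ on
$\partial M$, which in turn occurs if and only if $\sigma_\vartheta(x,\nu)(\vartheta^* u) = 0$ on $\partial M$. This establishes
the lemma for $b = R$; the case $b = A$ is similar.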


Of course, the method of proof of the existence and regularity results in
Propositions 9.4-9.7 of Chap. 5, via Lemma 9.2, does not extend to this more
general situation. It is conceivable that one of the boundary conditions, (A.13) or
(A.14), for $L = D^*D$, could be regular for all $\ell$, for some $\ell$, or for no $\ell$. Since
$L$ is strongly elliptic, Proposition 11.13 of Chap. 5 can be used to examine regularity.
We will now investigate consequences of the hypothesis that one of these
boundary conditions is regular, for $L$ acting on sections of $F_\ell$. We will call this
hypothesis Reg$(\vartheta, \ell, b)$, with $b = R$ or $A$.

Let us define the unbounded operator $D_{\ell b}$ on $L^2(M, F_\ell) \to L^2(M, E)$ to
be the closure of $D$ acting on $H^1_b(M, F_\ell)$. Let $D^*_{\ell b}$ denote the Hilbert space
adjoint of $D_{\ell b}$, an unbounded operator on $L^2(M, E)$. Then $\mathcal{L} = D^*_{\ell b} D_{\ell b}$ is an
unbounded, self-adjoint operator on $L^2(M, F_\ell)$, with dense domain $\mathcal{D}(\mathcal{L})$. Since
for all $u \in \mathcal{D}(\mathcal{L})$, $v \in H^1_b(M, F_\ell)$, we have $(\mathcal{L}u, v) = (Du, Dv)$, taking $v \in \mathcal{F}_\ell$
compactly supported in the interior $M$ implies $\mathcal{L}u = D^*Du$ in $\mathcal{D}'(M)$. Hence
$u$ has well-defined boundary data, in (A.13) or (A.14), and, by Lemma A.1,
the appropriate boundary data vanish. Therefore, the regularity result of Proposition 11.14
in Chap. 5 is applicable; we have $\mathcal{D}(\mathcal{L}) \subset H^2_b(M, F_\ell)$, under the
hypothesis Reg$(\vartheta, \ell, b)$. The reverse inclusion is easy. If we define $\mathcal{L}_b$ to be $D^*D$
on $H^2_b(M, F_\ell)$, it follows that $\mathcal{L}_b$ is a symmetric extension of $\mathcal{L}$, but a self-adjoint
operator cannot have a proper symmetric extension. Thus

$\mathcal{D}(\mathcal{L}) = H^2_b(M, F_\ell),$

granted the hypothesis Reg$(\vartheta, \ell, b)$. We restate this as follows:

Proposition A.2. Under the hypothesis Reg$(\vartheta, \ell, b)$, the operator $\mathcal{L}$ defined by

(A.16)    $\mathcal{D}(\mathcal{L}) = H^2_b(M, F_\ell), \quad \mathcal{L}u = D^*Du \ \text{on } \mathcal{D}(\mathcal{L})$

is self-adjoint.


It follows then from Reg$(\vartheta, \ell, b)$ that Ker $\mathcal{L}$ is a finite-dimensional subspace of
$\mathcal{F}_\ell$; call it $\mathcal{H}^b_\ell$. Parallel to (9.38) of Chap. 5, we have

(A.17)    $u \in \mathcal{H}^b_\ell \iff u \in H^1_b(M, F_\ell) \ \text{and} \ \vartheta u = \vartheta^* u = 0.$

Denote by $P^b_h$ the orthogonal projection of $L^2(M, F_\ell)$ onto $\mathcal{H}^b_\ell$. As in (9.38)-(9.39)
of Chap. 5, we have continuous maps

(A.18)    $G^b : L^2(M, F_\ell) \to H^2_b(M, F_\ell)$

such that $G^b$ annihilates $\mathcal{H}^b_\ell$ and inverts $\mathcal{L}$ on the orthogonal complement of $\mathcal{H}^b_\ell$,
so

(A.19)    $\mathcal{L} G^b u = (I - P^b_h) u, \quad \text{for } u \in L^2(M, F_\ell),$

and, by elliptic regularity, $G^b : H^j(M, F_\ell) \to H^{j+2}(M, F_\ell)$. The following
result generalizes Proposition 9.8 of Chap. 5.


Proposition A.3. Granted Reg$(\vartheta, \ell, b)$, then given $u \in H^j(M, F_\ell)$, $j \ge 0$, we
have

(A.20)    $u = \vartheta\vartheta^* G^b u + \vartheta^*\vartheta G^b u + P^b_h u = P^b_\vartheta u + P^b_{\vartheta^*} u + P^b_h u.$

The three terms on the right side are mutually orthogonal in $L^2(M, F_\ell)$. Furthermore,

$P^b_\vartheta, \ P^b_{\vartheta^*}, \ P^b_h : H^j(M, F_\ell) \to H^j(M, F_\ell).$

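Proof. Only the orthogonality remains to be checked. As in the proof of
Proposition 9.8 of Chap. 5, we use

(A.21)    $(\vartheta u, v) = (u, \vartheta^* v) + \gamma(u,v),$

for sections $u$ of $F_{j-1}$ and $v$ of $F_j$, with

(A.22)    $\gamma(u,v) = \frac{1}{i} \int_{\partial M} \langle \sigma_\vartheta(x,\nu) u, v \rangle\, dS = \frac{1}{i} \int_{\partial M} \langle u, \sigma_{\vartheta^*}(x,\nu) v \rangle\, dS.$

Note that $\gamma(u,v) = 0$ if either $u \in H^1_R(M, F_{j-1})$ or $v \in H^1_A(M, F_j)$. In particular,
we see that

(A.23)    $u \in H^1_R(M, F_{j-1}) \implies \vartheta u \perp \ker \vartheta^* \cap H^1(M, F_j),$

(A.24)    $v \in H^1_A(M, F_j) \implies \vartheta^* v \perp \ker \vartheta \cap H^1(M, F_{j-1}).$

From the definitions, we have

(A.25)    $\vartheta^* : H^2_R(M, F_j) \to H^1_R(M, F_{j-1}), \qquad \vartheta : H^2_A(M, F_j) \to H^1_A(M, F_{j+1}),$

so

(A.26)    $\vartheta\vartheta^* H^2_R(M, F_\ell) \perp \ker \vartheta^* \cap H^1(M, F_\ell)$

and

(A.27)    $\vartheta^*\vartheta H^2_A(M, F_\ell) \perp \ker \vartheta \cap H^1(M, F_\ell).$

Now (A.26) and (A.27) imply, respectively, for the ranges,

(A.28)    $R(P^R_\vartheta) \perp R(P^R_{\vartheta^*}) + R(P^R_h) \quad \text{and} \quad R(P^A_{\vartheta^*}) \perp R(P^A_\vartheta) + R(P^A_h).$

Now, if $u \in \mathcal{H}^R_\ell$ and $v = \vartheta G^R w$, then $\gamma(u,v) = 0$, so $(u, \vartheta^* v) = (\vartheta u, v) = 0$.
Similarly, if $v \in \mathcal{H}^A_\ell$ and $u = \vartheta^* G^A w$, then $\gamma(u,v) = 0$, so $(\vartheta u, v) =
(u, \vartheta^* v) = 0$. Thus

(A.29)    $R(P^R_{\vartheta^*}) \perp R(P^R_h) \quad \text{and} \quad R(P^A_\vartheta) \perp R(P^A_h).$

The proof is complete.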


Even though the proof of Proposition A.3 is perfectly parallel to that of
Proposition 9.11 of Chap. 5, we have included the details, as they will be needed
for an argument below that is not parallel to one of §9 in Chap. 5.

The application made to relative cohomology in (9.51)-(9.55) of Chap. 5 does
not have a straightforward extension to the general setting. The natural generalization
of $C^\infty(\overline{M}, \Lambda^k)$ in (9.51) is

(A.30)    $\mathcal{F}^R_\ell = \{u \in \mathcal{F}_\ell : \sigma_\vartheta(x,\nu) u = 0 \ \text{on } \partial M\},$

but in contrast to (9.52), we cannot expect in general to have

(A.31)    $\vartheta : \mathcal{F}^R_\ell \to \mathcal{F}^R_{\ell+1}.$

Of course, we do have $\vartheta : \mathcal{F}_\ell \to \mathcal{F}_{\ell+1}$. We can define $\mathcal{E}^\ell \subset \mathcal{C}^\ell \subset \mathcal{F}_\ell$ as the
image and kernel of $\vartheta$, respectively, and then we have cohomology groups

(A.32)    $\mathcal{H}^\ell(\vartheta) = \mathcal{C}^\ell / \mathcal{E}^\ell.$

The argument in Proposition 9.11 of Chap. 5, relating $\mathcal{H}^k(\overline{M})$ to the space $\mathcal{H}^A_k$ of
harmonic forms, used a homotopy argument, which has no analogue in the general
case. However, another approach works, to give the following:


Proposition A.4. Under the hypothesis Reg$(\vartheta, \ell, A)$, there is a natural isomorphism

(A.33)    $\mathcal{H}^A_\ell \approx \mathcal{H}^\ell(\vartheta).$


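Proof. Let $u \in \mathcal{F}_\ell$, $\vartheta u = 0$. Use the orthogonal decomposition (A.20), with
$b = A$, to write $u = P^A_\vartheta u + P^A_{\vartheta^*} u + P^A_h u$. Now (A.27) implies $P^A_{\vartheta^*} u = 0$, so
$u = \vartheta(\vartheta^* G^A u) + P^A_h u$, hence every $u \in \mathcal{C}^\ell$ is cohomologous to an element of
$\mathcal{H}^A_\ell$. Thus the natural homomorphism arising from $\mathcal{H}^A_\ell \subset \mathcal{C}^\ell$,

$\kappa : \mathcal{H}^A_\ell \to \mathcal{H}^\ell(\vartheta),$

is surjective. The proof that $\kappa$ is injective is parallel to the argument used in
Proposition 9.11 of Chap. 5. If $v \in \mathcal{H}^A_\ell$ and $v = \vartheta u$, $u \in \mathcal{F}_{\ell-1}$, then $\gamma(u,v) = 0$,
so $(v,v) = (\vartheta u, v) = (u, \vartheta^* v) = 0$. Hence $\kappa$ is injective, and the proof is
complete.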
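The isomorphism between harmonic sections and cohomology has an elementary finite-dimensional analogue, in which no boundary conditions arise. The following is a sketch only, under the assumption that the spaces $\mathcal{F}_\ell$ are replaced by $\mathbb{Q}^m$ and $\vartheta$ by integer matrices with product zero; the simplicial coboundary matrices and all function names are illustrative choices, not objects from the text. It compares the dimension of the kernel of $\vartheta\vartheta^* + \vartheta^*\vartheta$ with the dimension of $\ker\vartheta / \text{im}\,\vartheta$.

```python
# Finite-dimensional analogue of "harmonic space = cohomology".
# The matrices below are illustrative assumptions (cochains on a triangle).
from fractions import Fraction

def mat_mul(X, Y):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]

def transpose(X):
    return [list(col) for col in zip(*X)]

def mat_add(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def rank(M):
    """Rank by exact Gaussian elimination over the rationals."""
    rows = [[Fraction(x) for x in row] for row in M]
    r = 0
    ncols = len(rows[0]) if rows else 0
    for c in range(ncols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][c] != 0:
                factor = rows[i][c] / rows[r][c]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def harmonic_dim(d_prev, d_next):
    """dim ker(d_prev d_prev^T + d_next^T d_next) on the middle space."""
    laplacian = mat_add(mat_mul(d_prev, transpose(d_prev)),
                        mat_mul(transpose(d_next), d_next))
    return len(laplacian) - rank(laplacian)

def cohomology_dim(d_prev, d_next):
    """dim(ker d_next) - dim(im d_prev) on the middle space."""
    middle = len(d_prev)  # rows of d_prev = dimension of the middle space
    return (middle - rank(d_next)) - rank(d_prev)

# Vertices -> edges of a triangle (edges 0->1, 1->2, 2->0).
D0 = [[-1, 1, 0], [0, -1, 1], [1, 0, -1]]
# Edges -> faces: zero map for the hollow triangle (a circle),
# the single face of the filled triangle (a disk).
D1_CIRCLE = [[0, 0, 0]]
D1_DISK = [[1, 1, 1]]

print(harmonic_dim(D0, D1_CIRCLE), cohomology_dim(D0, D1_CIRCLE))
print(harmonic_dim(D0, D1_DISK), cohomology_dim(D0, D1_DISK))
```

In this toy setting the two dimensions agree for purely algebraic reasons; the content of Proposition A.4 is that, with the regularity hypothesis and the absolute boundary condition, the same conclusion survives in the infinite-dimensional setting.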


With this sketch of elliptic complexes done, it is time to deliver the bad news.
The regularity hypothesis is rarely satisfied, other than for the deRham complex.
The most fundamental complex that arises next is the $\bar\partial$-complex, for which the
regularity hypothesis does not hold. However, for a certain class of domains $M$,
one has "subelliptic estimates," from which useful variants of Propositions A.3
and A.4 follow. We will explore this in the rest of this chapter.


1. The $\bar\partial$-complex


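To begin, let us assume $\Omega$ is an open subset of $\mathbb{C}^n$. Standard complex coordinates
on $\mathbb{C}^n$ are $(z_1, \dots, z_n)$, with $z_j = x_j + i y_j$. We identify $\mathbb{C}^n \approx \mathbb{R}^{2n}$, with
coordinates $(x_1, \dots, x_n, y_1, \dots, y_n)$. A $(p,q)$-form on $\Omega$ is by definition a section of
$\mathbb{C} \otimes \Lambda^{p+q} T^*\Omega$ of the form

(1.1)    $u = \sum_{\beta,\gamma} u_{\beta\gamma}(z)\, dz^\beta \wedge d\bar z^\gamma,$

where

(1.2)    $dz^\beta = dz_{\beta_1} \wedge \cdots \wedge dz_{\beta_p}, \quad d\bar z^\gamma = d\bar z_{\gamma_1} \wedge \cdots \wedge d\bar z_{\gamma_q},$

with

(1.3)    $dz_j = dx_j + i\, dy_j, \quad d\bar z_j = dx_j - i\, dy_j.$

We impose the same anticommutation relations on wedge products as before, so

(1.4)    $dz_j \wedge dz_k = -dz_k \wedge dz_j, \quad dz_j \wedge d\bar z_k = -d\bar z_k \wedge dz_j, \quad d\bar z_j \wedge d\bar z_k = -d\bar z_k \wedge d\bar z_j.$

If the coefficients $u_{\beta\gamma}$ in (1.1) belong to $C^\infty(\Omega)$, we write $u \in \Lambda^{p,q}(\Omega)$; if they
belong to $C^\infty(\overline{\Omega})$, we write $u \in \Lambda^{p,q}(\overline{\Omega})$. There is a differential operator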


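(1.5)    $\bar\partial : \Lambda^{p,q}(\Omega) \to \Lambda^{p,q+1}(\Omega)$

defined by

(1.6)    $\bar\partial u = \sum_{\beta,\gamma,j} \frac{\partial u_{\beta\gamma}}{\partial \bar z_j}\, d\bar z_j \wedge dz^\beta \wedge d\bar z^\gamma = (-1)^p \sum_{\beta,\gamma,j} \frac{\partial u_{\beta\gamma}}{\partial \bar z_j}\, dz^\beta \wedge d\bar z_j \wedge d\bar z^\gamma.$

Here, we define $\partial/\partial \bar z_j$ by

(1.7)    $\frac{\partial v}{\partial \bar z_j} = \frac{1}{2}\Bigl(\frac{\partial v}{\partial x_j} + i\, \frac{\partial v}{\partial y_j}\Bigr),$

so that a complex-valued function $v \in C^\infty(\Omega)$ is holomorphic if and only if
$\partial v/\partial \bar z_j = 0$, $1 \le j \le n$. Equivalently, $v \in \Lambda^{0,0}(\Omega)$ is holomorphic if and
only if $\bar\partial v = 0$. The operator $\bar\partial$ has some properties in common with the exterior
derivative $d$. For example, just as $d^2 = 0$, we have

(1.8)    $\bar\partial^2 = 0,$

by virtue of the identity $\partial^2 u_{\beta\gamma}/\partial \bar z_k \partial \bar z_j = \partial^2 u_{\beta\gamma}/\partial \bar z_j \partial \bar z_k$ and the relation
$d\bar z_j \wedge d\bar z_k = -d\bar z_k \wedge d\bar z_j$. Thus we have, for each $p$, a complex:

(1.9)    $\cdots \xrightarrow{\ \bar\partial\ } \Lambda^{p,q}(\Omega) \xrightarrow{\ \bar\partial\ } \Lambda^{p,q+1}(\Omega) \xrightarrow{\ \bar\partial\ } \cdots.$

As in (A.6), we form the second-order operator

(1.10)    $\Box = \bar\partial\, \bar\partial^* + \bar\partial^* \bar\partial : \Lambda^{p,q}(\Omega) \to \Lambda^{p,q}(\Omega).$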


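When $\bar\partial$ is given by (1.6) and we take $dz^\beta \wedge d\bar z^\gamma$ to be orthogonal to the form
$dz^{\beta'} \wedge d\bar z^{\gamma'}$ when the indices satisfy

$\beta_1 < \cdots < \beta_p, \quad \gamma_1 < \cdots < \gamma_q, \quad \beta'_1 < \cdots < \beta'_p, \quad \gamma'_1 < \cdots < \gamma'_q, \quad (\beta, \gamma) \ne (\beta', \gamma'),$

and we take $dz^\beta \wedge d\bar z^\gamma$ to have square norm $2^{p+q}$, we obtain

(1.11)    $\Box u = -\frac{1}{2} \sum_{\beta,\gamma} \sum_j \Bigl( \frac{\partial^2 u_{\beta\gamma}}{\partial x_j^2} + \frac{\partial^2 u_{\beta\gamma}}{\partial y_j^2} \Bigr)\, dz^\beta \wedge d\bar z^\gamma,$

when $u$ has the form (1.1). In other words, $\Box u = -(1/2)\Delta u$, where $\Delta u$ is computed
componentwise. The "absolute" boundary condition (A.14) becomes

(1.12)    $\sigma_{\bar\partial^*}(x,\nu) u = 0, \quad \sigma_{\bar\partial^*}(x,\nu) \bar\partial u = 0 \quad \text{on } \partial\Omega.$

This is the (homogeneous) $\bar\partial$-Neumann boundary condition.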
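The definition (1.7) and the relation $\Box = -(1/2)\Delta$ can be sanity-checked numerically in one complex variable, on functions. The following is a rough finite-difference sketch, not part of the text; the helper names are hypothetical, and the expression $-2\,\partial_z \partial_{\bar z}$ for $\Box$ on functions is taken from the formula for $\bar\partial^*$ in (1.23).

```python
# Finite-difference sketch of d/dzbar, d/dz and Box = -(1/2) Laplacian on functions.
# All names here are illustrative assumptions; n = 1 for simplicity.

def d_dzbar(f, z, h=1e-3):
    """(1/2)(d/dx + i d/dy) f at z, by central differences, as in (1.7)."""
    fx = (f(z + h) - f(z - h)) / (2 * h)
    fy = (f(z + 1j * h) - f(z - 1j * h)) / (2 * h)
    return 0.5 * (fx + 1j * fy)

def d_dz(f, z, h=1e-3):
    """(1/2)(d/dx - i d/dy) f at z, by central differences, as in (1.22)."""
    fx = (f(z + h) - f(z - h)) / (2 * h)
    fy = (f(z + 1j * h) - f(z - 1j * h)) / (2 * h)
    return 0.5 * (fx - 1j * fy)

def laplacian(f, z, h=1e-3):
    """Five-point approximation to f_xx + f_yy at z."""
    return (f(z + h) + f(z - h) + f(z + 1j * h) + f(z - 1j * h) - 4 * f(z)) / h**2

def box_on_functions(f, z, h=1e-3):
    """-2 d/dz (d/dzbar f): dbar^* dbar applied to a function, using (1.23)."""
    return -2.0 * d_dz(lambda w: d_dzbar(f, w, h), z, h)

z0 = 0.3 + 0.4j
holomorphic = lambda z: z**3 + 2 * z
radial = lambda z: abs(z) ** 4

print(abs(d_dzbar(holomorphic, z0)))          # small: dbar kills holomorphic functions
print(box_on_functions(radial, z0), -0.5 * laplacian(radial, z0))
```

The two printed values on the last line agree up to discretization error, which is the scalar case of (1.11).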

Now the system (1.11)-(1.12) does not generally yield a regular elliptic boundary
problem. If it did, the frozen-coefficient boundary problem on any region
$\mathcal{O} \subset \mathbb{C}^n$ bounded by a hyperplane would also be regular. We can investigate
such a boundary problem as follows.

First, applying a rotation by a unitary matrix acting on $\mathbb{C}^n$, we can take $\mathcal{O}$ to
be $\{z \in \mathbb{C}^n : \text{Im}\, z_n > 0\}$. Let us consider the case $(p,q) = (0,1)$, so


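(1.13)    $u = \sum_j u_j\, d\bar z_j.$

Then

(1.14)    $\sigma_{\bar\partial^*}(x,\nu) \bar\partial u = \sum_{j=1}^{n-1} \Bigl( \frac{\partial u_j}{\partial \bar z_n} - \frac{\partial u_n}{\partial \bar z_j} \Bigr)\, d\bar z_j,$

so the boundary condition (1.12) says that, for $z = (z', x_n, 0) \in \partial\mathcal{O}$, we have

(1.15)    $u_n(z', x_n, 0) = 0, \quad \frac{\partial u_j}{\partial \bar z_n}(z', x_n, 0) = 0, \quad 1 \le j \le n-1.$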


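Thus, in this case the $\bar\partial$-Neumann problem decouples into $n$ boundary problems
for the Laplace operator $\Delta$ acting on complex-valued functions. One is the
Dirichlet problem, which of course is regular. The other $n - 1$ are all of the form

(1.16)    $\Delta v = f \ \text{on } \mathcal{O}, \quad \frac{\partial v}{\partial \bar z_n}(z', x_n, 0) = 0.$

Equivalently, we can investigate regularity for

(1.17)    $\Delta v = 0 \ \text{on } \mathcal{O}, \quad \frac{\partial v}{\partial \bar z_n}(z', x_n, 0) = g(z', x_n).$

If we attempt to write $v = \text{PI}\, h$, then $g$ and $h$ are related by

(1.18)    $g = \frac{1}{2}\Bigl( \frac{\partial}{\partial x_n} + i\mathcal{N} \Bigr) h,$

where $\mathcal{N}$ is the Neumann operator for $\Delta$, given by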


(1.19)    $(\mathcal{N}h)\hat{\ }(\xi, \eta') = -\bigl(|\xi|^2 + |\eta'|^2\bigr)^{1/2}\, \hat h(\xi, \eta'),$

where $\xi = (\xi_1, \dots, \xi_n)$ and $\eta' = (\eta_1, \dots, \eta_{n-1})$ are variables dual to $x =
(x_1, \dots, x_n)$ and to $y' = (y_1, \dots, y_{n-1})$, respectively. Thus,


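(1.20)    $\Bigl( \Bigl( \frac{\partial}{\partial x_n} + i\mathcal{N} \Bigr) h \Bigr)\hat{\ }(\xi, \eta') = -i \Bigl( \xi_n + \sqrt{|\xi|^2 + |\eta'|^2} \Bigr)\, \hat h(\xi, \eta').$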
We see that the pseudodifferential operator $\partial/\partial x_n + i\mathcal{N}$ is not elliptic. The ray
$(\xi, \eta')$ on which $\xi_1 = \cdots = \xi_{n-1} = 0 = \eta_1 = \cdots = \eta_{n-1}$ but $\xi_n < 0$ is
characteristic for this operator. Since this operator is not elliptic, the boundary
problem (1.17) is not regular. Consequently, if $n \ge 2$, the $\bar\partial$-Neumann problem is
never a regular elliptic boundary problem for $(0,1)$-forms.


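Exercises

1. Define $\partial : \Lambda^{p,q}(\Omega) \to \Lambda^{p+1,q}(\Omega)$ by

(1.21)    $\partial u = \sum_{\beta,\gamma,j} \frac{\partial u_{\beta\gamma}}{\partial z_j}\, dz_j \wedge dz^\beta \wedge d\bar z^\gamma$

when $u$ is given by (1.1) and we set

(1.22)    $\frac{\partial v}{\partial z_j} = \frac{1}{2}\Bigl( \frac{\partial v}{\partial x_j} - i\, \frac{\partial v}{\partial y_j} \Bigr),$

parallel to (1.7). Show that

$\partial^2 = 0.$

2. If $u = \sum u_j\, d\bar z_j$, show that

(1.23)    $\bar\partial^* u = -2 \sum_j \frac{\partial u_j}{\partial z_j}.$

More generally, calculate $\bar\partial^*$ on $(p,q)$-forms. Then verify the formula (1.11) for
$\Box = \bar\partial\, \bar\partial^* + \bar\partial^* \bar\partial$.

3. When dim $\Omega = n$, show that the $\bar\partial$-Neumann problem for $(0,n)$-forms is equivalent to
the Dirichlet problem for $\Delta$ acting on scalar functions and consequently is coercive.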


2. Morrey's inequality, the Levi form, and strong pseudoconvexity


The following estimate of C. B. Morrey provides the first useful handle on the
$\bar\partial$-Neumann problem.


Proposition 2.1. If $\Omega$ is a smoothly bounded region in $\mathbb{C}^n$ that is strongly
pseudoconvex, then, for some $C > 0$,

(2.1)    $\|\bar\partial u\|^2_{L^2} + \|\bar\partial^* u\|^2_{L^2} \ge C \int_{\partial\Omega} |u|^2\, dS, \quad \forall\, u \in \mathcal{D}^{0,1}.$


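Here, $\mathcal{D}^{0,1}$ consists of smooth $(0,1)$-forms on $\overline{\Omega}$ satisfying the zero-order part
of the $\bar\partial$-Neumann boundary condition (1.12). More generally, we set

(2.2)    $\mathcal{D}^{p,q} = \{u \in \Lambda^{p,q}(\overline{\Omega}) : \sigma_{\bar\partial^*}(x,\nu) u = 0 \ \text{on } \partial\Omega\}.$

We will define "strongly pseudoconvex" below, after deriving an identity that
leads to (2.1) once the appropriate definition is made.

We prepare to work on the left side of (2.1). Writing $u = \sum u_j\, d\bar z_j$, we have

(2.3)    $\bar\partial u = \sum_{j<k} \Bigl( \frac{\partial u_j}{\partial \bar z_k} - \frac{\partial u_k}{\partial \bar z_j} \Bigr)\, d\bar z_k \wedge d\bar z_j, \qquad \bar\partial^* u = -2 \sum_j \frac{\partial u_j}{\partial z_j},$

and if $\rho \in C^\infty(\overline{\Omega})$ is a real-valued, defining function for $\Omega$, so $\rho = 0$ on $\partial\Omega$ and
$\rho < 0$ on $\Omega$, while $|\nabla\rho| = 1$ on $\partial\Omega$, then

(2.4)    $u \in \mathcal{D}^{0,1} \iff \sum_j u_j \frac{\partial \rho}{\partial z_j} = 0 \ \text{on } \partial\Omega.$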
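Thus, for $u \in \mathcal{D}^{0,1}$,

(2.5)    $\|\bar\partial u\|^2_{L^2} = 4 \sum_{j<k} \Bigl\| \frac{\partial u_j}{\partial \bar z_k} - \frac{\partial u_k}{\partial \bar z_j} \Bigr\|^2_{L^2} = 4 \sum_{j,k} \Bigl\| \frac{\partial u_j}{\partial \bar z_k} \Bigr\|^2_{L^2} - 4 \sum_{j,k} \Bigl( \frac{\partial u_j}{\partial \bar z_k}, \frac{\partial u_k}{\partial \bar z_j} \Bigr)_{L^2}.$

Integration by parts yields

(2.6)    $\Bigl( \frac{\partial u_j}{\partial \bar z_k}, \frac{\partial u_k}{\partial \bar z_j} \Bigr) = -\Bigl( \frac{\partial^2 u_j}{\partial z_j \partial \bar z_k}, u_k \Bigr) + \int_{\partial\Omega} \frac{\partial \rho}{\partial z_j} \frac{\partial u_j}{\partial \bar z_k}\, \bar u_k\, dS = \Bigl( \frac{\partial u_j}{\partial z_j}, \frac{\partial u_k}{\partial z_k} \Bigr) + \int_{\partial\Omega} \Bigl[ \frac{\partial \rho}{\partial z_j} \frac{\partial u_j}{\partial \bar z_k}\, \bar u_k - \frac{\partial \rho}{\partial \bar z_k} \frac{\partial u_j}{\partial z_j}\, \bar u_k \Bigr]\, dS.$

The condition (2.4) implies that $\sum_k (\partial\rho/\partial \bar z_k)\, \bar u_k = 0$ on $\partial\Omega$, so the last term on
the right side of (2.6) vanishes after being summed over $k$. Also, (2.4) implies that
$\sum_k \bar u_k\, \partial/\partial \bar z_k = \bar Z$ is a tangential derivative on $\partial\Omega$. Hence

(2.7)    $u \in \mathcal{D}^{0,1} \implies \sum_k \bar u_k \frac{\partial}{\partial \bar z_k} \Bigl( \sum_j u_j \frac{\partial \rho}{\partial z_j} \Bigr) = 0 \ \text{on } \partial\Omega,$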

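so

(2.8)    $\sum_{j,k} \frac{\partial \rho}{\partial z_j} \frac{\partial u_j}{\partial \bar z_k}\, \bar u_k = -\sum_{j,k} \frac{\partial^2 \rho}{\partial z_j \partial \bar z_k}\, u_j \bar u_k \ \text{on } \partial\Omega.$

Thus (2.6) becomes

(2.9)    $\sum_{j,k} \Bigl( \frac{\partial u_j}{\partial \bar z_k}, \frac{\partial u_k}{\partial \bar z_j} \Bigr) = \sum_{j,k} \Bigl( \frac{\partial u_j}{\partial z_j}, \frac{\partial u_k}{\partial z_k} \Bigr) - \sum_{j,k} \int_{\partial\Omega} L_{jk}\, u_j \bar u_k\, dS,$

where

(2.10)    $L_{jk} = \frac{\partial^2 \rho}{\partial z_j \partial \bar z_k}.$

Since the first term on the right side of (2.9) is equal to $(1/4) \|\bar\partial^* u\|^2_{L^2}$, we have
from (2.5) the identity

(2.11)    $\|\bar\partial u\|^2_{L^2} + \|\bar\partial^* u\|^2_{L^2} = 4 \sum_{j,k} \Bigl\| \frac{\partial u_j}{\partial \bar z_k} \Bigr\|^2_{L^2} + 4 \int_{\partial\Omega} \Bigl( \sum_{j,k} L_{jk}\, u_j \bar u_k \Bigr)\, dS.$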

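The integrand in the last integral involves the Levi form, a sesquilinear form
defined as follows on the "holomorphic tangent space" of $\partial\Omega$. If $p \in \partial\Omega$, we set

(2.12)    $\mathcal{L}(a,b) = \sum_{j,k} L_{jk}(p)\, a_j \bar b_k = \sum_{j,k} \frac{\partial^2 \rho}{\partial z_j \partial \bar z_k}\, a_j \bar b_k, \quad \forall\, a, b \in \mathfrak{H}_p(\partial\Omega),$

where

(2.13)    $\mathfrak{H}_p(\partial\Omega) = \Bigl\{ a \in \mathbb{C}^n : \sum_j a_j \frac{\partial \rho}{\partial z_j}(p) = 0 \Bigr\}.$

Note that $\mathfrak{H}_p(\partial\Omega)$ is precisely the maximal $\mathbb{C}$-linear subspace of the tangent space
$T_p \partial\Omega \subset \mathbb{R}^{2n} = \mathbb{C}^n$. It is readily verified that (2.12) is unchanged if $\rho$ is replaced
by another defining function $\tilde\rho$, satisfying the conditions specified above for $\rho$.
(This fact is also an immediate consequence of the formula (2.21) below.)

By definition, a smoothly bounded domain $\Omega \subset \mathbb{C}^n$ is strongly pseudoconvex
if and only if its Levi form is a positive-definite Hermitian form on $\mathfrak{H}_p(\partial\Omega)$, for
all $p \in \partial\Omega$.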
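As a concrete instance of this definition, the unit ball in $\mathbb{C}^2$ can be checked by hand or numerically. The following sketch assumes the defining function $\rho(z) = (|z|^2 - 1)/2$, which satisfies $|\nabla\rho| = 1$ on the unit sphere; the finite-difference helpers and their names are illustrative and not from the text. The matrix $L_{jk}$ of (2.10) should come out close to $(1/2)\delta_{jk}$, so the Levi form is positive on the holomorphic tangent space.

```python
# Numerical Levi matrix L_jk = d^2 rho / (dz_j d zbar_k) for the unit ball in C^2.
# rho_ball and all helper names are assumptions made for this sketch.

def rho_ball(z):
    """Defining function (|z|^2 - 1)/2 of the unit ball; |grad rho| = 1 on the sphere."""
    return 0.5 * (sum(abs(w) ** 2 for w in z) - 1.0)

def second_partial(rho, p, d1, d2, h=1e-3):
    """Mixed second directional derivative of the real function rho at p."""
    def shifted(s1, s2):
        return rho([z + s1 * h * a + s2 * h * b for z, a, b in zip(p, d1, d2)])
    return (shifted(1, 1) - shifted(1, -1) - shifted(-1, 1) + shifted(-1, -1)) / (4 * h * h)

def levi_matrix(rho, p, h=1e-3):
    """(1/4)(d_xj - i d_yj)(d_xk + i d_yk) rho, entry by entry."""
    n = len(p)
    def unit(j, c):
        return [c if i == j else 0.0 for i in range(n)]
    L = [[0j] * n for _ in range(n)]
    for j in range(n):
        for k in range(n):
            xx = second_partial(rho, p, unit(j, 1.0), unit(k, 1.0), h)
            yy = second_partial(rho, p, unit(j, 1j), unit(k, 1j), h)
            xy = second_partial(rho, p, unit(j, 1.0), unit(k, 1j), h)
            yx = second_partial(rho, p, unit(j, 1j), unit(k, 1.0), h)
            L[j][k] = 0.25 * (xx + yy + 1j * (xy - yx))
    return L

def levi_form(L, a, b):
    """Sum over j, k of L_jk a_j conj(b_k), as in (2.12)."""
    return sum(L[j][k] * a[j] * complex(b[k]).conjugate()
               for j in range(len(a)) for k in range(len(b)))

p = [1 + 0j, 0j]          # boundary point (1, 0)
a = [0j, 1 + 0j]          # spans the holomorphic tangent space at p
L = levi_matrix(rho_ball, p)
print(L)
print(levi_form(L, a, a))
```

At $p = (1,0)$ one has $\partial\rho/\partial z_j = \bar z_j/2 = (1/2, 0)$, so the condition in (2.13) reduces to $a_1 = 0$, which is why the vector `a` above is the natural test direction.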


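In view of the fact that, for $u \in \mathcal{D}^{0,1}$, the $n$-tuple $(u_j(p))$ belongs to $\mathfrak{H}_p(\partial\Omega)$
for each $p \in \partial\Omega$, we see from (2.11) that if $\Omega$ is strongly pseudoconvex, then (2.1)
holds. In fact, we have a stronger estimate:

(2.14)    $\|\bar\partial u\|^2_{L^2} + \|\bar\partial^* u\|^2_{L^2} \ge 4 \sum_{j,k} \Bigl\| \frac{\partial u_j}{\partial \bar z_k} \Bigr\|^2_{L^2} + C \int_{\partial\Omega} |u|^2\, dS, \quad \forall\, u \in \mathcal{D}^{0,1}.$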

Exercises 


1. A smooth function $f : \mathbb{C}^n \to \mathbb{R}$ is called "strongly plurisubharmonic" if

(2.15) $\Bigl(\dfrac{\partial^2 f}{\partial z_j\,\partial\bar z_k}\Bigr)$ is positive-definite.

Given such a function, show that $\Omega = \{z \in \mathbb{C}^n : f(z) < 0\}$ is strongly pseudoconvex
if it is not empty.

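2. Show that any strongly convex, smooth $f : \mathbb{C}^n \to \mathbb{R}$ is strongly plurisubharmonic, and
deduce that any strongly convex, bounded $\Omega \subset \mathbb{C}^n$ is strongly pseudoconvex.

(Hint: See (B.13).)

3. Suppose $\Omega$ is a bounded domain in $\mathbb{C}^n$, $O$ a neighborhood of $\partial\Omega$, and $f : O \to \mathbb{R}$ a
smooth function such that $f = c = \text{const.}$ on $\partial\Omega$ and (2.15) holds on $O$, while $f < c$
in $\Omega$. Deduce that $\Omega$ is strongly pseudoconvex.

4. Suppose conversely that $\Omega$ is strongly pseudoconvex, with defining function $\rho$ as in
(2.10). Show that, for sufficiently large $\lambda > 0$, $f = e^{\lambda\rho}$ satisfies (2.15) on a neighborhood
of $\partial\Omega$.

(Hint: Use the identity

$\dfrac{\partial^2 e^{\lambda\rho}}{\partial z_j\,\partial\bar z_k} = \lambda e^{\lambda\rho}\,\dfrac{\partial^2\rho}{\partial z_j\,\partial\bar z_k} + \lambda^2 e^{\lambda\rho}\,\dfrac{\partial\rho}{\partial z_j}\,\dfrac{\partial\rho}{\partial\bar z_k}.$)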


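5. Given $f : O \to \mathbb{R}$ such that $f = c = \text{const.}$ on $\partial\Omega$ and (2.15) holds on $O$, a
neighborhood of $\partial\Omega$, while $f < c$ in $\Omega$, and given $p \in \partial\Omega$, consider the function

(2.16) $g(z) = \sum_j \dfrac{\partial f}{\partial z_j}(p)\,(z_j - p_j) + \dfrac{1}{2}\sum_{j,k} \dfrac{\partial^2 f}{\partial z_j\,\partial z_k}(p)\,(z_j - p_j)(z_k - p_k).$

Show that $p$ has a neighborhood $U$ such that

$S = \{z \in U : g(z) = 0\} \Longrightarrow S \cap \overline{\Omega} = \{p\}.$

(Hint: Write out the power series of $f(p + hz)$, to $O(h^3)$, using $\partial/\partial z_j$, $\partial/\partial\bar z_j$, etc.,
rather than $\partial/\partial x_j$, $\partial/\partial y_j$, etc., to see that

$f(p + hz) = f(p) + 2\,\mathrm{Re}\, g(p + hz) + h^2 \sum_{j,k} \dfrac{\partial^2 f}{\partial z_j\,\partial\bar z_k}(p)\, z_j\bar z_k + O(|hz|^3).$)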
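
The identity quoted in the hint to Exercise 4 is just the chain rule applied twice. The short computation below is a sketch of that step, together with the resulting quadratic form; the vector $a \in \mathbb{C}^n$ is an auxiliary symbol introduced here for illustration.

```latex
\[
\begin{aligned}
\frac{\partial}{\partial\bar z_k} e^{\lambda\rho}
 &= \lambda e^{\lambda\rho}\,\frac{\partial\rho}{\partial\bar z_k}, \\
\frac{\partial^2 e^{\lambda\rho}}{\partial z_j\,\partial\bar z_k}
 &= \lambda e^{\lambda\rho}\,\frac{\partial^2\rho}{\partial z_j\,\partial\bar z_k}
  + \lambda^2 e^{\lambda\rho}\,\frac{\partial\rho}{\partial z_j}\,\frac{\partial\rho}{\partial\bar z_k}, \\
\sum_{j,k}\frac{\partial^2 e^{\lambda\rho}}{\partial z_j\,\partial\bar z_k}\, a_j\bar a_k
 &= \lambda e^{\lambda\rho}\Bigl(\,\sum_{j,k}\frac{\partial^2\rho}{\partial z_j\,\partial\bar z_k}\, a_j\bar a_k
  + \lambda\,\Bigl|\sum_j \frac{\partial\rho}{\partial z_j}\, a_j\Bigr|^2\Bigr).
\end{aligned}
\]
```

The last line uses that $\rho$ is real, so $\partial\rho/\partial\bar z_k$ is the complex conjugate of $\partial\rho/\partial z_k$. The second term vanishes exactly on the holomorphic tangent directions and is weighted by $\lambda$, which suggests why a large $\lambda$ compensates for a possible lack of positivity of the first term off $\mathfrak{H}_p(\partial\Omega)$.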


3. The 1/2-estimate and some consequences


Here we will derive a "1/2-estimate" from Morrey's inequality, and discuss a few
consequences, before establishing higher-order a priori estimates and regularity
in the next two sections. Throughout this section we assume that $\Omega$ is a bounded,
strongly pseudoconvex domain in $\mathbb{C}^n$.


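Proposition 3.1. For some $C > 0$,

(3.1) $\|\bar\partial u\|^2_{L^2} + \|\bar\partial^* u\|^2_{L^2} \ge C\|u\|^2_{H^{1/2}}, \quad \forall\, u \in \mathcal{D}^{0,1}.$

Proof. From (1.10) and (1.11) we have

(3.2) $K\|\Delta u\|^2_{H^{-1}} \le \|\bar\partial u\|^2_{L^2} + \|\bar\partial^* u\|^2_{L^2},$

and together with (2.1) this yields (for various $K > 0$)

(3.3) $\|\bar\partial u\|^2_{L^2} + \|\bar\partial^* u\|^2_{L^2} \ge K\|\Delta u\|^2_{H^{-1}} + K\int_{\partial\Omega}|u|^2\,dS, \quad \forall\, u \in \mathcal{D}^{0,1}.$

Now we claim that regularity for the Dirichlet problem implies

(3.4) $\|u\|^2_{H^{1/2}} \le K\|\Delta u\|^2_{H^{-1}} + K\int_{\partial\Omega}|u|^2\,dS,$

and this yields (3.1).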
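To see (3.4), suppose $\Box u = -(1/2)\Delta u = f$, $u\big|_{\partial\Omega} = g$. Write $u = u_1 + u_2$,
where

(3.5) $\Delta u_1 = -2f, \quad u_1\big|_{\partial\Omega} = 0; \qquad \Delta u_2 = 0, \quad u_2\big|_{\partial\Omega} = g.$

Then results of Chap. 5, §1 imply

(3.6) $\|u_1\|_{H^1} \le C\|f\|_{H^{-1}},$

while Propositions 11.14 and 11.15 of Chap. 5 imply

(3.7) $\|u_2\|_{H^{1/2}} \le C\|g\|_{L^2(\partial\Omega)}.$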
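
To make the passage to (3.4) explicit, here is a sketch of how the two bounds combine, reading (3.6) as an $H^{1}$ bound on $u_1$ in terms of $f \in H^{-1}$ and (3.7) as an $H^{1/2}$ bound on $u_2$ in terms of $g \in L^2(\partial\Omega)$. The constants are not tracked carefully.

```latex
\[
\begin{aligned}
\|u\|_{H^{1/2}(\Omega)}
 &\le \|u_1\|_{H^{1/2}(\Omega)} + \|u_2\|_{H^{1/2}(\Omega)}
  \le \|u_1\|_{H^{1}(\Omega)} + \|u_2\|_{H^{1/2}(\Omega)} \\
 &\le C\|f\|_{H^{-1}(\Omega)} + C\|g\|_{L^2(\partial\Omega)}
  = \tfrac{C}{2}\|\Delta u\|_{H^{-1}(\Omega)} + C\,\bigl\|u|_{\partial\Omega}\bigr\|_{L^2(\partial\Omega)}, \\
\|u\|^2_{H^{1/2}(\Omega)}
 &\le 2C^2\,\|\Delta u\|^2_{H^{-1}(\Omega)} + 2C^2\int_{\partial\Omega}|u|^2\,dS .
\end{aligned}
\]
```

The last line uses $(a+b)^2 \le 2a^2 + 2b^2$ and gives (3.4) with $K = 2C^2$.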


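More precisely, using the spaces $H_{(k,s)}(\mathcal{C})$ defined in §11 of Chap. 5, where $\mathcal{C}$
is a collar neighborhood of $\partial\Omega$, we have

(3.8) $g \in H^{k+s-1/2}(\partial\Omega) \Longrightarrow u_2 \in H_{(k,s)}(\mathcal{C}),$

and, in particular, if $\|\cdot\|_{(k,s)}$ denotes the norm in $H_{(k,s)}(\mathcal{C})$, $\|u_2\|_{(1,-1/2)} \le C\|g\|_{L^2(\partial\Omega)}$; hence

(3.9) $\|u\|^2_{(1,-1/2)} \le C\|\bar\partial u\|^2_{L^2} + C\|\bar\partial^* u\|^2_{L^2}, \quad \forall\, u \in \mathcal{D}^{0,1}.$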


Recall from (11.95) of Chap. 5 that if $\mathcal{C}$ is identified with $[0,1) \times \partial\Omega$, then

(3.10) $\|u\|^2_{(k,s)} = \sum_{j=0}^{k} \int_0^1 \|D_y^j u(y)\|^2_{H^{k+s-j}(\partial\Omega)}\,dy.$

Note that (3.7) is basically equivalent to the statement that the Poisson integral
has the property

$\mathrm{PI} : L^2(\partial\Omega) \longrightarrow H^{1/2}(\Omega).$

This also follows from results in §12 of Chap. 7.

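We next define a self-adjoint extension of $\Box = \bar\partial\bar\partial^* + \bar\partial^*\bar\partial$ on $(0,1)$-forms, satisfying
$\bar\partial$-Neumann boundary conditions. Let $\mathcal{H}_1$ be the Hilbert space completion
of $\mathcal{D}^{0,1}$ with respect to the square norm

(3.11) $Q(u,u) = \|\bar\partial u\|^2_{L^2} + \|\bar\partial^* u\|^2_{L^2}.$

We can identify $\mathcal{H}_1$ with the closure of $\mathcal{D}^{0,1}$ in $\{u \in L^2(\Omega,\Lambda^{0,1}) : \bar\partial u, \bar\partial^* u \in L^2(\Omega)\}$.
Then we have a natural, continuous, dense injection $\mathcal{H}_1 \hookrightarrow \mathcal{H}_0 = L^2(\Omega,\Lambda^{0,1})$.
Thus, the Friedrichs extension method (discussed in §1 of Chap. 8
and in §8 of Appendix A) yields an unbounded, self-adjoint operator $\mathcal{L}$ on $\mathcal{H}_0$
such that

(3.12) $\mathcal{D}(\mathcal{L}) = \{u \in \mathcal{H}_1 : v \mapsto (v,u)_{\mathcal{H}_1} \text{ is } \mathcal{H}_0\text{-continuous}\}, \qquad (\mathcal{L}u, v) = (u,v)_{\mathcal{H}_1}.$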

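Note that

(3.13) $\mathcal{D}(\mathcal{L}) \subset \mathcal{D}(\mathcal{L}^{1/2}) = \mathcal{H}_1,$

the latter identity by Proposition 1.11 of Chap. 8. The estimate (3.1) implies the
inclusion $H^{1/2}(\Omega,\Lambda^{0,1}) \supset \mathcal{H}_1$, so

(3.14) $\mathcal{D}(\mathcal{L}) \subset \mathcal{D}(\mathcal{L}^{1/2}) \subset H^{1/2}(\Omega,\Lambda^{0,1}).$

The characterization (3.12) implies that, for all $u \in \mathcal{D}(\mathcal{L})$,

(3.15) $\mathcal{L}u = \Box u = -\dfrac{1}{2}\Delta u$ in $\mathcal{D}'(\Omega)$.

Thus, interior elliptic regularity implies

(3.16) $\mathcal{D}(\mathcal{L}) \subset H^2_{\mathrm{loc}}(\Omega,\Lambda^{0,1}).$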


We see that $\mathcal{L}$ has compact resolvent. Since $\|\mathcal{L}^{1/2}u\|^2_{L^2} = (u,u)_{\mathcal{H}_1} \ge C\|u\|^2_{H^{1/2}}$
for all $u \in \mathcal{D}(\mathcal{L}^{1/2})$, zero is in its resolvent set, so $\mathcal{L}^{-1}$ is a compact,
self-adjoint operator on $L^2(\Omega,\Lambda^{0,1})$.

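Our next goal is to demonstrate that elements of $\mathcal{D}(\mathcal{L})$ do indeed satisfy the
$\bar\partial$-Neumann boundary conditions. First, if $u \in \mathcal{D}(\mathcal{L}^{1/2})$, then since $\bar\partial u \in L^2(\Omega)$
and $\bar\partial^* u \in L^2(\Omega)$, it follows that $u\big|_{\partial\Omega}$ is well defined in $\mathcal{D}'(\partial\Omega)$. Indeed, since
$u$ is a limit of a sequence $u_j \in \mathcal{D}^{0,1}$ in $\mathcal{H}_1$-norm, we can deduce from (2.1) that
$u\big|_{\partial\Omega} \in L^2(\partial\Omega)$ and $u_j\big|_{\partial\Omega} \to u\big|_{\partial\Omega}$ in $L^2(\partial\Omega)$. It follows that

(3.17) $u \in \mathcal{D}(\mathcal{L}^{1/2}) \Longrightarrow \sigma_{\bar\partial^*}(x,\nu)u = 0$ on $\partial\Omega$.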


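Furthermore, if $u \in \mathcal{D}(\mathcal{L})$, so $\mathcal{L}u = f \in L^2(\Omega,\Lambda^{0,1})$, we can write $u = u_1 + u_2$
where $u_1 \in H^2(\Omega) \cap H^1_0(\Omega)$ solves $\Delta u_1 = -2f$ and $u_2 \in H^{1/2}(\Omega)$ is
harmonic. It follows that

$u_1\big|_{\partial\Omega} \in H^{3/2}(\partial\Omega), \qquad \bar\partial u_1\big|_{\partial\Omega} \in H^{1/2}(\partial\Omega).$

Since $u_2$ is harmonic, $u_2\big|_{\partial\Omega}$ and $\bar\partial u_2\big|_{\partial\Omega}$ are well defined, in $\mathcal{D}'(\partial\Omega)$. Hence $u\big|_{\partial\Omega}$
and $\bar\partial u\big|_{\partial\Omega}$ are well defined. The same argument also applies to $\bar\partial^* u$.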
We now establish the following. 


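Proposition 3.2. If $u \in \mathcal{D}(\mathcal{L})$, then $u$ satisfies the boundary conditions (1.12),
namely,

(3.18) $\sigma_{\bar\partial^*}(x,\nu)u = 0, \qquad \sigma_{\bar\partial^*}(x,\nu)\bar\partial u = 0$ on $\partial\Omega$.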


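Proof. The first identity in (3.18) follows from (3.17). To get the second identity,
note that if $\mathcal{L}u = f$, we have

(3.19) $(\bar\partial u, \bar\partial v)_{L^2} + (\bar\partial^* u, \bar\partial^* v)_{L^2} = (f, v)_{L^2}, \quad \forall\, v \in \mathcal{D}^{0,1}.$

We have already noted that $\bar\partial\bar\partial^* u + \bar\partial^*\bar\partial u = f$ in $\Omega$. Furthermore, the comments
above imply that, for all $v \in \Lambda^{0,1}(\overline{\Omega})$,

(3.20) $(\bar\partial u, \bar\partial v)_{L^2} + (\bar\partial^* u, \bar\partial^* v)_{L^2} = (\Box u, v)_{L^2} + \beta(u,v),$

where, in parallel with (A.12),

(3.21) $\beta(u,v) = \dfrac{1}{i}\int_{\partial\Omega} \bigl[\langle \sigma_{\bar\partial^*}(x,\nu)\bar\partial u, v\rangle + \langle \bar\partial^* u, \sigma_{\bar\partial^*}(x,\nu)v\rangle\bigr]\,dS.$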


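The last term in the integrand vanishes if $v \in \mathcal{D}^{0,1}$, so we deduce that

(3.22) $u \in \mathcal{D}(\mathcal{L}) \Longrightarrow \int_{\partial\Omega} \langle \sigma_{\bar\partial^*}(x,\nu)\bar\partial u, v\rangle\,dS = 0, \quad \forall\, v \in \mathcal{D}^{0,1}.$

In particular, (3.22) holds for $v = \sigma_{\bar\partial^*}(x,\nu)\varphi$ on $\partial\Omega$, for any $\varphi \in \Lambda^{0,2}(\overline{\Omega})$, so

(3.23) $u \in \mathcal{D}(\mathcal{L}) \Longrightarrow \int_{\partial\Omega} \langle \sigma^*\sigma\,\bar\partial u, \varphi\rangle\,dS = 0, \quad \forall\, \varphi \in \Lambda^{0,2}(\overline{\Omega}),$

where $\sigma$ is short for $\sigma_{\bar\partial^*}(x,\nu)$. This implies that $\sigma^*\sigma$ annihilates $\bar\partial u$ on $\partial\Omega$. Since
$\bar\partial u\big|_{\partial\Omega}$ has been shown only to be in $\mathcal{D}'(\partial\Omega)$, we need a little care in deducing
that $\sigma$ annihilates $\bar\partial u$ on $\partial\Omega$, but since $\sigma(x)^*\sigma(x)$ is a smooth, projection-valued
function on $\partial\Omega$, this implication follows, and Proposition 3.2 is proved.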


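For a converse of sorts, suppose $u \in \mathcal{H}_1$ and $\Box u = f \in L^2(\Omega)$. The argument
below (3.17) implies that $u\big|_{\partial\Omega}$ and $\bar\partial u\big|_{\partial\Omega}$ are well defined in $\mathcal{D}'(\partial\Omega)$.
Also, (3.20)–(3.21) hold for such $u$ and for any $v \in \mathcal{D}^{0,1}$. Hence, as long as
$\sigma_{\bar\partial^*}(x,\nu)u = 0 = \sigma_{\bar\partial^*}(x,\nu)\bar\partial u$ on $\partial\Omega$, we have

$Q(v,u) \le C(u)\|v\|_{L^2}, \quad \forall\, v \in \mathcal{D}^{0,1}.$

In view of the characterization (3.12), we have the following result: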
In view of the characterization (3.12), we have the following result: 


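Proposition 3.3. The domain of $\mathcal{L}$ is specified by

(3.24) $\mathcal{D}(\mathcal{L}) = \{u \in \mathcal{H}_1 : \Box u \in L^2(\Omega),\ \sigma_{\bar\partial^*}(x,\nu)u = 0,\ \sigma_{\bar\partial^*}(x,\nu)\bar\partial u = 0 \text{ on } \partial\Omega\}.$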


We record another regularity estimate: 
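Proposition 3.4. If $u \in \mathcal{D}(\mathcal{L})$, then $u \in H_{(1,-1/2)}(\mathcal{C})$ and

(3.25) $\|u\|^2_{(1,-1/2)} \le C\|\mathcal{L}u\|_{L^2}\|u\|_{L^2}.$

Proof. If $u \in \mathcal{D}(\mathcal{L}) \subset \mathcal{D}(\mathcal{L}^{1/2})$, then the estimate (3.9) holds. Hence

$\|u\|^2_{(1,-1/2)} \le C(u,u)_{\mathcal{H}_1} = C(\mathcal{L}u, u)_{L^2}.$

Applying Cauchy's inequality yields (3.25).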


Exercises 


1. Consider the space 


Ht = {u € 17(9,A%") : due L?, Tue L’, oe (x, v)u| = 0}. 
aQ 


with square norm ||Ou||?,2 + ||" ul|Z,2 + ||u||22. Try to show that D®" is dense in H® 
and hence that HF = Ti: 
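2. For small $s > 0$, let

$\Sigma_s = \{z \in \Omega : \rho(z) = -s\},$

so $\Sigma_0 = \partial\Omega$. Assume that, for $0 \le s \le b$, $\Sigma_s$ is a smooth surface on which $\nabla\rho \ne 0$.
Show that

(3.26) $\sup_{0 \le s \le b} \|u\|^2_{L^2(\Sigma_s)} \le C\bigl(\|\bar\partial u\|^2_{L^2} + \|\bar\partial^* u\|^2_{L^2}\bigr), \quad u \in \mathcal{D}^{0,1}.$

(Hint: Follow the argument using (3.5)–(3.7), but replace (3.7) by

$\mathrm{PI} : L^2(\partial\Omega) \longrightarrow L^2(\Sigma_s),$

with an appropriate norm estimate.)

3. Show that (3.26), together with the fact that

$(\mathcal{L} + I)^{-1} : L^2(\Omega,\Lambda^{0,1}) \longrightarrow H^2_{\mathrm{loc}}(\Omega,\Lambda^{0,1}),$

implies that $(\mathcal{L} + I)^{-1}$ is compact on $L^2(\Omega,\Lambda^{0,1})$, without making use of (3.14).
Compare [Mor], p. 336.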


4. Higher-order subelliptic estimates 


We want to extend the estimates (3.9) and (3.25) to estimates on higher derivatives
of $u \in \Lambda^{0,1}(\overline{\Omega}) \cap \mathcal{D}(\mathcal{L})$, in terms of estimates on $\mathcal{L}u$. The associated regularity
results will be established in §5. As in §3, we make the standing assumption that $\Omega$
is a bounded, strongly pseudoconvex domain in $\mathbb{C}^n$. We begin with the following
improvement of (3.25).


Proposition 4.1. For $u \in \Lambda^{0,1}(\overline{\Omega}) \cap \mathcal{D}(\mathcal{L})$, we have

(4.1) $\|u\|^2_{H^1} \le C\|\mathcal{L}u\|^2_{L^2}.$


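Proof. It suffices to consider the case where $u$ is supported on the collar neighborhood
$\mathcal{C}$ of $\partial\Omega$ introduced in §3. As there, we identify $\mathcal{C}$ with $[0,1) \times \partial\Omega$.
Let $\Lambda^{1/2} \in OPS^{1/2}(\partial\Omega)$ be an elliptic self-adjoint operator, with scalar principal
symbol, acting on sections of $\Lambda^{0,1}\big|_{\partial\Omega}$. Arrange that $\Lambda^{1/2}$ commutes with
$P(x) = \sigma(x)^*\sigma(x)$, where $\sigma(x) = \sigma_{\bar\partial^*}(x,\nu)$. Note that

(4.2) $\|u\|^2_{H^1(\mathcal{C})} \approx \|\Lambda^{1/2}u\|^2_{(1,-1/2)}.$

Now, if $u \in \Lambda^{0,1}(\overline{\Omega}) \cap \mathcal{D}(\mathcal{L})$, then $\Lambda^{1/2}u \in \mathcal{D}^{0,1}$ and, by (3.9),

(4.3) $\|\Lambda^{1/2}u\|^2_{(1,-1/2)} \le C\,Q(\Lambda^{1/2}u, \Lambda^{1/2}u).$

Below, we will show that, for a certain smooth family of operators $A(y) \in OPS^1(\partial\Omega)$,
we have $Au \in \mathcal{D}^{0,1}$ and

(4.4) $Q(\Lambda^{1/2}u, \Lambda^{1/2}u) = Q(u, Au) + R, \qquad |R| \le C\|u\|^2_{(1,-1/2)}.$

Granted this, we have (4.3) dominated by

(4.5) $C\,\mathrm{Re}\,Q(u, Au) + C\|u\|^2_{(1,-1/2)} = C\,\mathrm{Re}\,(\mathcal{L}u, Au)_{L^2} + C\|u\|^2_{(1,-1/2)} \le C\|\mathcal{L}u\|_{L^2}\|u\|_{(0,1)} + C\|u\|^2_{(1,-1/2)}.$

Writing $C\|\mathcal{L}u\|_{L^2}\|u\|_{(0,1)} \le (C^2/\varepsilon)\|\mathcal{L}u\|^2_{L^2} + \varepsilon\|u\|^2_{H^1}$ and absorbing the latter
term on the left, we have

(4.6) $\|u\|^2_{H^1} \le C\|\mathcal{L}u\|^2_{L^2} + C\|u\|^2_{(1,-1/2)}.$

If we use (3.25) to estimate the last term and recall that zero is not in the spectrum
of $\mathcal{L}$, we have (4.1).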
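
The absorption step and the final appeal to (3.25) can be written out as follows. This is a sketch with one specific choice of the small parameter; the constants $C_1, \dots, C_4$ are labels introduced here, and $\|u\|_{(0,1)} \le \|u\|_{H^1}$ is used.

```latex
\[
\begin{aligned}
\|u\|^2_{H^1}
 &\le C_1\,\|\mathcal{L}u\|_{L^2}\,\|u\|_{(0,1)} + C_2\,\|u\|^2_{(1,-1/2)}
  \le \tfrac12 C_1^2\,\|\mathcal{L}u\|^2_{L^2} + \tfrac12\,\|u\|^2_{H^1} + C_2\,\|u\|^2_{(1,-1/2)}, \\
\|u\|^2_{H^1}
 &\le C_1^2\,\|\mathcal{L}u\|^2_{L^2} + 2C_2\,\|u\|^2_{(1,-1/2)}, \\
\|u\|^2_{(1,-1/2)}
 &\le C_3\,\|\mathcal{L}u\|_{L^2}\,\|u\|_{L^2}
  \le C_3 C_4\,\|\mathcal{L}u\|^2_{L^2}
  \qquad\text{since } \|u\|_{L^2} \le C_4\,\|\mathcal{L}u\|_{L^2}.
\end{aligned}
\]
```

The first line uses $ab \le \tfrac12 a^2 + \tfrac12 b^2$ with $a = C_1\|\mathcal{L}u\|_{L^2}$ and $b = \|u\|_{(0,1)}$; the bound $\|u\|_{L^2} \le C_4\|\mathcal{L}u\|_{L^2}$ expresses that $\mathcal{L}^{-1}$ is bounded. Combining the last two lines gives (4.1).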


Our next step is to obtain higher-order a priori estimates in the tangential 
directions. 


Proposition 4.2. For $u \in \Lambda^{0,1}(\overline{\Omega}) \cap \mathcal{D}(\mathcal{L})$, $k \ge 1$, we have

(4.7) $\|u\|^2_{(1,k/2-1/2)} \le C\|\mathcal{L}u\|^2_{(0,k/2-1/2)} + C\|u\|^2_{L^2}.$


Proof. We will prove (4.7) by induction on $k$; the case $k = 1$ is implied by (4.1).
To begin, we have

(4.8) $\|u\|^2_{(1,k/2-1/2)} \approx \|\Lambda^{k/2}u\|^2_{(1,-1/2)} \le C\,Q(\Lambda^{k/2}u, \Lambda^{k/2}u),$

the latter inequality by (3.9), since $\Lambda^{k/2}u \in \mathcal{D}^{0,1}$. Now, extending (4.4), we have

(4.9) $Q(\Lambda^{k/2}u, \Lambda^{k/2}u) = Q(u, A_k u) + R_k, \qquad |R_k| \le C\|u\|^2_{(1,k/2-1)},$

for a certain smooth family of operators $A_k(y) \in OPS^k(\partial\Omega)$, for which $A_k u \in \mathcal{D}^{0,1}$,
as will be demonstrated below. Thus (4.8) is dominated by

(4.10) $C\,\mathrm{Re}\,Q(u, A_k u) + C\|u\|^2_{(1,k/2-1)} = C\,\mathrm{Re}\,(\mathcal{L}u, A_k u)_{L^2} + C\|u\|^2_{(1,k/2-1)} \le C\|\mathcal{L}u\|_{(0,k/2-1/2)}\|u\|_{(0,k/2+1/2)} + C\|u\|^2_{(1,k/2-1)}.$

As in the passage from (4.5) to (4.6), this implies

(4.11) $\|u\|^2_{(1,k/2-1/2)} \le C\|\mathcal{L}u\|^2_{(0,k/2-1/2)} + C\|u\|^2_{(1,k/2-1)},$

which by induction on $k$ yields the desired estimate (4.7).


We now take up the task of proving (4.4) and (4.9). It will be convenient to
assume that the diffeomorphism $\mathcal{C} \approx [0,1) \times \partial\Omega$ has the property that Lebesgue
measure on $\mathcal{C}$, induced from that on $\mathbb{C}^n$, coincides with the product measure on
$[0,1) \times \partial\Omega$, up to a constant factor, a matter that can be arranged. We retain the
fiber metric on $\Lambda^{0,1}$; on $\{y\} \times \partial\Omega$ this fiber metric depends on $y$. Then $\Lambda^{k/2}$,
originally specified to be self-adjoint on $L^2(\partial\Omega,\Lambda^{0,1})$, has the property

(4.12) $(\Lambda^{k/2}u, v)_{L^2(\mathcal{C})} = (u, \Lambda_{k/2}v)_{L^2(\mathcal{C})},$

where

(4.13) $\Lambda_{k/2}u(y) = \Lambda^{k/2}u(y) + B_k(y)u(y), \qquad B_k(y) \in OPS^{k/2-1}(\partial\Omega), \quad B_k(0) = 0.$

Then we take

(4.14) $A_k(y) = \Lambda_{k/2}\Lambda^{k/2} = \Lambda^k + B_k(y)\Lambda^{k/2}.$

Clearly, $A_k$ preserves $\mathcal{D}^{0,1}$.

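Now, if we also let $\Lambda^{k/2}$ denote an elliptic self-adjoint operator in the class
$OPS^{k/2}(\partial\Omega)$, acting on sections of $\Lambda^{0,0}$ and $\Lambda^{0,2}$, having the same scalar
principal symbol as the one acting on sections of $\Lambda^{0,1}$, we can write

(4.15) $(\bar\partial\Lambda^{k/2}u, \bar\partial\Lambda^{k/2}u)_{L^2} = (\bar\partial u, \Lambda_{k/2}\bar\partial\Lambda^{k/2}u)_{L^2} + ([\bar\partial, \Lambda^{k/2}]u, \bar\partial\Lambda^{k/2}u)_{L^2}.$

Further commutator pushing, plus use of the fact that the left side of (4.15) is
real-valued, yields

(4.16) $(\bar\partial\Lambda^{k/2}u, \bar\partial\Lambda^{k/2}u)_{L^2} = \mathrm{Re}\,(\bar\partial u, \bar\partial A_k u)_{L^2} + R_{k1},$

where

(4.17) $R_{k1} = \mathrm{Re}\,\bigl\{([\bar\partial, \Lambda^{k/2}]u, \bar\partial\Lambda^{k/2}u)_{L^2} + (\bar\partial u, [\Lambda_{k/2}, \bar\partial]\Lambda^{k/2}u)_{L^2}\bigr\} = \mathrm{Re}\,(R_{k2} + R_{k3}).$

Now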




Ryo + Rez = (1, AF? Vy, OA*/?u) a (3, Ag/2|A*/2u, du) be 


7 
= ((6. A¥/2\u, (8, A*/}u) fs (Bild, A¥/?Ju, 5u) 

(4.18) BR e 

+ ([a*?, (9, A¥/}Ju, Du) eer ([Bx, J]A*/2u, Bu) . 
= Rea + Res + Reo + Rez, 
and standard pseudodifferential operator estimates yield

(4.19) $|R_{k4}| + |R_{k5}| + |R_{k6}| + |R_{k7}| \le C\|u\|^2_{(1,k/2-1)},$

which consequently bounds $\mathrm{Re}\,(R_{k2} + R_{k3})$. The term

$(\bar\partial^*\Lambda^{k/2}u, \bar\partial^*\Lambda^{k/2}u)_{L^2}$

has a similar analysis, so the estimates in (4.4) and (4.9) follow, and the proofs of
Propositions 4.1 and 4.2 are complete.
The following is our main a priori estimate. 


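Proposition 4.3. For $u \in \Lambda^{0,1}(\overline{\Omega}) \cap \mathcal{D}(\mathcal{L})$ and $j, k \ge 1$, we have

(4.20) $\|u\|^2_{(j,k/2-1/2)} \le C\|\mathcal{L}u\|^2_{(j-1,k/2-1/2)} + C\|u\|^2_{L^2},$

and hence

(4.21) $\|u\|^2_{H^j} \le C\|\mathcal{L}u\|^2_{H^{j-1}}.$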


Proof. It suffices to prove (4.20) since the $k = 1$ case of this plus the invertibility
of $\mathcal{L}$ implies (4.21). Note that the $j = 1$ case of (4.20) is precisely the conclusion
of Proposition 4.2. We will give an inductive proof for $j \ge 2$. Note that if $j \ge 2$,

(4.22) $\|u\|^2_{(j,k/2-1/2)} \approx \|D_y^2 u\|^2_{(j-2,k/2-1/2)} + \|u\|^2_{(j-1,k/2+1/2)}.$

Now since $\mathcal{L}$ is elliptic, we can use the standard trick of writing $D_y^2 u$ in terms of
$\mathcal{L}u$, second-order tangential derivatives of $u$, and first-order tangential derivatives
of $D_y u$, to obtain

(4.23) $\|u\|^2_{(j,k/2-1/2)} \le C\|\mathcal{L}u\|^2_{(j-2,k/2-1/2)} + C\|u\|^2_{(j-1,k/2+1/2)}.$

The inductive hypothesis dominates the last term by $C\|\mathcal{L}u\|^2_{(j-2,k/2+1/2)} + C\|u\|^2_{L^2}$,
and this implies (4.20).

Note that if the $\bar\partial$-Neumann boundary condition were regular, we would have
the estimate $\|u\|^2_{H^{j+1}} \le C\|\mathcal{L}u\|^2_{H^{j-1}}$ in place of (4.21). The estimate (4.21) is
called a subelliptic estimate. One also says that the $\bar\partial$-Neumann problem on a
strongly pseudoconvex domain is subelliptic, with loss of one derivative.


Exercises

1. Sharpen the estimate (4.20) to

$\|u\|^2_{(j,k/2-1/2)} \le C\|\mathcal{L}u\|^2_{(j-2,k/2+1/2)} + C\|u\|^2_{L^2},$

for all $u \in \Lambda^{0,1}(\overline{\Omega}) \cap \mathcal{D}(\mathcal{L})$, provided $k \ge 1$ and $j \ge 2$. In particular,

$\|u\|^2_{H^2} \le C\|\mathcal{L}u\|^2_{(0,1)} + C\|u\|^2_{L^2}.$

2. Verify (4.19), namely, that $|R_{kj}| \le C\|u\|^2_{(1,k/2-1)}$ for $4 \le j \le 7$.
(Hint: For example, part of the desired estimate on $|R_{k4}|$ follows from an estimate

$\|[X, \Lambda^{k/2}]u\|_{L^2(\partial\Omega)} \le C\|u\|_{H^{k/2}(\partial\Omega)},$

for any first-order differential operator $X$ on $\partial\Omega$. This in turn follows since

$[X, \Lambda^{k/2}] \in OPS^{k/2}(\partial\Omega).$

Similarly, part of the desired estimate on $|R_{k6}|$ follows because

$[\Lambda^{k/2}, [X, \Lambda^{k/2}]] \in OPS^{k-1}(\partial\Omega).$)


5. Regularity via elliptic regularization 


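Our main goal here is to go from the a priori estimate that $\|u\|^2_{H^j} \le C\|\mathcal{L}u\|^2_{H^{j-1}}$
provided $u \in \mathcal{D}(\mathcal{L})$ is smooth on $\overline{\Omega}$ to the regularity result that whenever
$u \in \mathcal{D}(\mathcal{L})$ and $\mathcal{L}u = f \in H^{j-1}(\Omega)$, then $u \in H^j(\Omega)$. Following [KN], we use
the method of elliptic regularization, which is the following. For $\delta > 0$, consider
the quadratic form

(5.1) $Q_\delta(u,u) = Q(u,u) + \delta\sum_j \|\partial_j u\|^2_{L^2}, \quad u \in \mathcal{D}^{0,1},$

where $Q(u,u) = \|\bar\partial u\|^2_{L^2} + \|\bar\partial^* u\|^2_{L^2}$ as in §3, and $\partial_j = \partial/\partial x_j$, $\partial_{n+j} = \partial/\partial y_j$,
$1 \le j \le n$, applied to $u$ componentwise. We take $\mathcal{H}_{1\delta}$ to be the completion
of $\mathcal{D}^{0,1}$ with respect to the square norm $Q_\delta$. Due to the last term in (5.1), we
obviously have

(5.2) $\mathcal{H}_{1\delta} = \{u \in H^1(\Omega,\Lambda^{0,1}) : \sigma_{\bar\partial^*}(x,\nu)u = 0 \text{ on } \partial\Omega\}, \quad \forall\, \delta > 0.$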


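Note that $\mathcal{H}_{1\delta} \subset \mathcal{H}_1$, for $\delta > 0$, and $Q_\delta(u,u) \ge Q(u,u)$, for $u \in \mathcal{H}_{1\delta}$. Thus
Morrey's inequality and the proof of Proposition 3.1 apply, yielding

(5.3) $Q_\delta(u,u) \ge C\|u\|^2_{H^{1/2}} + C\int_{\partial\Omega}|u|^2\,dS + C_0\delta\|u\|^2_{H^1}, \quad u \in \mathcal{H}_{1\delta}.$

We will define the self-adjoint operator $\mathcal{L}_\delta$ by the Friedrichs extension method,
so $\mathcal{D}(\mathcal{L}_\delta^{1/2}) = \mathcal{H}_{1\delta}$ and

(5.4) $(\mathcal{L}_\delta u, v)_{L^2} = Q_\delta(u,v), \quad \forall\, u \in \mathcal{D}(\mathcal{L}_\delta),\ v \in \mathcal{H}_{1\delta}.$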


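Thus $\mathcal{L}_\delta^{-1}$ is a compact, self-adjoint operator on $L^2(\Omega,\Lambda^{0,1})$. Note that if
$u \in \mathcal{D}(\mathcal{L}_\delta)$, the argument used in the proof of Proposition 3.2 shows that $\bar\partial u\big|_{\partial\Omega}$ is
well defined in $\mathcal{D}'(\partial\Omega)$, and, for $v \in \mathcal{D}^{0,1}$, we have

(5.5) $Q_\delta(u,v) = \bigl((\Box - \delta\Delta)u, v\bigr)_{L^2} + \beta_\delta(u,v),$

where

(5.6) $\beta_\delta(u,v) = \int_{\partial\Omega} \Bigl[\dfrac{1}{i}\langle \sigma_{\bar\partial^*}(x,\nu)\bar\partial u, v\rangle + \delta\Bigl\langle \dfrac{\partial u}{\partial\nu}, v\Bigr\rangle\Bigr]\,dS.$

If we set $v = \sigma_{\bar\partial^*}(x,\nu)\varphi$ on $\partial\Omega$, we deduce that

(5.7) $u \in \mathcal{D}(\mathcal{L}_\delta) \Longrightarrow \sigma_{\bar\partial^*}(x,\nu)u = 0$ and $\sigma_{\bar\partial}(x,\nu)\Bigl[\dfrac{1}{i}\sigma_{\bar\partial^*}(x,\nu)\bar\partial u + \delta\dfrac{\partial u}{\partial\nu}\Bigr] = 0$ on $\partial\Omega$.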
For any $\delta > 0$, (5.3) is a coercive estimate. Such arguments as used in §7 of
Chap. 5, for the Neumann boundary problem, produce higher-order estimates of
the form

(5.8) $\|u\|^2_{H^{j+2}} \le C_{j\delta}\|\mathcal{L}_\delta u\|^2_{H^j}, \quad u \in \mathcal{D}(\mathcal{L}_\delta),$

plus associated regularity theorems. Alternatively, the boundary condition (5.7)
for the operator $\Box_\delta = \Box - \delta\Delta = -(1/2 + \delta)\Delta$ is seen to be a regular boundary
condition, and the results of §11 in Chap. 5 apply. Thus, for each $\delta > 0$,

(5.9) $\mathcal{L}_\delta^{-1} : H^j(\Omega) \longrightarrow H^{j+2}(\Omega), \quad j \ge 0.$

The estimates in (5.9) depend crucially on $\delta$ of course, and one loses control
as $\delta \searrow 0$. However, the analysis of §4 applies to $\mathcal{L}_\delta$, and one obtains

(5.10) $\|u\|^2_{H^{j+1}} \le C_j\|\mathcal{L}_\delta u\|^2_{H^j}, \quad u \in \mathcal{D}(\mathcal{L}_\delta) \cap \Lambda^{0,1}(\overline{\Omega}),$

with $C_j$ independent of $\delta \in (0,1]$. Using this, we will establish the following:




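Proposition 5.1. The operator $\mathcal{L}$ has the property that

(5.11) $\mathcal{L}^{-1} : H^j(\Omega) \longrightarrow H^{j+1}(\Omega), \quad j \ge 0,$

and

(5.12) $\mathcal{L}^{-1} : \Lambda^{0,1}(\overline{\Omega}) \longrightarrow \Lambda^{0,1}(\overline{\Omega}).$


Of course, (5.12) follows from (5.11), but it will be technically convenient to
prove these results together, completing the proof of (5.12) shortly before that of
(5.11).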

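To begin, take $f \in \Lambda^{0,1}(\overline{\Omega})$ (so $f$ is smooth on $\overline{\Omega}$). Then, for each
$\delta > 0$, $\mathcal{L}_\delta^{-1}f = u_\delta \in \Lambda^{0,1}(\overline{\Omega})$. Hence (5.10) is applicable; we have
$\{u_\delta : \delta \in (0,1]\}$ bounded in $H^j(\Omega)$, for each $j$. Thus this set is relatively
compact in $H^{j-1}(\Omega)$ for each $j$, so there is a limit point

$u_0 \in \bigcap_{j > 0} H^j(\Omega,\Lambda^{0,1}) = \Lambda^{0,1}(\overline{\Omega});$

$u_{\delta_\nu} \to u_0$ in the $C^\infty$-topology while $\delta_\nu \searrow 0$. Now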


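(5.13) $(\Box - \delta_\nu\Delta)u_{\delta_\nu} = f \Longrightarrow \Box u_0 = f.$

Also,

(5.14) $\sigma_{\bar\partial^*}(x,\nu)u_\delta\big|_{\partial\Omega} = 0 \Longrightarrow \sigma_{\bar\partial^*}(x,\nu)u_0\big|_{\partial\Omega} = 0$

and

(5.15) $\sigma_{\bar\partial}(x,\nu)\Bigl[\dfrac{1}{i}\sigma_{\bar\partial^*}(x,\nu)\bar\partial u_\delta + \delta\dfrac{\partial u_\delta}{\partial\nu}\Bigr]\Big|_{\partial\Omega} = 0 \Longrightarrow \sigma_{\bar\partial^*}(x,\nu)\bar\partial u_0\big|_{\partial\Omega} = 0.$

Therefore, $u_0 \in \mathcal{D}(\mathcal{L})$, so

(5.16) $\mathcal{L}^{-1}f = u_0 \in \Lambda^{0,1}(\overline{\Omega}).$

This proves (5.12).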
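To prove (5.11), if $f \in H^j(\Omega,\Lambda^{0,1})$, take $f_\nu \in \Lambda^{0,1}(\overline{\Omega})$ so that $f_\nu \to f$ in
$H^j(\Omega)$. We have $u_\nu = \mathcal{L}^{-1}f_\nu \in \Lambda^{0,1}(\overline{\Omega})$ and, by (4.21),

(5.17) $\|u_\mu - u_\nu\|^2_{H^{j+1}} \le C\|f_\mu - f_\nu\|^2_{H^j}.$

Hence $(u_\nu)$ is Cauchy in $H^{j+1}(\Omega)$, so $\mathcal{L}^{-1}f = \lim_\nu u_\nu \in H^{j+1}(\Omega)$.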




Exercises 


1. Verify that the boundary condition described in (5.7) is a regular boundary condition 
for £5 = CsA, as defined in §11 of Chap. 5. 

2. As an approach to Exercise 1, show that the analogue of the boundary condition (1.15) 
in this case, for the region {Im z, > 0}, is 


Lg gee 


‘an,0) = l<j<n-l. 


Show that the pseudodifferential equation arising in parallel with (1.18) is 
(2 +(i+ 26)N)h = 
OLn ae 


and that, for any 6 > 0, the pseudodifferential operator acting on h is elliptic. 


6. The Hodge decomposition and the 0-equation 
We begin with the following Hodge decomposition theorem. 


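Theorem 6.1. If $\Omega$ is a bounded, strongly pseudoconvex domain in $\mathbb{C}^n$, then,
given $u \in \Lambda^{0,1}(\overline{\Omega})$, we have

(6.1) $u = \bar\partial\bar\partial^*\mathcal{L}^{-1}u + \bar\partial^*\bar\partial\mathcal{L}^{-1}u = P_{\bar\partial}u + P_{\bar\partial^*}u.$

The two terms on the right side are mutually orthogonal in $L^2(\Omega,\Lambda^{0,1})$. Furthermore,

(6.2) $P_{\bar\partial},\ P_{\bar\partial^*} : H^j(\Omega,\Lambda^{0,1}) \longrightarrow H^{j-1}(\Omega,\Lambda^{0,1}), \quad j \ge 1.$


Proof. The first identity in (6.1) is equivalent to $u = \Box\mathcal{L}^{-1}u$, and the second is
simply the definition of $P_{\bar\partial}$ and $P_{\bar\partial^*}$. That (6.2) holds follows from (5.11). Only
the orthogonality remains to be checked.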

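Following the proof of Proposition A.3, we use

(6.3) $(\bar\partial v, w)_{L^2} = (v, \bar\partial^* w)_{L^2} + \dfrac{1}{i}\int_{\partial\Omega} \langle v, \sigma_{\bar\partial^*}(x,\nu)w\rangle\,dS,$

valid for $v \in \Lambda^{0,q}(\overline{\Omega})$, $w \in \Lambda^{0,q+1}(\overline{\Omega})$. Thus

(6.4) $w \in \mathcal{D}^{0,q+1} \Longrightarrow \bar\partial^* w \perp \ker\bar\partial \cap \Lambda^{0,q}(\overline{\Omega}),$

where $\mathcal{D}^{0,q+1}$ is defined as in (2.2). Results established in previous sections imply

(6.5) $\bar\partial\mathcal{L}^{-1} : \Lambda^{0,1}(\overline{\Omega}) \longrightarrow \mathcal{D}^{0,2},$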


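so we can apply (6.4) to $w = \bar\partial\mathcal{L}^{-1}u$ to get

(6.6) $\bar\partial^*\bar\partial\mathcal{L}^{-1}u \perp \ker\bar\partial \cap \Lambda^{0,1}(\overline{\Omega}).$

Hence

(6.7) $u_j \in \Lambda^{0,1}(\overline{\Omega}) \Longrightarrow P_{\bar\partial}u_1 \perp P_{\bar\partial^*}u_2$ in $L^2(\Omega,\Lambda^{0,1})$.


This finishes the proof of the theorem. It also implies that $P_{\bar\partial}$ and $P_{\bar\partial^*}$ extend
uniquely to bounded operators (in fact, to complementary orthogonal projections)
acting on $L^2(\Omega,\Lambda^{0,1})$.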

The most significant application of this Hodge decomposition is to the equation

(6.8) $\bar\partial u = f,$

given $f \in \Lambda^{0,1}(\overline{\Omega})$, for some $u \in \Lambda^{0,0}(\overline{\Omega}) = C^\infty(\overline{\Omega})$. Since $\bar\partial^2 = 0$, a necessary
condition for solvability of (6.8) is

(6.9) $\bar\partial f = 0.$

For strongly pseudoconvex domains, this is sufficient:


Theorem 6.2. If $\Omega$ is a bounded, strongly pseudoconvex domain in $\mathbb{C}^n$, and
$f \in \Lambda^{0,1}(\overline{\Omega})$ satisfies (6.9), then there exists $u \in C^\infty(\overline{\Omega})$ satisfying (6.8).


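Proof. With $g = \mathcal{L}^{-1}f \in \Lambda^{0,1}(\overline{\Omega})$, we have

(6.10) $f = \bar\partial\bar\partial^* g + \bar\partial^*\bar\partial g = P_{\bar\partial}f + P_{\bar\partial^*}f.$

However, (6.4) applied to $w = \bar\partial g$ implies $P_{\bar\partial^*}f \perp f$, so in fact $P_{\bar\partial^*}f = 0$ and

(6.11) $f = \bar\partial(\bar\partial^* g).$

Thus we have (6.8), with $u = \bar\partial^* g$.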
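
The step from the orthogonality of the second Hodge component to $f$ to its vanishing can be spelled out in one line. The following is a sketch, using only the orthogonality of the two terms of the Hodge decomposition in Theorem 6.1 and the hypothesis $\bar\partial f = 0$.

```latex
\[
\begin{aligned}
\|P_{\bar\partial^*} f\|^2_{L^2}
 &= \bigl(P_{\bar\partial^*} f,\; P_{\bar\partial} f + P_{\bar\partial^*} f\bigr)_{L^2}
  = \bigl(P_{\bar\partial^*} f,\; f\bigr)_{L^2}
  = \bigl(\bar\partial^*(\bar\partial g),\; f\bigr)_{L^2} = 0, \\
 &\text{since } \bar\partial g \in \mathcal{D}^{0,2} \text{ and } f \in \ker\bar\partial \cap \Lambda^{0,1}(\overline{\Omega}).
\end{aligned}
\]
```

The first equality adds the term $(P_{\bar\partial^*}f, P_{\bar\partial}f)_{L^2}$, which is zero by the orthogonality statement of Theorem 6.1.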
We will use Theorem 6.2 to establish the following important result concerning
function theory on a bounded, strongly pseudoconvex domain.


Proposition 6.3. Let $\Omega$ be a bounded, strongly pseudoconvex domain in $\mathbb{C}^n$, and
fix $p \in \partial\Omega$. Then there is a function $u$, holomorphic on $\Omega$, such that $u \in C^\infty(\overline{\Omega} \setminus \{p\})$,
but $u$ blows up at $p$.


Proof. It is shown in the exercises for §2 that there are a neighborhood $O$ of $p$ and
a holomorphic function $g$, given by (2.16), such that $\{z \in O : g(z) = 0\} \cap \overline{\Omega} = \{p\}$.
Now the function

(6.12) $v = \dfrac{1}{g}$

is holomorphic on $O \cap \Omega$ and $C^\infty$ on $O \cap \overline{\Omega} \setminus \{p\}$, and it blows up at $p$.
Pick $\psi \in C_0^\infty(O)$ such that $\psi = 1$ on a neighborhood $O_2$ of $p$, and set

(6.13) $w = \psi v$

on $O$, extended to be 0 on the complement of $O$. Now consider

(6.14) $f = \bar\partial w$

on $\Omega$; we take $f = 0$ on $O_2 \cap \overline{\Omega}$. Thus $f \in \Lambda^{0,1}(\overline{\Omega})$ and $\bar\partial f = 0$, so by Theorem 6.2
there exists

(6.15) $w_2 \in C^\infty(\overline{\Omega}), \quad \bar\partial w_2 = f.$

Now we set

(6.16) $u = w - w_2.$

We have $\bar\partial u = f - f = 0$ on $\Omega$, so $u$ is holomorphic on $\Omega$. The construction of $w$
and the smoothness of $w_2$ on $\overline{\Omega}$ imply that $u \in C^\infty(\overline{\Omega} \setminus \{p\})$ and that $u$ blows up
at $p$, so the proof is complete.


Assuming that $\Omega$ is a bounded, strongly pseudoconvex domain in $\mathbb{C}^n$, we construct
another special holomorphic function on $\Omega$, as follows. Let $\{p_j : j \in \mathbb{Z}^+\}$
be a dense set of points in $\partial\Omega$, and for each $j$ let $u_j$ be a holomorphic function
on $\Omega$ such that $u_j \in C^\infty(\overline{\Omega} \setminus \{p_j\})$, constructed as above. Then we can produce
mutually disjoint line segments $\gamma_j$ lying in $\Omega$, normal to $\partial\Omega$ at $p_j$, such that $u_j\big|_{\gamma_j}$
blows up at $p_j$. Now consider

(6.17) $u = \sum_{j \ge 0} c_j u_j,$

where $c_j \in \mathbb{C}$ are all nonzero, but picked so small that

(i) $|c_j u_j(z)| < 2^{-j}$ on $O_j = \{z \in \Omega : \mathrm{dist}(z, \partial\Omega) \ge 2^{-j}\}$,
(ii) $|c_j u_j(z)| < 2^{-j}$, for $z \in \bigcup_{\ell < j} \gamma_\ell$.

Condition (i) implies that (6.17) is uniformly convergent on compact subsets of
$\Omega$, hence $u$ is holomorphic on $\Omega$. Condition (ii) implies that, for each $k \in \mathbb{Z}^+$,
$v_k = \sum_{j \ne k} c_j u_j$ is bounded on $\gamma_k$; hence $u = v_k + c_k u_k$ is unbounded on $\gamma_k$.
This produces a holomorphic function on $\Omega$ with the following property:


Proposition 6.4. If $\Omega$ is a bounded, strongly pseudoconvex domain in $\mathbb{C}^n$, then
there is a holomorphic function $u$ on $\Omega$ that is unbounded on each open set $O \cap \Omega$,
for any open $O$ such that $O \cap \partial\Omega \ne \emptyset$. Hence $u$ does not extend holomorphically
past any point in $\partial\Omega$.


A domain $\Omega \subset \mathbb{C}^n$ having such an inextensible holomorphic function $u$ is
called a domain of holomorphy. Domains of holomorphy play an important role
in the theory of holomorphic functions of several complex variables; we refer to
[GR, Ho3, Kr1], and [Lel] for material on this.

We mention that, for the solution to (6.8) given by

(6.18) $u = Sf = \bar\partial^*\mathcal{L}^{-1}f,$

we have $S : H^j(\Omega) \to H^j(\Omega)$, as a consequence of (5.11). In fact, one can do
better:

(6.19) $S : H^j(\Omega) \longrightarrow H^{j+1/2}(\Omega).$

One method of proving this is sketched in the exercises after §9.
Exercises 

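1. Interpolate (6.2) with the $L^2$-boundedness of $P_{\bar\partial}$ and $P_{\bar\partial^*}$ to show that

$P_{\bar\partial},\ P_{\bar\partial^*} : H^j(\Omega,\Lambda^{0,1}) \longrightarrow H^{j-\varepsilon}(\Omega,\Lambda^{0,1}), \quad \forall\, \varepsilon > 0,\ j \ge 1.$

(Hint: Replace $j$ by $Nj$ in (6.2).)
Can you get rid of the $\varepsilon$?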


7. The Bergman projection and Toeplitz operators 
We use the operator $\mathcal{L}^{-1}$ on $(0,1)$-forms to produce the following Hodge decomposition
for $(0,0)$-forms. Throughout this section we assume that $\Omega$ is a bounded
strongly pseudoconvex domain in $\mathbb{C}^n$.


Proposition 7.1. For all $u \in \Lambda^{0,0}(\overline{\Omega})$,

(7.1) $u = Bu + \bar\partial^*\mathcal{L}^{-1}\bar\partial u$

is an orthogonal decomposition in $L^2(\Omega)$. The operator $B$, extended to $L^2(\Omega)$,
coincides with the orthogonal projection onto

(7.2) $\mathcal{H}(\Omega) = \{u \in L^2(\Omega) : \bar\partial u = 0\}.$


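Here, we take (7.1) as the definition of $B$. Thus, by (5.12), $B : \Lambda^{0,0}(\overline{\Omega}) \to \Lambda^{0,0}(\overline{\Omega})$.
We need to prove that the decomposition (7.1) is orthogonal and that $B$,
extended to $L^2(\Omega)$, is indeed the stated projection.
We first note that

(7.3) $(v, \bar\partial^*\mathcal{L}^{-1}\bar\partial u)_{L^2} = (\bar\partial v, \mathcal{L}^{-1}\bar\partial u)_{L^2}, \quad \forall\, u, v \in \Lambda^{0,0}(\overline{\Omega}),$

since the two sides differ by the integral over $\partial\Omega$ of $\langle v, \sigma_{\bar\partial^*}(x,\nu)\mathcal{L}^{-1}\bar\partial u\rangle$, which
vanishes. This identity shows that, given $v \in \Lambda^{0,0}(\overline{\Omega})$,

(7.4) $\bar\partial v = 0 \Longrightarrow v \perp \bar\partial^*\mathcal{L}^{-1}\bar\partial u, \quad \forall\, u \in \Lambda^{0,0}(\overline{\Omega}).$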


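Next we claim that

(7.5) $\bar\partial Bv = 0, \quad \forall\, v \in \Lambda^{0,0}(\overline{\Omega}).$

This is equivalent to the statement that

(7.6) $\bar\partial\bar\partial^*\mathcal{L}^{-1}\bar\partial v = \bar\partial v.$

Now, if we apply the decomposition (6.1) to $\bar\partial v$, we see that the two sides of (7.6)
differ by $\bar\partial^*\bar\partial\mathcal{L}^{-1}\bar\partial v$; but this vanishes, by (6.6), so we have (7.5).
Combining (7.4) and (7.5), we have

(7.7) $Bv \perp \bar\partial^*\mathcal{L}^{-1}\bar\partial u, \quad \forall\, u, v \in \Lambda^{0,0}(\overline{\Omega}),$

so the decomposition (7.1) is orthogonal. Thus $B$ does extend to an orthogonal
projection on $L^2(\Omega)$ and, by (7.5), $R(B) \subset \mathcal{H}(\Omega)$. If we apply (7.1) to an element
$u$ of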


(7.8) H(Q) = fu € A®°(O) : du = Of, 
we get u = Bu, so we have 
H(Q) C R(B) Cc H(Q), 


where H(Q) denotes the closure of H(Q) in L?(Q). 

In fact, since B : A®°(Q) > A®°(Q) and A®-°(Q) is dense in L?(Q), it is 
now clear that R(B) = H(Q). We could stop here (rephrasing the statement of 
Proposition 7.1), but it is of intrinsic interest to equate this space with H(Q), 


which we now do. 


Lemma 7.2. /f 0 is a strongly pseudoconvex domain in C”, then H(Q) is dense 
in H(Q). 




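Proof. It suffices to show that

(7.9) $\mathcal{H}(\Omega) \perp \bar\partial^*\mathcal{L}^{-1}\bar\partial u, \quad \forall\, u \in \Lambda^{0,0}(\overline{\Omega}).$

Now, if $v \in \mathcal{H}(\Omega)$ and $u \in \Lambda^{0,0}(\overline{\Omega})$, so $w = \mathcal{L}^{-1}\bar\partial u \in \mathcal{D}^{0,1} \subset \Lambda^{0,1}(\overline{\Omega})$, then

(7.10) $(v, \bar\partial^*\mathcal{L}^{-1}\bar\partial u)_{L^2(\Omega)} = \lim_{s \searrow 0} \int_{\Omega_s} \langle v, \bar\partial^* w\rangle\,dV,$

where $\Omega_s = \{z \in \Omega : \rho(z) < -s\}$. We have

(7.11) $\int_{\Omega_s} \langle v, \bar\partial^* w\rangle\,dV = \int_{\Omega_s} \langle \bar\partial v, w\rangle\,dV - \dfrac{1}{i}\int_{\partial\Omega_s} \langle v, \sigma_{\bar\partial^*}(x,\nu)w\rangle\,dS.$

Of course, the first term on the right side of (7.11) vanishes if $v \in \mathcal{H}(\Omega)$. Now,
we can take a collar neighborhood of $\partial\Omega$ and identify $\partial\Omega_s$ with $\partial\Omega$, for $s$ small.
Then, for each $v \in \mathcal{H}(\Omega)$, $v(s) = v\big|_{\partial\Omega_s}$ provides a bounded family in $\mathcal{D}'(\partial\Omega)$ as
$s \searrow 0$. Meanwhile, for any $w \in \mathcal{D}^{0,1}$,

(7.12) $\sigma_w(s) = \sigma_{\bar\partial^*}(x,\nu)w\big|_{\partial\Omega_s} \to 0$ in $C^\infty(\partial\Omega)$,

as $s \to 0$. Thus the second term on the right side of (7.11) vanishes as $s \to 0$, so
we have (7.9), and the lemma is proved.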


The orthogonal projection $B$ is called the Bergman projection. If we take as its
defining property that $B$ projects $L^2(\Omega)$ onto $\mathcal{H}(\Omega)$, then the content of Proposition 7.1
is that we have a formula for $B$:

(7.13) $Bu = u - \bar\partial^*\mathcal{L}^{-1}\bar\partial u,$

at least for $u \in H^1(\Omega)$. The mapping property (5.11) implies $B : H^j(\Omega) \to H^{j-1}(\Omega)$,
for $j \ge 1$. If we interpolate this with $B : L^2(\Omega) \to L^2(\Omega)$, we deduce
that

(7.14) $B : H^j(\Omega) \longrightarrow H^{j-\varepsilon}(\Omega), \quad \forall\, \varepsilon > 0,\ j \ge 1.$

Compare with Exercise 1 in §6. In [K3] it is proved that actually $B : H^j(\Omega) \to H^j(\Omega)$.

Since $Bu$ is holomorphic for each $u \in L^2(\Omega)$, the evaluation at any $z \in \Omega$ is a
continuous linear functional on $L^2(\Omega)$, so there exists a unique element of $L^2(\Omega)$,
which we denote as $k_z$, such that

(7.15) $Bu(z) = (u, k_z)_{L^2}, \quad \forall\, u \in L^2(\Omega).$

Since holomorphic functions are harmonic, the mean-value property implies that
whenever $\varphi_z \in C_0^\infty(\Omega)$ is real-valued and radially symmetric about $z \in \Omega$, with
total integral 1, then

(7.16) $Bu(z) = (Bu, \varphi_z)_{L^2} = (u, B\varphi_z)_{L^2},$

so, for each $z \in \Omega$,

(7.17) $k_z = B\varphi_z \in C^\infty(\overline{\Omega}).$

Also, one can clearly choose $\varphi_z(\zeta)$ depending smoothly on $z$ and $\zeta$, so the map
$z \mapsto k_z$ is $C^\infty$ on $\Omega$, with values in $C^\infty(\overline{\Omega})$. Thus we can write

(7.18) $k_z(\zeta) = \overline{K(z,\zeta)}, \quad K \in C^\infty(\Omega \times \overline{\Omega}).$

Then we can rewrite (7.15) as

(7.19) $Bu(z) = \int_\Omega u(\zeta)K(z,\zeta)\,dV(\zeta).$

The function $K(z,\zeta)$ is called the Bergman kernel function. Since $B = B^*$, we
have

(7.20) $K(z,\zeta) = \overline{K(\zeta,z)};$

hence (7.18) implies

(7.21) $K \in C^\infty(\Omega \times \overline{\Omega}) \cap C^\infty(\overline{\Omega} \times \Omega).$

This regularity result is due to [Ker2].
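
For orientation, here is the classical explicit example of the unit ball, which is not worked out in the text. The formula below is the standard Bergman kernel of $\Omega = \{z \in \mathbb{C}^n : |z| < 1\}$ with respect to Lebesgue measure; if $L^2(\Omega)$ is normalized differently, the constant in front changes accordingly.

```latex
\[
\begin{aligned}
K(z,\zeta) &= \frac{n!}{\pi^n}\,\frac{1}{\bigl(1 - \langle z,\zeta\rangle\bigr)^{n+1}},
 \qquad \langle z,\zeta\rangle = \sum_{j=1}^{n} z_j\,\bar\zeta_j, \\
K(z,\zeta) &= \overline{K(\zeta,z)}, \qquad
K(z,z) = \frac{n!}{\pi^n}\,\frac{1}{\bigl(1 - |z|^2\bigr)^{n+1}} .
\end{aligned}
\]
```

Since $|\langle z,\zeta\rangle| \le |z|\,|\zeta| < 1$ when $|z| < 1$ and $|\zeta| \le 1$, this kernel is smooth on $\Omega \times \overline{\Omega}$, and by symmetry on $\overline{\Omega} \times \Omega$. It becomes singular only as both arguments approach the same boundary point, where $K(z,z)$ blows up like $(1-|z|^2)^{-(n+1)}$.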

In [F] an analysis was made of the asymptotic behavior of $K(z,z)$ as
$z$ approaches $\partial\Omega$. It was used there as a tool to prove that if $\Omega_1$ and $\Omega_2$
are two bounded, strongly pseudoconvex domains with smooth boundary
and $\Phi : \Omega_1 \to \Omega_2$ is a biholomorphism, then $\Phi$ extends to a diffeomorphism
$\Phi : \overline{\Omega}_1 \to \overline{\Omega}_2$. Later, S. Bell and E. Ligocka [BL] found a simpler proof of this
mapping result, relying on the property that $B : C^\infty(\overline{\Omega}) \to C^\infty(\overline{\Omega})$ (which follows
from (7.14)). Nevertheless, the asymptotic analysis of $K(z,z)$ has substantial
intrinsic interest. A discussion of a number of aspects of this study is given in
the survey [BFG]. In [BSj] the analysis of $K(z,z)$ is related to an analysis of the
Szegő projection, a projection analogous to the Bergman projection but defined
on $L^2(\partial\Omega)$. Alternative approaches to the analysis of the Szegő projection are
given in [KS] and in [Tay].

We turn now to a study of Toeplitz operators, defined as follows. Given
$f \in L^\infty(\Omega)$, we denote $M_f u = fu$ and set

(7.22) $T_f u = B(fu), \quad u \in \mathcal{H}(\Omega).$

Thus $T_f : \mathcal{H}(\Omega) \to \mathcal{H}(\Omega)$. We call $T_f$ a Toeplitz operator. Note that since $\|B\| = 1$,
we have an $L^2$-operator norm bound on $T_f$:

(7.23) $\|T_f\| \le \|f\|_{L^\infty}.$

Toeplitz operators have a number of interesting properties, some of which we
derive here. In the statements below, $\mathcal{L}$ denotes the space of bounded operators
and $\mathcal{K}$ the space of compact operators, acting on the relevant Hilbert space, usually
$L^2(\Omega)$ or $\mathcal{H}(\Omega)$.


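Proposition 7.3. If $f, g \in C(\overline{\Omega})$, then

(7.24) $T_f T_g - T_{fg} \in \mathcal{K}.$

Also, if $f \in C(\overline{\Omega})$,

(7.25) $f\big|_{\partial\Omega} = 0 \Longrightarrow T_f \in \mathcal{K}.$

Thus $f \mapsto T_f$ produces a homomorphism of Banach algebras

(7.26) $\tau : C(\partial\Omega) \longrightarrow \mathcal{L}/\mathcal{K}.$


In view of (7.23), it suffices to prove (7.24) for $f, g \in C^\infty(\overline{\Omega})$. Also, it suffices to
prove (7.25) for $f \in C_0^\infty(\Omega)$. In fact,

(7.27) $f \in C_0^\infty(\Omega) \Longrightarrow T_f : \mathcal{H}(\Omega) \to C^\infty(\overline{\Omega}),$

so the compactness of $T_f$ in this case is obvious.
To prove (7.24), note that

$T_f T_g u - T_{fg}u = BfBgu - Bfgu = -Bf(I - B)gu,$

so (7.24) follows if we show that

(7.28) $f \in C^\infty(\overline{\Omega}) \Longrightarrow BM_f(I - B) \in \mathcal{K}.$

It is more convenient to show that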


(7.29) $(I - B)M_f B \in \mathcal{K},$

which implies (7.28) upon taking adjoints. To see this, let us use (7.13) to write

(7.30) $(I - B)M_f Bu = \bar\partial^*\mathcal{L}^{-1}\bar\partial(fBu),$

at least for $u \in C^\infty(\overline{\Omega})$. Since $Bu$ is holomorphic, $\bar\partial(fBu) = (\bar\partial f)Bu$, so we
have

(7.31) $(I - B)M_f Bu = \bar\partial^*\mathcal{L}^{-1}\bigl((\bar\partial f)Bu\bigr),$

an identity that extends to $L^2(\Omega)$ since both sides are bounded on $L^2(\Omega)$. Thus
(7.29) will be established, and the proof of Proposition 7.3 will be complete, when
we establish the following:


Lemma 7.4. We have

(7.32) $\bar\partial^*\mathcal{L}^{-1} : L^2(\Omega,\Lambda^{0,1}) \longrightarrow L^2(\Omega)$ compact.

Proof. For $v \in \mathcal{H}_1$, we have $\|\bar\partial^* v\|_{L^2} \le \|\mathcal{L}^{1/2}v\|_{L^2}$, so, for $u \in L^2(\Omega,\Lambda^{0,1})$,

(7.33) $\|\bar\partial^*\mathcal{L}^{-1}u\|_{L^2} \le \|\mathcal{L}^{-1/2}u\|_{L^2}.$

Since $\mathcal{L}^{-1/2}$ is compact on $L^2$, (7.32) easily follows.


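Proposition 7.3 extends to the case when $f$ and $g$ take values in $\mathrm{End}(\mathbb{C}^m)$, and
$T_f$, $T_g$ act on $m$-tuples of elements of $\mathcal{H}(\Omega)$. We then have the following.


Proposition 7.5. If $f \in C(\overline{\Omega}, \mathrm{End}(\mathbb{C}^m))$ and $f(z)$ is invertible for each $z \in \partial\Omega$,
then $T_f$ is Fredholm.


Proof. Let $g \in C(\overline{\Omega}, \mathrm{End}(\mathbb{C}^m))$ satisfy $fg = gf = I$ on $\partial\Omega$. Then
Proposition 7.3 implies

$I - T_f T_g \in \mathcal{K}, \qquad I - T_g T_f \in \mathcal{K},$

so $T_g$ is a Fredholm inverse of $T_f$.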
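
The way both parts of Proposition 7.3 enter this proof can be written out explicitly. The following is a sketch, using the linearity of the map from symbols to Toeplitz operators and the fact that the Toeplitz operator with symbol $I$ is the identity on (tuples of elements of) $\mathcal{H}(\Omega)$.

```latex
\[
\begin{aligned}
I - T_f T_g &= \bigl(T_{fg} - T_f T_g\bigr) + T_{\,I - fg}, \\
T_{fg} - T_f T_g &\in \mathcal{K} \quad\text{by (7.24)}, \\
T_{\,I - fg} &\in \mathcal{K} \quad\text{by (7.25), since } (I - fg)\big|_{\partial\Omega} = 0 .
\end{aligned}
\]
```

The same decomposition with $f$ and $g$ interchanged handles $I - T_g T_f$.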


It is natural to ask what is the index of $T_f$, which clearly depends only on
the homotopy class of $f : \partial\Omega \to Gl(m,\mathbb{C})$, by general results on Fredholm
operators established in §7 of Appendix A, on functional analysis. A formula for
Index $T_f$ is given by [Ven] in case $\Omega$ is a ball in $\mathbb{C}^n$. The case of a general, strongly
pseudoconvex domain is treated in [B2]. The formula given there is equivalent to
an identity of the form

(7.34) $\text{Index } T_f = \text{Index } P_f,$

where $P_f$ is an elliptic pseudodifferential operator on $\partial\Omega$, constructed as follows.
The manifold $\partial\Omega$ possesses a spin$^c$ structure and associated Dirac operator
$D_{\partial\Omega}$ (objects defined in Chap. 10). The operator $D_{\partial\Omega}$ is a self-adjoint operator on
$L^2(\partial\Omega, S)$, where $S \to \partial\Omega$ is a certain spinor bundle. Denote by $H_+$ the closed
linear span of the positive eigenspaces of $D_{\partial\Omega}$ and by $P_+$ the orthogonal projection
onto $H_+$. If $f$ takes values in $\mathrm{End}(\mathbb{C}^m)$, let $P_+$ also denote the orthogonal
projection of $L^2(\partial\Omega, S \otimes \mathbb{C}^m)$ onto $H_+ \otimes \mathbb{C}^m$. Then we set

(7.35) $P_f = P_+ M_f P_+ + (I - P_+)$, acting on $L^2(\partial\Omega, S \otimes \mathbb{C}^m)$.

We see that $P_f \in OPS^0(\partial\Omega)$ if $f$ is smooth, and $P_f$ is elliptic if $f\big|_{\partial\Omega}$ is invertible.
The index of $P_f$ is given by the Atiyah–Singer formula; see (8.22)–(8.25) in
Chap. 10.


We note that the correspondence $f \mapsto P_f$ has properties like those established for $f \mapsto T_f$ in Proposition 7.3. That is, if $f, g \in C(\partial\Omega)$,


(7.36) $P_f P_g - P_{fg} \in \mathcal{K},$

so $f \mapsto P_f$ produces a homomorphism of Banach algebras

(7.37) $\pi : C(\partial\Omega) \longrightarrow \mathcal{L}/\mathcal{K}.$


In fact, (7.36) is established more easily than (7.24); if $f, g \in C^\infty(\partial\Omega)$, we have $P_f P_g - P_{fg} \in OPS^{-1}(\partial\Omega)$. In addition, one can also show that


(7.38) $\tau(\bar f) = \tau(f)^*, \quad \pi(\bar f) = \pi(f)^*.$


The maps $\tau$ and $\pi$ are said to produce extensions of $C(\partial\Omega)$ by $\mathcal{K}$. There are certain equivalence relations among such extensions, first specified by [BDF], and the resulting equivalence classes define elements of the $K$-homology group $K_1(\partial\Omega)$. In [BDT] it is proved that $\tau$ and $\pi$ define the same element of $K_1(\partial\Omega)$, a result that implies (7.34) and hence refines Boutet de Monvel's index theorem. See [BvE] for another approach to this index theorem.

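We mention a variant of $P_f$, constructed as follows. There is a Dirac operator $D_\Omega : C^\infty(\overline\Omega, S_+) \to C^\infty(\overline\Omega, S_-)$, and a Calderón projector $\mathcal{P}_+$, the orthogonal projection of $L^2(\partial\Omega, S)$ onto the space of $L^2$-boundary values of elements of Ker $D_\Omega$. Parallel to (7.35), we have


(7.39) $\mathcal{P}_f = \mathcal{P}_+ M_f \mathcal{P}_+ + (I - \mathcal{P}_+),$ acting on $L^2(\partial\Omega, S \otimes \mathbb{C}^m)$.


Again $\mathcal{P}_f \in OPS^0(\partial\Omega)$ if $f$ is smooth, and $\mathcal{P}_f$ is elliptic if $f|_{\partial\Omega}$ is invertible. Furthermore,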


(7.40) $\operatorname{Index} \mathcal{P}_f = \operatorname{Index} P_f.$


This holds whenever $\Omega \subset \mathbb{C}^n$ is a smoothly bounded domain; it need not be strongly pseudoconvex.

An advantage of the use of (7.39) is that it extends to a much larger class of bounded domains $\Omega$, including Lipschitz domains, and, more generally, uniformly rectifiable domains, defined and briefly discussed in Chapter 7, §11. In such a case, $\mathcal{P}_+$ is given by a singular integral operator, arising from a layer potential. One has


(7.41) $\mathcal{P}_f \mathcal{P}_g - \mathcal{P}_{fg} \in \mathcal{K},$


for continuous $f, g$, and hence $\mathcal{P}_f$ is Fredholm, with a well defined index, when $f$ is invertible. Details on these matters are given in [MMT]. As shown there, a key result is a cobordism invariance, enabling one to equate Index $\mathcal{P}_f$ on one UR domain $\Omega$ with that on another domain $\mathcal{O}$, often much better behaved. Sometimes we can take $\mathcal{O}$ to be a ball, and use (7.40) plus (7.34), together with results of [Ven], to evaluate Index $\mathcal{P}_f$ in the setting of $\Omega$. Such calculations are derived in [MMT].


Exercises 
1. Let $\{u_j\}$ be an orthonormal basis of $\mathcal{H}(\Omega)$. Show that

$K(z,\zeta) = \sum_j u_j(z)\,\overline{u_j(\zeta)},$

the series converging in $C^\infty(\Omega \times \Omega)$.

2. Show that

$K(z,z) \ge \dfrac{1}{\operatorname{vol}\Omega}, \quad \forall\, z \in \Omega.$

(Hint: Take an orthonormal basis $\{u_j\}$ of $\mathcal{H}(\Omega)$ with $u_1 = \text{const.}$)

3. Show that $\psi(z) = \log K(z,z)$ is strongly plurisubharmonic on $\Omega$, in the sense defined in (2.15). Deduce that

$h_{jk}(z) = \dfrac{\partial^2 \psi}{\partial z_j\, \partial \bar z_k}$

defines a positive-definite Hermitian metric on $\Omega$. This is called the Bergman metric on $\Omega$.

4. Suppose $F : \Omega_1 \to \Omega_2$ is a biholomorphic diffeomorphism between two strongly pseudoconvex domains $\Omega_1$ and $\Omega_2$. Compute the relation between the Bergman kernel functions, and deduce that $F$ preserves the Bergman metric.

5. Let $B^n$ be the unit ball in $\mathbb{C}^n$. Show that an orthonormal basis for $\mathcal{H}(B^n)$ is given by

$u_\alpha(z) = b_\alpha z^\alpha, \quad b_\alpha = \Big[\dfrac{(n+|\alpha|)!}{\pi^n\, \alpha!}\Big]^{1/2}.$

Deduce that the Bergman kernel function for $B^n$ is given by

$K(z,\zeta) = V_n^{-1}\,(1 - z\cdot\bar\zeta)^{-(n+1)},$

where $V_n = \operatorname{Vol} B^n$. Compute the Bergman metric for the ball.
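As a numerical sanity check on Exercises 1, 2, and 5 in the simplest case $n = 1$ (the unit disc, where $V_1 = \pi$), the following sketch compares a partial sum of $\sum_k u_k(z)\overline{u_k(\zeta)}$, with $u_k(z) = \sqrt{(k+1)/\pi}\, z^k$, against the closed form $\pi^{-1}(1 - z\bar\zeta)^{-2}$. The function names, the sample points, and the truncation length are illustrative choices, not taken from the text.

```python
import math

def bergman_series(z, zeta, terms=400):
    """Partial sum of sum_k u_k(z) * conj(u_k(zeta)), u_k(z) = sqrt((k+1)/pi) * z**k."""
    w = z * zeta.conjugate()
    return sum((k + 1) / math.pi * w**k for k in range(terms))

def bergman_closed_form(z, zeta):
    """K(z, zeta) = (1/pi) * (1 - z * conj(zeta))**(-2) on the unit disc."""
    return 1.0 / (math.pi * (1 - z * zeta.conjugate()) ** 2)

# Two sample points inside the unit disc (arbitrary choices).
z, zeta = 0.3 + 0.4j, -0.2 + 0.5j
series_value = bergman_series(z, zeta)
closed_value = bergman_closed_form(z, zeta)
print(abs(series_value - closed_value))

# On the diagonal, K(z, z) >= 1 / vol(disc) = 1/pi, with equality at the origin.
diagonal_value = bergman_closed_form(z, z).real
origin_value = bergman_closed_form(0j, 0j).real
```

The check only illustrates the identities; it is not a substitute for the convergence argument the exercises ask for.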


8. The $\bar\partial$-Neumann problem on $(0,q)$-forms


So far, we have analyzed the $\bar\partial$-Neumann problem for $(0,1)$-forms, but it was formulated for $(p,q)$-forms in §1. Here we extend the analysis of §§2–6 to $(0,q)$-forms. Our first order of business is to try to extend Morrey's inequality. We try to parallel the computation in (2.5)–(2.11). It is convenient to perform the computation in a more invariant way, using (1.10)–(1.11), that is,


(8.1) $\bar\partial\,\bar\partial^* u + \bar\partial^*\,\bar\partial u = \square u = -\tfrac{1}{2}\Delta u,$


where $\Delta$ acts on $u$ componentwise, for $u \in \Lambda^{0,q}(\overline\Omega)$, $\Omega \subset \mathbb{C}^n$. We have, as in (3.20)–(3.21),


(82) [Bula + [Ou = (Cu, u)ne —> f (op-(0.v) usu) ds, 


0. 


for $u \in \mathcal{D}^{0,q}$, the other boundary integrand $\langle \bar\partial^* u, \sigma_{\bar\partial^*}(x,\nu)u \rangle$ vanishing in this case. Also, we have


(8.3) (Qu, u)z2 =P |e. 2 


ie Ou 

Baer | X(sorea(e) seu) dS. 
aq * 

Hence, for u € D4, 


(8.4) ule + ullz2 = 2D llez, 


a tus) 


where 
(8.5) y(u, u) = — = | (os (a eyo 2) oie ee u) ds. 
’ A Oo if zn ( OZ, ’ 
dQ. 


Note that when $q = 1$, the first term on the right side of (8.4) is equal to the first term on the right side of (2.11), since $|d\bar z_j|^2 = 2$.
Let us write the integrand in (8.5) as $\alpha(u,u) + \beta(u,u)$, with


(8.6) a(u,u) = — + (og (2. Vv) Bu, w) = —(du, Op Au) 




and 


a) Op / O 
BD — Buu) = 2 (ojala) emu) = 27 52 (eu), 
k 


k 


Note that in the case $q = 1$, when $u = \sum_j u_j\, d\bar z_j$, so $\bar\partial u$ is given by (2.3), we have


5 Op Op es 
(8.8) Op Nu= yi) & —_ wus) dz; \ dz, 
j<k 
so 
_ Ou; Oux Op _ Op _ 
aM) = S (= a) (5 tk Bx, 
j<k 
(8.9) 
_ ae Op Ou; _ Op Ou; _ 
7 - Ozp OZh Oz; OZ 
jk 


Here, the first part of the last sum cancels $\beta(u,u)$, and the rest is what appears on the left side of (2.8) (multiplied by $-4$). Upon applying the identity (2.8), we thus recover the identity (2.11), for $q = 1$.

More generally, if $u$ is a $(0,q)$-form:


(8.10) $u = \sum_\alpha u_\alpha\, d\bar z^\alpha,$
summed over $\alpha_1 < \cdots < \alpha_q$, then


atu) =- (Fe dz, dz se ug dz, \ dz e) 


(8.11) . au 
— _99t1 JQ) OP CMa — 
Yom (55) ade 
where $\operatorname{sgn}\binom{j\,\alpha}{k\,\beta}$ is $+1$ if $j, \alpha_1, \dots, \alpha_q$ are distinct and are an even permutation of $k, \beta_1, \dots, \beta_q$, is $-1$ if an odd permutation, and is zero otherwise. We also have
Op OUe 
(8.12) B(u,u) = 2041 S> <P Teg 


Ga Oz; OZ; 


which cancels out the part of the last sum in (8.11) for which $j = k$ and $\alpha = \beta$.
We next want to extend the identity (2.8), so we look for some derivatives tangent to $\partial\Omega$, arising from $u \in \mathcal{D}^{0,q}$. A calculation gives


(8.13) O5* (x vu = >(do sen @r oP) ax 
fa) ’ jy Qa Oz; ? 


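summed over $\gamma_1 < \cdots < \gamma_{q-1}$. Thus, if $u \in \mathcal{D}^{0,q}$, then

(8.14) $\sum_{j,\alpha} \operatorname{sgn}\binom{\alpha}{j\,\gamma}\, u_\alpha\, \dfrac{\partial\rho}{\partial z_j} = 0, \quad \forall\, \gamma_1 < \cdots < \gamma_{q-1}.$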


Hence, extending (2.7), we have 

B\_ O a a 
(8.15) sen (/ ai ae (sen (5 —) = 0, 
for all $\gamma_1 < \cdots < \gamma_{q-1}$ and $\sigma_1 < \cdots < \sigma_{q-1}$. Hence


8 a\ (Ole OP OO 
(8.16) > sgn G sgn (2) {us Oz; azn r UBUa sana! = 0, 


j,k,a,8 


for all such $\gamma$ and $\sigma$. In this sum we also require $\beta_1 < \cdots < \beta_q$.
Now we can put (8.11) and (8.12) together with (8.16), to establish the following:


Proposition 8.1. If $u \in \mathcal{D}^{0,q}$, then


— p_F 
||Oullz2 + |l0 ullz2 


8.17 = 

(8.17) =? Lee. P| Se a i W jyWry 4S, 
0Q 

with 

(8.18) Wey = D~ sen fae 


Proof. It suffices to show that $\alpha(u,u) + \beta(u,u)$, given by (8.11) and (8.12), is equal to


a\ Op 0Ua_ 
(8.19) -YDan (7 *) sen (Seat 


where we sum over $j, k, \alpha, \beta, \gamma$, with $\alpha_1 < \cdots < \alpha_q$, and so on.

To establish the identity at a given point $p \in \partial\Omega$, rotate coordinates so $\nabla\rho(p) = \partial/\partial y_n$, and hence $\partial\rho/\partial z_k = -(i/2)\delta_{kn}$. Then, at $p$, the quantity (8.19)
is equal to 2 times 




1 B a \ OUg 
2 = 3 3 Ug. 


That u € D4 implies that ug = 0 at p whenever n occurs in 3, so we can take 
the sum in (8.20) over 7 4 n. Meanwhile, (8.11) and (8.12) sum to 7 times 


Ja\ We _ 1 Oe _ 
(8.21) 5 ham(? aE — Up 5 : Bg, ue 


j,0,8 


at p; this is equal to 


) a 2 a 0 ee 
(8.22) so (5) 5e “ ug 5 2 : ote Dsm(| 5) oe 7 


Now the first two sums in (8.22) cancel and the last sum is equal to (8.20). This proves the proposition.


Note that (8.13) is equivalent to $\sum_j W_{j\gamma}\, \partial\rho/\partial z_j = 0$ for all $\gamma$, so the hypothesis that the Levi form be positive-definite implies


a 
(8.23) » aan ee CWP, Vr. 
Jk J 
Hence the last term in (8.17) is 
(8.24) 2Cy / |Wj|? ds, 
iY an 


when $\Omega$ is strongly pseudoconvex. On the other hand, the map $u \mapsto (W_{j\gamma})$ is clearly injective, so $\sum_{j,\gamma} |W_{j\gamma}|^2 \ge C|u|^2$. We hence have the following:


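Corollary 8.2. If $\Omega$ is strongly pseudoconvex and $u \in \mathcal{D}^{0,q}$, $q \ge 1$, then


(8.25) $\|\bar\partial u\|^2_{L^2} + \|\bar\partial^* u\|^2_{L^2} \ge 2\sum_k \Big\|\dfrac{\partial u}{\partial \bar z_k}\Big\|^2_{L^2} + C\displaystyle\int_{\partial\Omega} |u|^2\, dS.$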


In the rest of this section, we assume that $\Omega$ is a bounded, strongly pseudoconvex domain in $\mathbb{C}^n$.
From here, we can use the argument from Proposition 3.1 to show that


(8.26) $\|\bar\partial u\|^2_{L^2} + \|\bar\partial^* u\|^2_{L^2} \ge C\|u\|^2_{H^{1/2}}, \quad \forall\, u \in \mathcal{D}^{0,q},$


as long as $q \ge 1$, and more precisely we can use $\|u\|^2_{H_{(1,-1/2)}}$ on the right, as in (3.9).




As in (3.12)–(3.16), we can define an unbounded, self-adjoint operator $\mathcal{L}$ on $L^2(\Omega, \Lambda^{0,q})$, for each $q \ge 1$, such that $\mathcal{D}(\mathcal{L}^{1/2})$ is the completion of $\mathcal{D}^{0,q}$ with respect to the square norm $Q(u,u) = \|\bar\partial u\|^2_{L^2} + \|\bar\partial^* u\|^2_{L^2}$, and


(8.27) $\|\mathcal{L}^{1/2} u\|^2_{L^2} = Q(u,u), \quad \forall\, u \in \mathcal{D}(\mathcal{L}^{1/2}).$


By (8.25) and (8.26), $\mathcal{L}^{1/2}$ has compact resolvent, and zero is not in the spectrum of $\mathcal{L}^{1/2}$, so $\mathcal{L}^{-1}$ exists and is a compact operator on $L^2(\Omega, \Lambda^{0,q})$, for $1 \le q \le n$. Furthermore, Proposition 3.2 extends, so (3.18) holds for $u \in \mathcal{D}(\mathcal{L})$.

Then the higher-order a priori estimates of §4 and the regularity results of §5 extend, to yield the following:


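Proposition 8.3. For $1 \le q \le n$,


(8.28) $\mathcal{L}^{-1} : H^j(\Omega, \Lambda^{0,q}) \longrightarrow H^{j+1}(\Omega, \Lambda^{0,q}), \quad j \ge 0,$
and
(8.29) $\mathcal{L}^{-1} : \Lambda^{0,q}(\overline\Omega) \longrightarrow \Lambda^{0,q}(\overline\Omega).$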


Thus the material of §6 extends. We have the next result: 


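Proposition 8.4. Given $u \in \Lambda^{0,q}(\overline\Omega)$, $q \ge 1$, we have
(8.30) $u = \bar\partial\,\bar\partial^* \mathcal{L}^{-1} u + \bar\partial^*\,\bar\partial \mathcal{L}^{-1} u = P_{\bar\partial} u + P_{\bar\partial^*} u.$


The two terms on the right side are mutually orthogonal in $L^2(\Omega, \Lambda^{0,q})$. Furthermore, for $j \ge 1$,


(8.31) $P_{\bar\partial},\ P_{\bar\partial^*} : H^j(\Omega, \Lambda^{0,q}) \longrightarrow H^{j-\varepsilon}(\Omega, \Lambda^{0,q}), \quad \forall\, \varepsilon > 0.$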


Corollary 8.5. If $q \ge 1$ and $f \in \Lambda^{0,q}(\overline\Omega)$ satisfies $\bar\partial f = 0$, then there exists $u \in \Lambda^{0,q-1}(\overline\Omega)$ satisfying $\bar\partial u = f$.


Note that there is no "cohomology" here. In the more general case of strongly pseudoconvex complex manifolds, which will be discussed in §10, there can perhaps be cohomology, arising from a nontrivial null space of $\mathcal{L}$ on $\Lambda^{0,q}(\overline\Omega)$, $q \ge 1$.

We next echo some constructions of §A. We define vector bundles $E_j \to \overline\Omega$ by


(8.32) $E_0 = \bigoplus_{j \ge 0} \Lambda^{0,2j}, \quad E_1 = \bigoplus_{j \ge 0} \Lambda^{0,2j+1}, \quad E = E_0 \oplus E_1.$


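We then define the unbounded operator $D_N$ on $L^2(\Omega, E)$ to be the closure of $\bar\partial + \bar\partial^*$, acting on $\bigoplus_{q \ge 0} \mathcal{D}^{0,q}$. As usual, $\mathcal{D}^{0,q}$ is as defined in (2.2); in particular, we have $\mathcal{D}^{0,0} = \Lambda^{0,0}(\overline\Omega)$. Note that, for $q \ge 1$, $\mathcal{D}(D_N) \cap L^2(\Omega, \Lambda^{0,q})$ coincides with $\mathcal{D}(\mathcal{L}^{1/2})$, as defined in §3 for $q = 1$ and in this section for general $q \ge 1$. Also, the orthogonality relations imply that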




(8.33) $\mathcal{D}(D_N) = \bigoplus_{q \ge 0} \mathcal{D}(D_N) \cap L^2(\Omega, \Lambda^{0,q}).$


It is easily verified from Green's formula that $D_N$ is symmetric; in fact, such arguments as needed for Exercise 1 in §3 imply that $D_N$ is self-adjoint.

General considerations imply that the Friedrichs extension method, applied to the quadratic form $Q(u,u) = \|D_N u\|^2_{L^2}$ on $\mathcal{H}_1 = \mathcal{D}(D_N)$, gives rise precisely to the positive, self-adjoint operator $D_N^* D_N = D_N^2$. In view of the construction of the self-adjoint operator $\mathcal{L}$ on $L^2(\Omega, \Lambda^{0,q})$ discussed above, we have


(8.34) $D_N^2 = \mathcal{L}$ on each factor $L^2(\Omega, \Lambda^{0,q})$,


for $q \ge 1$. In particular, we have the identity of the domains of these operators.
The operator $D_N$ provides an example of the following structure. $D_N = D$ has the form


(8.35) $D = \begin{pmatrix} 0 & A^* \\ A & 0 \end{pmatrix},$


a self-adjoint operator on a Hilbert space $H = H_0 \oplus H_1$, where $A : H_0 \to H_1$ is a closed, densely defined operator. In the present case, $H_j = L^2(\Omega, E_j)$. Thus


(8.36) $D^2 = \begin{pmatrix} A^*A & 0 \\ 0 & AA^* \end{pmatrix},$


and our results on $\mathcal{L}$ imply that
(8.37) $(D^2 + 1)^{-1}$ is compact on $H_1$.


Of course, $(D^2 + 1)^{-1}$ is not compact on $H_0$ in this case, since it coincides with the identity on $\mathcal{H}(\Omega) \subset L^2(\Omega, \Lambda^{0,0}) \subset H_0$, which is an infinite-dimensional space. There is another important property, namely that, for any $f \in C^\infty(\overline\Omega)$, $M_f$ preserves $\mathcal{D}(D_N)$ and


(8.38) $[M_f, D]$ extends to a bounded operator on $H_0 \oplus H_1$.


Using these properties, we will establish the following, which, as we will see, 
complements Proposition 7.3. Set 


(8.39) $F = D(D^2 + 1)^{-1/2} = \begin{pmatrix} 0 & T^* \\ T & 0 \end{pmatrix}, \quad T = A(A^*A + 1)^{-1/2}.$
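The block structure in (8.39) can be checked in a finite-dimensional model. The sketch below is only an illustration under stated assumptions: a random complex matrix stands in for the closed operator $A$, and the dimensions 5 and 3 for $H_0$ and $H_1$ are arbitrary. It forms $D$ as in (8.35), computes $F = D(D^2+1)^{-1/2}$ spectrally, and compares it with the off-diagonal form built from $T = A(A^*A+1)^{-1/2}$.

```python
import numpy as np

rng = np.random.default_rng(2)
n0, n1 = 5, 3  # dimensions of the stand-ins for H_0 and H_1 (arbitrary choices)
A = rng.standard_normal((n1, n0)) + 1j * rng.standard_normal((n1, n0))

def inv_sqrt(hermitian):
    """Return hermitian^{-1/2} for a positive-definite Hermitian matrix."""
    eigenvalues, eigenvectors = np.linalg.eigh(hermitian)
    return (eigenvectors / np.sqrt(eigenvalues)) @ eigenvectors.conj().T

# D = [[0, A*], [A, 0]] on H_0 (+) H_1, as in (8.35).
D = np.block([[np.zeros((n0, n0)), A.conj().T], [A, np.zeros((n1, n1))]])
F = D @ inv_sqrt(D @ D + np.eye(n0 + n1))

# Off-diagonal form of (8.39), with T = A (A*A + 1)^{-1/2}.
T = A @ inv_sqrt(A.conj().T @ A + np.eye(n0))
F_blocks = np.block([[np.zeros((n0, n0)), T.conj().T], [T, np.zeros((n1, n1))]])

block_error = np.linalg.norm(F - F_blocks)
operator_norm = np.linalg.norm(F, 2)  # F = D (D^2 + 1)^{-1/2} has norm < 1
print(block_error, operator_norm)
```

In finite dimensions every operator is compact, so this says nothing about (8.40); it only confirms the algebra behind the definition of $T$.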


Proposition 8.6. The operator $F$ has closed range, and for all $f \in C(\overline\Omega)$, we have a compact commutator:

(8.40) $[M_f, F] \in \mathcal{K}(H_0 \oplus H_1).$

To establish (8.40), we may as well assume $f \in C^\infty(\overline\Omega)$. Then we can write
(8.41) $[M_f, F] = [M_f, D](D^2 + 1)^{-1/2} + D[M_f, (D^2 + 1)^{-1/2}].$


It follows from (8.38) that the first term on the right is compact on $H_1$. Before looking at the last term on the right, we derive a result that gives some information on the behavior of the first term on $H_0$.


Lemma 8.7. The operator $(D^2 + 1)^{-1/2}$ is compact on the orthogonal complement of ker $D$ (in $H_0$).


Proof. We are saying that $(A^*A + 1)^{-1/2}$ is compact on the orthogonal complement of ker $A$ in $H_0$. We will deduce this via the identity


(8.42) $A\, g(A^*A) = g(AA^*)\, A$ on $\mathcal{D}(A)$,


which holds for any bounded, continuous function $g$ on $[0, \infty)$. The identity (8.42) is a consequence of the identity


(8.43) $D\, g(D^2) = g(D^2)\, D$ on $\mathcal{D}(D)$.


Another ingredient in the proof of the lemma is the following. Since $AA^*$ has compact resolvent, $H_1$ has an orthonormal basis of eigenvectors for $AA^*$, and we have


(8.44) $A : \operatorname{Eigen}(\lambda, A^*A) \longrightarrow \operatorname{Eigen}(\lambda, AA^*),$

$A^* : \operatorname{Eigen}(\lambda, AA^*) \longrightarrow \operatorname{Eigen}(\lambda, A^*A).$
If $\lambda \ne 0$, these maps are inverses of each other, up to a factor $\lambda$, so they are isomorphisms.
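The two ingredients just stated, the intertwining identity (8.42) and the isomorphisms (8.44), can be illustrated with matrices. This is a sketch under assumptions not in the text: a random $4 \times 6$ complex matrix plays the role of $A$, and $g(s) = (1+s)^{-1/2}$ is one admissible bounded continuous function. The helper name `apply_function` is hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
# A maps a 6-dimensional stand-in for H_0 to a 4-dimensional stand-in for H_1.
A = rng.standard_normal((4, 6)) + 1j * rng.standard_normal((4, 6))

def apply_function(hermitian, g):
    """Compute g(hermitian) through the spectral decomposition."""
    eigenvalues, eigenvectors = np.linalg.eigh(hermitian)
    return (eigenvectors * g(eigenvalues)) @ eigenvectors.conj().T

def g(s):
    return 1.0 / np.sqrt(1.0 + s)

AsA = A.conj().T @ A  # A*A acting on H_0
AAs = A @ A.conj().T  # AA* acting on H_1

# (8.42): A g(A*A) = g(AA*) A.
intertwining_error = np.linalg.norm(A @ apply_function(AsA, g) - apply_function(AAs, g) @ A)

# Consequence of (8.44): the nonzero eigenvalues of A*A and AA* coincide.
def nonzero(values):
    return np.sort(values[values > 1e-10])

spec_AsA = nonzero(np.linalg.eigvalsh(AsA))
spec_AAs = nonzero(np.linalg.eigvalsh(AAs))
print(intertwining_error, spec_AsA, spec_AAs)
```

Here $A^*A$ has a two-dimensional kernel while $AA^*$ is invertible, which mirrors the role of ker $A$ in the lemma.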

To prove the lemma, we first show that


(8.45) $\varphi \in C_0^\infty(\mathbb{R}),\ \varphi(0) = 0 \Longrightarrow \varphi(A^*A) \in \mathcal{K}(H_0).$


To do this, write $\varphi(s) = s\,\varphi_1(s)\varphi_2(s)\varphi_3(s)$, $\varphi_j \in C_0^\infty(\mathbb{R})$. Then, applying (8.42) with $g = \varphi_1\varphi_2$, we have


(8.46) $\varphi(A^*A) = A^*A\,(\varphi_1\varphi_2)(A^*A)\,\varphi_3(A^*A) = A^*\varphi_1(AA^*)\,\varphi_2(AA^*)\,A\,\varphi_3(A^*A).$

Here, $A\varphi_3(A^*A) \in \mathcal{L}(H_0, H_1)$, $\varphi_2(AA^*) \in \mathcal{K}(H_1)$, and $A^*\varphi_1(AA^*) \in \mathcal{L}(H_1, H_0)$, so (8.45) follows. Consequently, the spectrum of $A^*A$, which is contained in $[0, \infty)$, is discrete on any compact interval in $(0, \infty)$, of finite multiplicity. It remains to show that this spectrum cannot accumulate at 0. Indeed, the argument involving (8.44) shows that

(8.47) $\{0\} \cup \operatorname{spec} A^*A = \{0\} \cup \operatorname{spec} AA^*,$


and since $AA^*$ has compact resolvent, its spectrum does not accumulate at 0, so Lemma 8.7 is proved.


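To proceed with the proof of Proposition 8.6, we next show that the last term in (8.41) is compact on $H_1$ and on the orthogonal complement of ker $D$ in $H_0$. One tool will be the integral representation


(8.48) $(D^2 + 1)^{-1/2} = \dfrac{1}{\pi}\displaystyle\int_0^\infty \lambda^{-1/2}(D^2 + 1 + \lambda)^{-1}\, d\lambda = \dfrac{2}{\pi}\displaystyle\int_0^\infty (D^2 + 1 + s^2)^{-1}\, ds.$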
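By the spectral theorem, (8.48) reduces to the scalar identity $a^{-1/2} = (2/\pi)\int_0^\infty (a + s^2)^{-1}\, ds$ for $a \ge 1$ (with $a = \mu^2 + 1$, $\mu \in \operatorname{spec} D$). The following sketch checks that scalar identity numerically. The substitution $s = \tan\theta$ and the midpoint rule are choices made here for the illustration, and the function name and sample values of $a$ are hypothetical.

```python
import math

def inverse_sqrt_via_integral(a, panels=2000):
    """Approximate (2/pi) * integral_0^inf (a + s^2)^{-1} ds.

    With s = tan(theta), the integrand becomes 1 / (a cos^2(theta) + sin^2(theta))
    on [0, pi/2], which is integrated by the midpoint rule.
    """
    h = (math.pi / 2) / panels
    total = 0.0
    for k in range(panels):
        theta = (k + 0.5) * h
        total += 1.0 / (a * math.cos(theta) ** 2 + math.sin(theta) ** 2)
    return (2.0 / math.pi) * total * h

sample_values = [1.0, 2.5, 10.0]
errors = [abs(inverse_sqrt_via_integral(a) - a ** -0.5) for a in sample_values]
print(errors)
```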


In order to get a convenient formula for $[M_f, (D^2 + 1 + s^2)^{-1}]$, set $t = \sqrt{1 + s^2}$ and write

(8.49) $(D^2 + t^2)^{-1} = (D + it)^{-1}(D - it)^{-1},$

so


(8.50) $[M_f, (D^2 + t^2)^{-1}] = [M_f, (D + it)^{-1}](D - it)^{-1} + (D + it)^{-1}[M_f, (D - it)^{-1}].$


Since, for $f \in C^\infty(\overline\Omega)$, $M_f$ preserves $\mathcal{D}(D)$, we have


(8.51) $[M_f, (D + it)^{-1}] = -(D + it)^{-1}[M_f, D](D + it)^{-1}.$
Hence

(8.52) $[M_f, (D^2 + t^2)^{-1}] = -(D + it)^{-1}[M_f, D](D^2 + t^2)^{-1} - (D^2 + t^2)^{-1}[M_f, D](D - it)^{-1}.$
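The commutator identities (8.51) and (8.52) are purely algebraic, so they can be tested on matrices. In this sketch a random Hermitian matrix stands in for $D$ and a random diagonal matrix for $M_f$; both, along with the value of $t$ and the helper name `comm`, are assumptions made for illustration. The resolvent norm bounds used later in the argument are checked as well.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 5
B = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
D = (B + B.conj().T) / 2  # self-adjoint stand-in for D
M = np.diag(rng.standard_normal(n) + 1j * rng.standard_normal(n))  # stand-in for M_f
t = 1.7
identity = np.eye(n)

def comm(X, Y):
    """Commutator [X, Y] = XY - YX."""
    return X @ Y - Y @ X

R_plus = np.linalg.inv(D + 1j * t * identity)       # (D + it)^{-1}
R_minus = np.linalg.inv(D - 1j * t * identity)      # (D - it)^{-1}
R_square = np.linalg.inv(D @ D + t**2 * identity)   # (D^2 + t^2)^{-1}

# (8.51): [M, (D + it)^{-1}] + (D + it)^{-1} [M, D] (D + it)^{-1} = 0.
err_851 = np.linalg.norm(comm(M, R_plus) + R_plus @ comm(M, D) @ R_plus)

# (8.52): [M, (D^2 + t^2)^{-1}] equals minus the two resolvent-sandwiched terms.
err_852 = np.linalg.norm(
    comm(M, R_square) + R_plus @ comm(M, D) @ R_square + R_square @ comm(M, D) @ R_minus
)

# Norm bounds: ||D (D + it)^{-1}|| <= 1 and ||(D + it)^{-1}|| <= 1/t.
norm_D_resolvent = np.linalg.norm(D @ R_plus, 2)
norm_resolvent = np.linalg.norm(R_plus, 2)
print(err_851, err_852, norm_D_resolvent, norm_resolvent)
```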


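Therefore the last term in (8.41) is equal to


(8.53) $-\dfrac{2}{\pi}\displaystyle\int_0^\infty D\big(D + i\sqrt{1+s^2}\big)^{-1}[M_f, D](D^2 + 1 + s^2)^{-1}\, ds \;-\; \dfrac{2}{\pi}\displaystyle\int_0^\infty D(D^2 + 1 + s^2)^{-1}[M_f, D]\big(D - i\sqrt{1+s^2}\big)^{-1}\, ds = T_1 + T_2.$


In view of the operator norm estimates


$\|D(D \pm it)^{-1}\| \le 1, \quad \|(D \pm it)^{-1}\| \le t^{-1},$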




for $t > 0$, it follows that both integrals in (8.53) are convergent in the operator norm. Thus $T_1 + T_2$ is compact on any closed subspace of $H_0 \oplus H_1$ on which the integrands are compact, for all $s \in [0, \infty)$.

The integrand for $T_1$ is a product of bounded operators with the factor $(D^2 + 1 + s^2)^{-1}$, which we know to be compact on $H_1$ and on $(\ker D)^\perp$, so $T_1$ is compact there. The integrand for $T_2$ is a product of bounded operators and $D(D^2 + 1 + s^2)^{-1}$. It follows from Lemma 8.7 that this factor is compact on all of $H_0 \oplus H_1$, so $T_2$ is compact on $H_0 \oplus H_1$.

To complete the proof of compactness of the commutator (8.40), it remains to show that this commutator is compact on ker $D$, for $f \in C^\infty(\overline\Omega)$. In such a case we can write, in place of (8.41),


(8.54) $[M_f, D(D^2 + 1)^{-1/2}] = [M_f, (D^2 + 1)^{-1/2} D] = [M_f, (D^2 + 1)^{-1/2}]\,D + (D^2 + 1)^{-1/2}[M_f, D],$


on $\mathcal{D}(D)$. On ker $D$, this is equal to
(8.55) $(D^2 + 1)^{-1/2}[M_f, D].$
Now $[M_f, D]$ maps $H_0$ to $H_1$, and $(D^2 + 1)^{-1/2}$ is compact on $H_1$, so (8.55) is compact on $H_0$. This completes the proof of the compactness assertion (8.40).
Finally, the proof of Lemma 8.7 shows that zero is an isolated point of spec $D^2$, hence of spec $D$, so $D$ has closed range, and hence $F$ has closed range. This completes the proof of Proposition 8.6.
Another consequence of Lemma 8.7 is that
(8.56) $F - F_0 \in \mathcal{K}, \quad F_0 = P^+ - P^-,$
where $P^+$ is the orthogonal projection onto the closed linear span of the positive eigenspaces of $D$, and $P^-$ is the orthogonal projection onto the closed linear span of the negative eigenspaces of $D$. Thus (8.40) is equivalent to
(8.57) $[M_f, F_0] \in \mathcal{K},$
for all $f \in C(\overline\Omega)$. Note that
(8.58) $F_0^2 = P^+ + P^- = I - P^0,$
where $P^0$ is the orthogonal projection onto ker $D$. Since


$[M_f, F_0^2] = [M_f, F_0]F_0 + F_0[M_f, F_0],$


we have the following variant of (8.40):


Proposition 8.8. For all $f \in C(\overline\Omega)$, we have compact commutators:
(8.59) $[M_f, P^+],\ [M_f, P^-],\ [M_f, P^0] \in \mathcal{K}(H_0 \oplus H_1).$


In our present situation, $P^0$ preserves each factor $L^2(\Omega, \Lambda^{0,q})$ in $H_0 \oplus H_1$. In fact, $P^0$ is zero on all these spaces except $L^2(\Omega, \Lambda^{0,0})$, on which it is the Bergman projection. Thus the compactness of $[M_f, P^0]$ is equivalent to the compactness of $[M_f, B]$, established in Proposition 7.3.

The value of Propositions 8.6–8.8 as a complement to Proposition 7.3 is particularly revealed in its relevance to the index identity (7.34). We will give only a brief description of this connection here, referring to [BDT] for details. As shown in [BDT], the results in Propositions 8.6–8.8 imply that $D_N$ determines a relative $K$-homology class, $[D_N] \in K_0(\overline\Omega, \partial\Omega)$, and that the $K$-homology class $[\tau] \in K_1(\partial\Omega)$ described in §7 is obtained from $[D_N]$ by applying a natural boundary map $K_0(\overline\Omega, \partial\Omega) \to K_1(\partial\Omega)$.

It is then shown in [BDT] that a certain identity in $K_0(\overline\Omega, \partial\Omega)$ leads, via the application of this boundary map, to the identity $[\tau] = [\pi] \in K_1(\partial\Omega)$ mentioned in §7, an identity that in turn implies the index identity (7.34).
Exercises 
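1. Extend the results of this section to
$\bar\partial : C^\infty(\overline\Omega, V \otimes \Lambda^{0,q}) \longrightarrow C^\infty(\overline\Omega, V \otimes \Lambda^{0,q+1}),$
for any finite-dimensional, complex vector space $V$. Deduce results for
$\bar\partial : \Lambda^{p,q}(\overline\Omega) \longrightarrow \Lambda^{p,q+1}(\overline\Omega).$


2. Establish an analogue of Proposition A.4 for the $\bar\partial$-complex.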


9. Reduction to pseudodifferential equations on the boundary 


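In this section we reduce the $\bar\partial$-Neumann problem on $\Omega$ to a system of equations on $\partial\Omega$. This method provides an alternative to the sort of analysis carried out in §§2–5. We consider the boundary problem


(9.1) $\square u = 0$ on $\Omega$, $\quad \sigma_{\bar\partial^*}(x,\nu)u = 0, \quad \sigma_{\bar\partial^*}(x,\nu)\,\bar\partial u = f$ on $\partial\Omega$.


We write $u$ in terms of a solution to the Dirichlet problem:


(9.2) $\square u = 0$ on $\Omega$, $\quad u\big|_{\partial\Omega} = g,$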




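that is, $u = \mathrm{PI}\, g$. Thus, $g$ satisfies the equation


(9.3) $Ag = f,$

where

(9.4) $Ag = \sigma_{\bar\partial^*}(x,\nu)\,\bar\partial(\mathrm{PI}\, g)\big|_{\partial\Omega},$

and we require

(9.5) $\sigma_{\bar\partial^*}(x,\nu)\, g = 0.$

Thus, we can regard $A$ as a linear operator

(9.6) $A : C^\infty(\partial\Omega, E) \longrightarrow C^\infty(\partial\Omega, E),$
where $E \to \partial\Omega$ is the complex vector bundle

(9.7) $E = \operatorname{Ker} \sigma_{\bar\partial^*}(x,\nu) = \operatorname{Im} \sigma_{\bar\partial^*}(x,\nu).$


Here, if we are looking at $\square$ on $(0,q)$-forms, the first $\sigma_{\bar\partial^*}(x,\nu)$ in (9.7) acts on $\Lambda^{0,q}\big|_{\partial\Omega}$ and the second acts on $\Lambda^{0,q+1}\big|_{\partial\Omega}$. The second identity in (9.7) follows from the exactness of the symbol sequence for the $\bar\partial$-complex.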
For simplicity, we confine attention to the case $q = 1$, which was studied in §§2–6. Thus $u = \mathrm{PI}\, g$ has the form


(9.8) $u = \sum_j u_j\, d\bar z_j, \quad u_j = g_j$ on $\partial\Omega$.
Say $\nu = \sum_j (\nu_j\, dx_j + \mu_j\, dy_j)$, so

(9.9) v+id*y = S095 dz;, Gy = Uy — py. 
The condition that $\sigma_{\bar\partial^*}(x,\nu)u = 0$ on $\partial\Omega$ is equivalent to
(9.10) $\sum_j \varphi_j\, g_j = 0.$


A computation gives 
_ _ 1 Ou; Oux 
(9.11) o5-(x,v)du= S_ f; dz;, fi= GDP (St - $4). 


Note that $\sum_j \varphi_j f_j = 0$, a fact consistent with the second identity in (9.7).
To express (9.11) in the form (9.3), write




Ou 


(9.12) Be 


1 
= 3 (Ye +aprN)qg, 


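for $u = \mathrm{PI}\, g$, where $Y_k$ is a (complex) vector field tangent to $\partial\Omega$, $a_k \in C^\infty(\partial\Omega)$, and $N$ is the Neumann operator for the Dirichlet problem:


(9.13) $N g = \dfrac{\partial u}{\partial \nu}\Big|_{\partial\Omega}, \quad u = \mathrm{PI}\, g.$


Thus we get $Ag = f$, that is, $f_j = A_j{}^k g_k$, with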


(9.14) 8; = 2 Pol Yi + anN)9; — Da vel Yj + ajN) gn 


Note that $\sum_k \varphi_k g_k = 0 \Rightarrow \sum_k \varphi_k Y_j g_k = -\sum_k (Y_j \varphi_k)\, g_k$, for each $j$. Thus we can write the system as


(9.15) 8f; =(6¥ taN)g )9i— 05 D_eeN ox + DY, 52k) Ik 


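with


(9.16) $iY = \sum_k \varphi_k Y_k, \quad a = \sum_k a_k \varphi_k.$


Now, (9.12) implies that $\operatorname{Re} a_k = \nu_k$ and $\operatorname{Im} a_k = \mu_k$, or
(9.17) $a_k = \bar\varphi_k.$


Also, of course, $1 = \sum_j (\nu_j^2 + \mu_j^2) = \sum_j |\varphi_j|^2$, so we have $a = 1$ in (9.16).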
Furthermore, if p is a defining function for Q, as in §2 (so p = 0 and |Vp| = 1on 
OQ, p< 0onQ), then y, = —20p/0z, and, for all vu € C™(Q), 


Op OV laa, 
(9.18) Die oe = 5 (9v, 9p); 


so, if u; = PI g;, we have 
(9.19) (iY +aN)g; = (Ou;, Op) Ba 


Fix p € OQ, and rotate coordinates so OQ is tangent to {yn = c} at p, and 
Vp = —0/0Yn at p. Then 0p = —(i/2) dz, at p, so we have 


(9.20) (iY +aN)g; = ~ist +N; 




We recapture the identity $a = 1$ and see that, in (9.15), $Y$ is a real vector field tangent to $\partial\Omega$, namely


(9.21) $Y = -J(\nabla\rho).$


Note that we can write (9.10), and the analogous result for $f$, as


(9.22) $\sum_j g_j\, \bar a_j = 0 = \sum_j f_j\, \bar a_j.$


If we define $Q \in C^\infty(\partial\Omega, \operatorname{End}(\mathbb{C}^n))$ to be the orthogonal projection that annihilates $(a_1, \dots, a_n)$, then $Qg = g$ and $Qf = f$, so we can apply $Q$ to (9.15), and write


(9.23) 8f=N+iY)g+Cog, 
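where $C = C_1 + C_2$, with
(9.24) $C_1 g = [Q, N + iY]\, g, \quad (C_2 g)_j = \sum_{k,\ell} Q_j{}^k\, (Y_k \varphi_\ell)\, (Qg)_\ell.$
Note that, for each $x \in \partial\Omega$, $\mathcal{R}(Q(x)) = E_x = \mathfrak{H}_x(\partial\Omega)$ (defined by (2.33)) in this case.
Thus we need to analyze the pseudodifferential operator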


(9.25) $\square^+ = N + iY + C \in OPS^1(\partial\Omega),$


which we claim to be hypoelliptic. The principal symbol is given by


(9.26) $\sigma_{\square^+}(x,\xi) = -|\xi| + \tau(x,\xi), \quad \tau(x,\xi) = \langle Y, \xi \rangle,$


which is $\le 0$ everywhere and vanishes to second order on the ray bundle generated by $J^t(d\rho)$, which we will denote as $\Sigma^+ \subset T^*(\partial\Omega) \setminus 0$. Thus $\Sigma^+$ is the characteristic set of $\square^+$.

Since $\sigma_{\square^+} = \sigma_{N+iY}$ vanishes to second order on $\Sigma^+$, it follows that $C_1 = [Q, N + iY] \in OPS^0(\partial\Omega)$ satisfies


(9.27) $\sigma_{C_1}(x,\xi) = 0$ on $\Sigma^+$.


It will turn out that this implies that the presence of $C_1$ does not affect the hypoellipticity of $\square^+$.

The operator $C_2$ (also of order zero) requires further study. If we fix $p \in \partial\Omega$ and rotate coordinates so that $T_p(\partial\Omega)$ is given by $\{y_n = c\}$, then $Q(p)$ annihilates the last component of a vector in $\mathbb{C}^n$, and we have




(9.28)  Cag(p) =2({ D> as... D> = o,0 
k<n—1 OF 1 k<n—1 On-1 
Note that the Levi form arises here: 
OPK Op 
9.29 2 =-4 = -4L£,,; 
( ) OZ; 02; 02% kj 
on $\partial\Omega$. Thus, for any $g, h \in L^2(\partial\Omega, \mathbb{C}^n)$, we have
(9.30) $(C_2 g, h)_{L^2} = -4 \displaystyle\int_{\partial\Omega} \mathcal{L}(Qg, Qh)\, dS.$


If $g$ and $h$ are sections of $E \to \partial\Omega$, we can omit the $Q$s in (9.30).

We have used the fact that $N = -\sqrt{-\Delta_X}$ mod $OPS^0(\partial\Omega)$ in the symbol calculation (9.26), where $\Delta_X$ is the Laplace operator on $X = \partial\Omega$. We next make use of a finer analysis of $N$, given in §C at the end of this chapter, which says


(9.31) N=—-V/-Ax+B, BEeOPS (d9), op(z,é)= ; Tr (AyPe), 


where Ay is the Weingarten map (arising from the second fundamental form 
of OQ ¢c R?”) and PP is the orthogonal projection of T,,(0Q) onto the linear 
subspace annihilated by €. Thus 


(9.32) —O* =/—Axy —iY+B,, By =—-B-—CeOPS*(aQ). 


In (9.29) we have related the principal symbol of the most important part C2 of C 
to the Levi form. If we use (B.20), we can write the principal symbol of B on )* 
as 


(9.33) on(z,7) = 2(Tr rag n= J*(dp), 
where we use the Hermitian metric on (02) to produce the section 
feces (29, End 5) 


from the Levi form. Hence 


(9.34) op, (x,£) = -2| (Tr £)1 — 2£Q] + (2,8), 


where (x, €) vanishes on +. 
The action of Ot on sections of E is of major interest, but it is convenient to 
analyze LI* on general functions u € L?(00,C"). To do this, it is convenient to 




replace LQ by 


Lo =£Q 4+ r(x)(I- Q), 


where (x) is a positive function to be specified later. Thus, we replace (9.34) by 


(9.35) 


From now on, we work with this modified Ot. 


op, (2,€) = 2 [cr LI —2Lo| + B(x, €). 


The structure of 1 is to some degree simplified by composing on the left by 


(9.36) 


~, defined by 


_ = /-Ax +iY + Bo, 


where By € OPS°(0Q) will be specified shortly. Note the different sign in front 
of iY. Thus C1 is elliptic on X*; its characteristic set is U~, the ray bundle in 
T* (OQ) \ 0 generated by —.J*(dp). We have 


(9.37) 


t=-Ay4+Y?4+F, 


where F € OPS'(0Q) is given by 


(9.38) 


F= (V-Ax + iY) B, -i| -Ax,Y] see (V-Ax - iY) + BoBy. 


Since 


+i| =Ax,Y] = | V-Ax, V-Ax + iY| 


and —Ax +iY is doubly characteristic on -*, we see that 


(9.39) 


Ov—ax y](@,€) =O on stuD-, 


Now 1(x,€) = £|€| on S*. Consequently, the principal symbol of F' satisfies 


(9.40) 


_ 2|Elon, (2, €) on DT 
COE) lel aesfae\ on] 


Thus, if Bz is chosen so that 


(9.41) 


then 


(9.42) 


If 


op,(@,§) = 2|(tr ie 2£q| on X, 


on(a,€) = —4| (Tr Dre 2£q|r(«,€) ire OES, 


~ is constructed in this fashion, we have 




(9.43) += =Ay +Y*—ia(a)Y +R=O)+ 8, 


with 
(9.44) a(x) = 4|(tr LI -2Lo|, RE OPS'(A0), op(x,€) =0 ond. 


As we have suggested, R will play a minor role in the analysis. Now the opera- 
tor Oh = —Axy + Y? + ia(x)Y is a second-order differential operator, doubly 
characteristic on % = t+ U 7. It is essentially the “Kohn Laplacian” on 02. 
We now derive an analogue of the “1 /2-estimate” for 0, + R. In the analysis, R 
(or perhaps R’ or R;) will denote an arbitrary element of OPS! (dQ) (sometimes 
a differential operator) whose principal symbol vanishes on ©; it might vary from 


line to line. We begin with an estimate on ( (4,+ R)u, u) P when wu is supported 
L 


on an open set O C O diffeomorphic to a ball in R?"~!. Let {Xj : 1 < j < 2n} 
be a smooth, orthonormal frame field for §(0Q) over O, such that X,,4 j= IX;. 


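Lemma 9.1. Assume $\mathcal{L}$ is positive-definite on $\mathfrak{H}_x(\partial\Omega)$, for all $x \in \partial\Omega$. Also assume $n \ge 3$. If $u \in C_0^\infty(\mathcal{O}, \mathbb{C}^n)$, then, for some $C_j > 0$,


(9.45) $\operatorname{Re}\big((\square_b + R)u, u\big) \ge C_0 \sum_j \|X_j u\|^2_{L^2} + C_0 \|u\|^2_{H^{1/2}} - C_1 \|u\|^2_{L^2}.$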


Proof. Note that 


2n 
(9.46) Ax +Y?=-S°X7+R. 
j=l 
Now set 
(9.47) Z,=X5—-iXjan, 25 = Xp +iX jan. 
We have 
Ve ee oD ere lig, Z| 
yas! | | gt+n 9 JIM 7I 
(9.48) ; 
Zj2Z; =X? + X34 512i: Z;].- 


If we use (B.7), and recall that Y = —J(Vp), we have 


(9.49) 5 (23,Z5] = -if,(x)Y+R,;, G;(x) = 4£(X;,X;) = 4(LX;, X;). 




(The factor of 4 arises from a slightly different definition of Z; and Zi in (B.10).) 
Note also that 


(9.50) Gj2; =—2,23 + Ry, 2,2; =-212; + Ri. 


Hence, since }> 3; = 4 Tr L. 


—-Ax+Y?= —. Zt + 4i(Te LYY + Ri 
(9.51) 
= S- 252; —Ai(Tr L)Y + Ro. 
Thus, 
pt R=S° ZZ; +iAi(e)¥ + Ry 
(9.52) 
= S025 2; —iAa(x)Y + Ro, 
where 
(9.53) Ai(z) =8£9, Ao(x) = 8((Tr Dix La). 


Recall that on = LQ + 7(x)(I — Q). As long as 7(x) > 0, Ai(z) is a 
positive-definite matrix function. Also, as long as n > 3, Tr L£ exceeds any sin- 
gle eigenvalue of £, so we can pick (x) > 0 small enough that A(z) is also a 
positive-definite matrix function. 

Given that Aj(x) and A (2) are positive-definite, we want to take a “convex 
combination” of the two expressions on the right side of (9.52) and obtain an 
expression for which the estimate (9.45) is obvious. Let y;(x,£) € S° be real- 
valued and satisfy 


(9.54) yj(t,E)>56>0, ilz,€)? + y2(z, 6)? =1 


Then the operators y;(x, D) are elliptic, and we have 


p+ R=i(z,D)* >~ 2523 91(2, D) 
(9.55) + 92(x,D)* S> ZZ; p2(a, D) 
+ i] Ai(e)gr(@, D) — Aa(x)ya(#, D)]¥ + FR’. 


Now the operator 


(9.56) V = i[ Ai(2)pi(x, D) — Aa(2)ga(x, D)]Y 




has symbol on & given by 


(9.57) ov(z,é) =+ [Ar(e)9(e, €) — Ao(x)yo(a,€)||é| on EF. 


Given that A; (x) is positive-definite, we see that oy (x, €) is positive-definite on 
=* as long as Yo(z, €) is sufficiently small on ©*; similarly, if Az() is positive- 
definite, then oy (x, ) is positive-definite on =~ as long as y1(2, €) is sufficiently 
small on X~. 

Thus, under the hypotheses of Lemma 9.1, we can arrange 


(9.58) oy (z,€) >colE|\E on ©, co >0. 
Now we can write 
(9.59) V=W+R, ow(2,€)>col€\l on T*O\0, 


and deduce from (9.55) that 


Re(( i+R)u,u) = 
om) = So {llZjerl@, Dyullis + [1Zj¢2(x, Duis } 
+ Re(Wu, v) 72 + Re(Ru, u)p2, 
Garding’s inequality implies 
(9.61) Re(Wu,u)z2 = Collull are — Calleulli2. 


If we note that [Z;, 91(2, D)] and [Z, yo(x, D)| belong to OPS°(Q) and use 
elliptic estimates, we see that the sum over 7 on the right side of (9.60) is 


2n 
(9.62) > Cod) Xjullz2 — Cillullze- 


j=l 


Finally, given R € OPS'(90), or(x,€) = 0 on &, we can write 


(9.63) R= 5_S;X;+So, 5; € OPS°(AN) 
and obtain 
(9.64) (Ru, w)| < C >> | Xjullzallullce + Cllullze. 


From these estimates, we have (9.45). 




Note that since Re(( ot R)u,u) = < lull gs2||(Ge + R)ul| 7-1/2, the 
inequality (9.45) implies the estimate 


(9.65) do Xsullz2 + [lulz S Cle + Ryulli-a/2 + Cllullze, 


for u € Cx°(O,C"”). 
We can localize the estimate (9.65) as follows. Given Wo € C§°(O), we see 
that 


(9.66) wo(x) (Oy + R) — (Cy + R)vo(z) = R’. 


Assuming 7); € Co°(O), ~j+41 = 1 on supp ~,;, we have 


(9.67) vo(Oy + R) — di (Oe + R)vo = 1 Ro + V1 [H2, BR’). 


Applying (9.65) with u replaced by qu, we then have 
Ly |X; (Wow) IZ2 + [Youll Fa/2 


(9.68) < Cl\vo(Os + R)ull3,-12 + Cy |X; (wou) ||3,—-1/2 


+ Cl|voullZ2 + Cllullz—1, 


for u € C™(0N, C”). 
If we cover OQ by a finite collection of open sets O,, diffeomorphic to balls, 
and sum the resulting estimates, we obtain a global estimate of the form 
(9.69) 
[Vsyull22 + llell2pa/2 < Cle + Ryull2,-r/2 + CllV gull2,—r/2 + Clull3e, 


for all u € C°(0,C”), where, for each component u; of u, Vsu;(x) is the 
orthogonal projection of Vu; (x) € C@T,(0Q) onto C@pHz(OQ), so Vsu(z) € 
C” @p Hx (OQ). We can write 


(9.70) |V5ullzp-1/2 < ell Vgullg2 + Cle) ||Veullz—1 


and absorb the term ¢||V5u||7,2, obtaining 


(9.71) IVeullz2 + llellgay2 < Cll(Qe + Ryullz-s2 + Cllullze- 


We can obtain higher-order estimates as follows. With A = /—Ax + I, we 
have 


(9.72) AF(O, + R) — (OQ, + R)A* = R’A®, 


so applying (9.71) with u replaced by A*u yields 


[Vs Afullz2 + APullzpa/2 


(9.73) : 
< CYA‘ (Oy + RyullF1j2 + CRAP ull Za. + Cl|A*ull?. 


Now we have ||R’A*ull2,_1/. < Cl|V5A*ullZ,—12 +Cl|A*ull?,_1/2, so standard 


methods yield the a priori estimate 


(9.74) Vyull ze + lull zers/2 S Cll(Gs + Ryullzn—r2 + Cllullze, 


for all u € C™ (0, C”). 
We can go from the estimate (9.74) to the associated regularity theorem: 


Theorem 9.2. If $\Omega$ is a bounded, strongly pseudoconvex domain in $\mathbb{C}^n$ and $n \ge 3$, then the operator $\square_b + R$ given by (9.43)–(9.44) is hypoelliptic. If $u \in L^2(\partial\Omega, \mathbb{C}^n)$, then, for any $s \in [-1/2, \infty)$,


(9.75) $(\square_b + R)u \in H^s \Longrightarrow u \in H^{s+1}.$


Proof. We first establish the local version of (9.75), for s = k—1/2, k = 0,1, 2, 
and so on. Let © be a coordinate patch on OQ, identified with a ball in R?"*?. 
Take w; € C§°(O) such that wj;41 = 1 on supp yj. Let p € C§°(R?”*1) 
satisfy y(€) = 1 for |€| < 1, 0 for |&| > 3/2. Consider the following families of 
operators, for € € (0, 1]: 


9.76)  Jew=dr(2)p(eD)po(ax)u, Kou = o3(2)p(2-'eD)po2(x)u. 


We have 

(9.77) Je, Ke boundedin OPS}, O(e~*) in OPSZG, 
and 

(9.78) K-J- — J- bounded in OPS; ¢°. 


The formula (9.46) yields 


(9.79) Ke(Os + R) — (0, + R)Ke = — > _[Ke, X}] — i[Ke, oY] + [Ke, B, 
which is equal to 
(9.80) yak, Bje bounded in OPS} y, O(e~") inOPSzG. 


Also we have 


(9.81) Byj-RjJ- bounded in OPS, - 
Now, we apply (9.74) with u replaced by J-u, to get 


[Vs Jeullire + || JeullZasiy2 
< Ol|Ke(O, + R)ullzn—1/2 
+ C7 ||BieRj Jeulljnap + Cll Jeullzs 
< Ol|(Qe + Ryullye-sy2 + Cllullz2- 


(9.82) 


Passing to the limit ¢ — 0 yields the local version of the regularity result (9.75), 
fors = k—1/2, k=0,1,2,..., and the result (9.75) for general s € [—1/2, co) 
can be deduced via an interpolation argument. 


We are now in a position to prove the main result of this section. 
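Theorem 9.3. If $\Omega$ is a bounded, strongly pseudoconvex domain in $\mathbb{C}^n$, and $n \ge 2$, then the operator $\square^+$ given by (9.25) is hypoelliptic. If $u \in L^2(\partial\Omega, \mathbb{C}^n)$, then, for any $s \in [1/2, \infty)$,


(9.83) $\square^+ u \in H^s \Longrightarrow u \in H^s.$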


Proof. If n > 3, this is immediate from Theorem 9.2. It remains to deal with the 
case n = 2. 
What happens to the argument involving 0~-Ot = 0) + R when n = 2 is 


that we cannot pick y; (zx, ) to satisfy (9.54) and arrange that oy (2, €) . given 
by (9.57), be > co|€|, with co > 0. The reason is that Az is not positive definite. 
Recall from (9.53) that Ag(x) = 8 ((Tr L)I — La) , so if n = 2, zero must be an 
eigenvalue of Ao(x), V « € OQ. This makes it impossible to make oy (x, &) > 
co|€| on U~. However, we can still arrange that oy (x, €) > co|é| on UT. In fact, 


for this we can just take y)(a,€) = 1— 4, yo(a,€) = 4, for small, positive 6. 
To fix LI, + R, we merely alter it on a small conic neighborhood of ~. Set 


(9.84) p>=l,4+R4+5S, 


where S € OPS', S(«,€) is supported on a small conic neighborhood of =~, 
and furthermore S(x,€)|__ > ci ||, for sufficiently large, positive c,. Then the 


arguments used to prove Lemma 9.1 and Theorem 9.2, starting with (9.58)—(9.60), 
show that L], is hypoelliptic and 


(9.85) Chu € H® ue H*t!, 


Since Oy is equal to + microlocally near + and [1+ is elliptic away from 
=, this is enough to complete the proof of Theorem 9 .3. 


Though we will not pursue details, we mention that there are constructions
of parametrices for various classes of hypoelliptic operators with double
characteristics which include the operators studied above. Constructions making use
of Fourier integral operators are given in [B1, Sj], and [Tr2]. Another approach
makes use of analysis on the Heisenberg group; this was initiated in [FS] and
pursued in a number of papers, including [BG, BGS, BS, D, Gel, GS, Pon, RS, Tay], and
[Tay2]. The effectiveness of the $(2n-1)$-dimensional Heisenberg group arises as
follows. Each hypersurface $S \subset \mathbb{C}^n$ gives rise to a distinguished subbundle of
$TS$ whose fibers are invariant under multiplication by $i$. If $S$ is the boundary of
a strongly pseudoconvex domain, this bundle gives $S$ the structure of a contact
manifold. Now, it is a consequence of the Darboux theorem that any two
contact manifolds of the same dimension are locally contact equivalent. Hence the
Heisenberg group, with its natural contact structure, forms a good local model for
boundaries of strongly pseudoconvex domains. The Heisenberg group approach
provides a convenient way to obtain estimates in other function spaces, such as
$L^p$-Sobolev spaces and Hölder spaces, on which results were first obtained, by
other methods, in [Ker].

The reduction of the $\bar\partial$-Neumann problem to the study of $\Box^+$ was made in
[GS], for (0,1)-forms, on a strongly pseudoconvex manifold, with a special
metric, typically different from the Euclidean metric, called a Levi metric. A special
property of a Levi metric is that $\mathcal{L}$, arising as in (9.33) and (9.34), is a multiple of
the identity. A reduction of the $\bar\partial$-Neumann problem for (0,1)-forms, with a
general metric, was made in [Cha]. The analogous study for $(0,q)$-forms, $q \ge 1$, is
made in [LR], for a Levi metric, and in [BS] for a general metric. In these
analyses, $\Omega$ can be a general strongly pseudoconvex complex manifold, not necessarily
a domain in $\mathbb{C}^n$. In §10 we will derive estimates for the $\bar\partial$-Neumann problem on
such manifolds, via the sort of energy-estimate approach used in §§2–5 and 8.
The details of the reduction to the boundary made in this section would have to be
modified to treat the more general situation, since we made use of the fact that $\Box$
is (a constant multiple of) the Laplace operator, acting componentwise, on forms
on a domain in $\mathbb{C}^n$, with its standard flat metric.

While we have emphasized $C^\infty$ regularity, there are also results on the
analytic regularity of solutions to the $\bar\partial$-Neumann problem when $\partial\Omega$ is real analytic
and strongly pseudoconvex, in [Tar, Tr1], and [Gel], the latter two making use of
analytic pseudodifferential operator calculi on $\partial\Omega$.


Exercises 


1. Work out the formula for $\Box^+$ when $\Omega$ is the unit ball in $\mathbb{C}^n$, using (C.29), with $m = 2n$.
2. Extend the results of Theorems 9.2 and 9.3 to all $s$. (Hint: For any invertible, elliptic
$\Lambda^\sigma \in OPS^\sigma(\partial\Omega)$ with scalar principal symbol, write
$\Lambda^{-\sigma}(\Box_b + R)\Lambda^\sigma = \Box_b + R'$.)


3. Show that the regularity results of §5 follow from Theorem 9.3. (Hint: If $U$ solves
$\Box U = F$, with homogeneous $\bar\partial$-Neumann boundary conditions of the form (1.12),
write $U = U_0 + u$, where $U_0$ solves the Dirichlet problem

$\Box U_0 = F$ on $\Omega$, $\quad U_0|_{\partial\Omega} = 0,$

and $u$ solves (9.1), with


f =-o5+(a, v)BUo) -) 


4. Show that under the hypotheses of Theorem 9.3 (but with no restriction on s € R), 


(9.86) tu € H°(OQ) => u € H*(AQ) and Vague H* (AN). 


Let v = Plu. If (9.86) holds, for some s > —1/2, then 
v € Ht? (Q), 


5. Let P bea first-order differential operator with constant coefficients, acting on A?" (Q). 
Suppose 


(9.87) Pv i A(n) oe + Pranv, 
where v is the inward normal, and suppose 

(9.88) —A(x)|€| + oP, (x, €) vanishes on Dt. 
Show that if v is as in Exercise 4, then 


(9.89) Pv € H°(Q). 


(Hint: Pv is harmonic on Q. Write Pv = w = A(x)Nu+t Panu, and show that 
aa 
w € H®-1/2(9Q),) 
6. Suppose v satisfies the hypotheses of Exercise 4 and, in addition, that u = v isa 


an 
section of FE, so that Qu = u. Show that the conclusion (9.89) of Exercise 5 still holds, 
when the hypothesis (9.88) is weakened to 


(9.90) (-4@lel + OPy, (2, 2) Q(a) vanishes on DT. 


7. Show that Exercise 6 applies to Ov. Using this, establish (6.19). 


10. The $\bar\partial$-equation on complex manifolds and almost
complex manifolds


Let $\overline{M}$ be a compact $C^\infty$-manifold with boundary. We can assume $\overline{M}$ is contained
in a smooth manifold $O$ without boundary, such that the interior $M$ is open in $O$.
An almost complex structure on $\overline{M}$ is a smooth section $J$ of End$(T\overline{M})$ such that
$J^2 = -I$. If there is such a structure, the real dimension of $\overline{M}$ must be even,
say $k = 2n$. Thus, for $p \in \overline{M}$, $T_p\overline{M}$, regarded as a complex vector space, has
complex dimension $n$.

A (0,1)-form on $M$ is a section of the complexified cotangent bundle $\mathbb{C}T^*M$
of the form

(10.1)  $\alpha = \beta - iJ^t\beta,$

where $\beta$ is a section of $T^*M$, and $J^t : T_p^*M \to T_p^*M$ is the adjoint of $J$.
Similarly, a (1,0)-form on $M$ has the form $\alpha' = \beta + iJ^t\beta$. We have vector bundles
$\Lambda^{0,1}M$ and $\Lambda^{1,0}M$, and clearly $\mathbb{C}T^*M = \Lambda^{0,1}M \oplus \Lambda^{1,0}M$. An obvious algebraic
procedure yields subbundles $\Lambda^{p,q}M$ of $\mathbb{C}\Lambda^rT^*M$, $r = p + q$, and

(10.2)  $\mathbb{C}\Lambda^rT^*M = \bigoplus_{p+q=r} \Lambda^{p,q}M.$

We also use $\Lambda^{p,q}(M)$ to denote the spaces of $C^\infty$-sections of these bundles.
Let $\Pi_{p,q}$ denote the natural projection of $\mathbb{C}\Lambda^{p+q}T^*M$ onto $\Lambda^{p,q}M$. We define the
operators

(10.3)  $\bar\partial : \Lambda^{p,q}(M) \to \Lambda^{p,q+1}(M), \quad \partial : \Lambda^{p,q}(M) \to \Lambda^{p+1,q}(M)$
by
(10.4)  $\bar\partial u = \Pi_{p,q+1}\,du, \quad \partial u = \Pi_{p+1,q}\,du.$


The basic case of an almost complex manifold is a complex manifold. For
example, we say $O$ is a complex manifold if it has a covering by coordinate charts
(into open subsets of $\mathbb{C}^n$) whose transition maps are holomorphic. Then $M \subset O$
is a complex manifold with boundary. In such a case, $\bar\partial$ and $\partial$ are as defined in §1
(in any local holomorphic coordinate patch), and the following relations hold:

(10.5)  $\bar\partial^2 = 0, \quad \partial^2 = 0, \quad d = \partial + \bar\partial.$


These relations need not hold for an arbitrary, almost complex manifold, but it is 
readily verified that if any one of them holds, so do the other two. In such a case, 
the almost complex structure is said to be integrable. Thus, for a complex mani- 
fold, the almost complex structure is integrable. The converse to this is known as 
the Newlander—Nirenberg theorem; any manifold with an integrable, almost com- 
plex structure has a holomorphic coordinate chart. We will say more about this 
later in this section. There is also a direct characterization of the integrability con- 
dition in terms of J, which we will not need for our analysis, but we will mention 
it in the exercises. 

In the rest of this section, we assume $M$ has an integrable, almost complex
structure. As in earlier sections, we are interested in the equation

(10.6)  $\bar\partial u = f$ on $M$,

for $u \in \Lambda^{0,q}(M)$, given $f \in \Lambda^{0,q+1}(M)$, satisfying $\bar\partial f = 0$. For this to lead to a
$\bar\partial$-Neumann problem, we need an operator

(10.7)  $\bar\partial^* : \Lambda^{p,q+1}(M) \to \Lambda^{p,q}(M),$

a formal adjoint of $\bar\partial$. We assume $M$ has a Riemannian metric with the property
that $J : T_pM \to T_pM$ is an isometry, which can be obtained from an arbitrary
Riemannian metric by averaging over the action of $\{I, J, J^2, J^3\}$.
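
The averaging step can be checked at a single point with linear algebra. The following is only a minimal sketch, assuming the metric at a point is represented by a symmetric positive-definite matrix `G` and the almost complex structure by a matrix `J` with $J^2 = -I$; the helper names (`matmul`, `transpose`, `average_metric`) and the sample matrices are hypothetical and are not taken from the text.

```python
from fractions import Fraction


def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


def transpose(A):
    return [list(row) for row in zip(*A)]


def average_metric(G, J):
    """Average G over {I, J, J^2, J^3}, acting by G -> (J^k)^T G J^k."""
    n = len(G)
    P = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]  # J^0 = I
    total = [[Fraction(0)] * n for _ in range(n)]
    for _ in range(4):
        term = matmul(transpose(P), matmul(G, P))
        total = [[total[i][j] + term[i][j] for j in range(n)] for i in range(n)]
        P = matmul(P, J)  # next power of J
    return [[entry / 4 for entry in row] for row in total]


# Hypothetical sample data: a non-standard J on R^2 with J^2 = -I,
# and an arbitrary symmetric positive-definite G for which J is not an isometry.
J = [[Fraction(1), Fraction(-2)], [Fraction(1), Fraction(-1)]]
G = [[Fraction(2), Fraction(1)], [Fraction(1), Fraction(3)]]

G_avg = average_metric(G, J)
# J is an isometry of the averaged metric exactly when J^T G_avg J == G_avg.
J_is_isometry = matmul(transpose(J), matmul(G_avg, J)) == G_avg
```

Exact rational arithmetic is used so that the isometry condition can be compared with `==` rather than with a floating-point tolerance. Since $J^2 = -I$, the four-term average reduces to $(G + J^tGJ)/2$, which stays positive definite.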

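The Riemannian metric yields both a volume element on $M$ and a Hermitian
metric on $\mathbb{C}\Lambda^*T^*M$, hence inner products $(u,v)_{L^2}$ for $u, v \in \Lambda^{p,q}(M)$, and
Hilbert spaces $L^2(M, \Lambda^{p,q})$, as well as the operator $\bar\partial^*$. As in (2.2), we can define

(10.8)  $\mathcal{D}^{p,q} = \{u \in \Lambda^{p,q}(M) : \sigma_{\bar\partial^*}(x,\nu)u = 0 \text{ on } \partial M\}$

and then consider

(10.9)  $Q(u,v) = (\bar\partial u, \bar\partial v)_{L^2} + (\bar\partial^* u, \bar\partial^* v)_{L^2}, \quad u, v \in \mathcal{D}^{p,q}.$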


We can define the subbundle $\mathfrak{H}(\partial M)$ of $T(\partial M)$ in a fashion similar to that
done in §2; given $p \in \partial M$,

(10.10)  $\mathfrak{H}_p(\partial M) = \{X \in T_p(\partial M) : JX \in T_p(\partial M)\}.$

We can define the notion of a strongly pseudoconvex manifold, though it is more
convenient to use an approach of §B than that of §2. Suppose $\rho \in C^\infty(\overline{M})$, $\rho < 0$
on $M$, $\rho = 0$ on $\partial M$, $|\nabla\rho| = 1$ on $\partial M$. Then, as in (B.14)–(B.15), define the
Levi form as a quadratic form on $\mathfrak{H}(\partial M)$ by


0 2s (JX, X],a) = (da)(X, JX), a=J*(dp). 


If $\mathcal{L}(X,X) > 0$ for all nonzero $X \in \mathfrak{H}_p(\partial M)$, we say $M$ is strongly
pseudoconvex at $p$.

The version of Morrey’s inequality available in this setting is a little weaker 
than (2.1). We will prove the following. 


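Proposition 10.1. If $M$ is strongly pseudoconvex, then, for some $C > 0$, all $q \ge 1$,

(10.12)  $\|\bar\partial u\|_{L^2}^2 + \|\bar\partial^* u\|_{L^2}^2 + \|u\|_{L^2}^2 \ge C\int_{\partial M} |u|^2\,dS, \quad \forall u \in \mathcal{D}^{0,q}.$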


It will suffice to establish (10.12) when $u \in \mathcal{D}^{0,q}$ has support in some
coordinate patch $U$ intersecting $\partial M$. We can assume that, over $U$, there is a smooth
orthonormal frame field $\{\omega_j : 1 \le j \le n\}$ for $\Lambda^{1,0}$, with $\omega_n = \sqrt{2}\,\partial\rho$. Let
$\{L_j : 1 \le j \le n\}$ be the dual basis, consisting of (complex) vector fields. Set


(10.13) C2 =f0, ip) = ([Lj,Tu],wn), 1258241, 


One can verify that this is essentially equivalent to the Levi form. In particular, $M$
is strongly pseudoconvex if and only if $(C_{jk})$ is a positive-definite matrix, at each
$p \in \partial M$. Then, for $u \in \Lambda^{0,q}(U)$, we can write

(10.14)  $u = \sum_\alpha u_\alpha\,\bar\omega^\alpha.$

Here and below, all multi-indices will be increasing (see the conventions in §8).
Note that

(10.15)  $u \in \mathcal{D}^{0,q} \Longleftrightarrow u_\alpha = 0$ on $\partial M$ whenever $n \in \alpha$.


Using these frames, we have the following formulas: 


du = S- sen (.) (Lpte)w? + Au, 
(10.16) 


where $A$ and $B$ are operators of order zero.
Let us set

(10.17)  $E(u)^2 = \sum_{\alpha,k} \|\overline{L}_k u_\alpha\|_{L^2}^2 + \|u\|_{L^2}^2 + \int_{\partial M} |u|^2\,dS.$

The following result will suffice to prove Proposition 10.1.


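Lemma 10.2. Assume that $(C_{jk})$ is diagonal at $p \in \partial M$, with eigenvalues
$\lambda_1, \dots, \lambda_{n-1}$. Let $\delta > 0$ be given. There is a neighborhood $U$ of $p$ such that
if $q \ge 1$ and $u \in \mathcal{D}^{0,q}$ is supported in $U$, then

(10.18)  $\|\bar\partial u\|_{L^2}^2 + \|\bar\partial^* u\|_{L^2}^2 = \sum_{\alpha,k} \|\overline{L}_k u_\alpha\|_{L^2}^2 + \sum_\alpha \sum_{k \in \alpha} \lambda_k \int_{\partial M} |u_\alpha|^2\,dS + R(u),$

where

(10.19)  $|R(u)| \le \delta E(u)^2 + C\|u\|_{L^2}E(u).$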


To begin the estimates, we use (10.16) to write 


Oullz2 = So | Leeall72 
k¢a 


(10.20) 
o _ = 
+> J sen e sgn (“,) (Lita, Lyua)i2 + Ro, 




where |R2| < C >> ||Leual|z2||ul|z2 + Cllul|%2. Here and below, the quantities 
FR, will all satisfy estimates (either stronger than or) of the form (10.19). From 
this we can deduce 


|Oullz2 = Do |Lawall%2 
kia 


(10.21) 

_ S/ sgn (*) sen (2) (Lyte, Ljug)r2 + R3. 
Now 
eon —(Lyta, Lj ug) 12 = (Lj; Lyte, ug)r2+ Rg 


= —(Ljtta, Lyug) 2 + ([L;, Lele, ug)r2 + Rs, 


so the second term on the right side of (10.21) is equal to 


= S "sgn & sen & (Ljtte, Leug) 12 
+ S "sgn (*) sgn i) ([2j,Telue, ua) ra + Re. 


Meanwhile, the formula for d win (10.16) implies that the first sum in (10.23) is 
equal to 


(10.23) 


(10.24) =| wlze+ 
Putting together (10.21)-(10.24), we have 


Fullz2 +P ullz2 = >> Zeuallze 
kya 


+ S "sgn & sgn Ge ([2j,Tilua,ua) + Rg. 


To pass from here to (10.20), it remains to consider the second sum on the right 
side of (10.25). Making use of (10.15), one can show that if 7 € a andk € £, 
then 
(10.26) 

([2),TaJuasus) = V2(CjLntia,ug)iat+ Ry if j,k <n-1, 


(10.25) 


Rio ifj7 =nork=n. 
Now write 


(10.27)  $C_{jk} = \lambda_j\delta_{jk} + b_{jk}, \quad b_{jk}(p) = 0.$

Note that $\alpha = \beta$ for nonzero terms for which $j = k$, in the last sum in (10.25).
Also


(An6j¢ Ln ta, Ua) 52 = Andjk(Ua, Lntta) 12 


1 
+ Andjk i +(oL (2) tas Wa) dS + Ry 


(10.28) 2, 

1 

= —),6; Ua|? dS + Ryo. 
Jahan / tt ts 
OM 
Similarly, 
1 _ 

(10.29) (bjpLnUa;Ug)L2 = af bjpUatg dS + Ris, 


and if $U$ is such a small neighborhood of $p$ that $\sup |b_{jk}|$ is small compared to $\delta$,
this can also be denoted as $R_{14}$.
Combining (10.26)–(10.29) and summing, we have


Se()(2)(eni), 


we |ta|? dS + R45. 
kea OM 


(10.30) 


Using this in (10.25), we have Lemma 10.2. 

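We can now use the Friedrichs method to define an unbounded, self-adjoint
operator $\mathcal{L}$ on $L^2(M, \Lambda^{0,q})$, for any $q \ge 1$, such that $\mathcal{D}(\mathcal{L}^{1/2})$ is the completion
of $\mathcal{D}^{0,q}$ with respect to the square norm $Q(u,u) + \|u\|_{L^2}^2$, where $Q(u,v)$ is given
by (10.9), and $(\mathcal{L}u, v)_{L^2} = Q(u,v)$ for $u \in \mathcal{D}(\mathcal{L})$, $v \in \mathcal{D}^{0,q}$. One difference
between this situation and those that arose in §§3 and 8 is that, while $\mathcal{L} \ge 0$, we
might possibly have $0 \in \operatorname{spec} \mathcal{L}$. The estimates of §§3 and 4 and the regularity
result of §5 extend without difficulty if $M$ is strongly pseudoconvex, which we
will assume in the rest of this section.

We have

(10.31)  $u \in \mathcal{D}(\mathcal{L}),\ \mathcal{L}u = f \in H^j(M, \Lambda^{0,q}) \Longrightarrow u \in H^{j+1}(M, \Lambda^{0,q}),$

for $q \ge 1$; in particular,

(10.32)  $\operatorname{Ker} \mathcal{L} \subset \Lambda^{0,q}(M).$

Denote this space by $\mathcal{H}^{0,q}(M)$. In fact, we have

(10.33)  $\mathcal{H}^{0,q}(M) = \{u \in \mathcal{D}^{0,q} : \bar\partial u = 0 = \bar\partial^* u\}.$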


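Let $P_h$ denote the orthogonal projection of $L^2(M, \Lambda^{0,q})$ onto $\operatorname{Ker} \mathcal{L}$. Given
$f \in L^2(M, \Lambda^{0,q})$, let $Gf$ denote the unique element $u$ of $(\operatorname{Ker} \mathcal{L})^\perp$ such that
$\mathcal{L}u = (I - P_h)f$. Clearly,

(10.34)  $G : H^j(M, \Lambda^{0,q}) \to H^{j+1}(M, \Lambda^{0,q}),$

for $q \ge 1$. We have the following Hodge decomposition for $u \in \Lambda^{0,q}(M)$:

(10.35)  $u = \bar\partial\bar\partial^* Gu + \bar\partial^*\bar\partial Gu + P_h u = P_{\bar\partial}u + P_{\bar\partial^*}u + P_h u.$

Arguments used in §§6 and 8 extend to show that these three terms are mutually
orthogonal in $L^2(M, \Lambda^{0,q})$, so $P_{\bar\partial}$ and $P_{\bar\partial^*}$ (as well as $P_h$) extend to bounded
operators on $L^2(M, \Lambda^{0,q})$, which in fact are orthogonal projections. Such
arguments as used before also yield

(10.36)  $P_{\bar\partial},\ P_{\bar\partial^*} : H^j(M) \to H^{j-\varepsilon}(M), \quad \forall \varepsilon > 0.$

In connection with the Hodge decomposition, note that just as in (6.4), we have

(10.37)  $w \in \mathcal{D}^{0,q+1} \Longrightarrow \bar\partial^* w \perp \ker \bar\partial \cap \Lambda^{0,q}(M).$

We hence have the following extension of Theorem 6.2 and Corollary 8.5:

Proposition 10.3. If $q \ge 1$ and $f \in \Lambda^{0,q}(M)$ satisfies

(10.38)  $\bar\partial f = 0$ and $f \perp \mathcal{H}^{0,q}(M)$,

then there exists $u \in \Lambda^{0,q-1}(M)$ satisfying $\bar\partial u = f$.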


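Proof. With $g = Gf \in \Lambda^{0,q}(M)$, we have the decomposition into orthogonal
pieces

(10.39)  $f = \bar\partial\bar\partial^* g + \bar\partial^*\bar\partial g + P_h f.$

Now (10.37), applied to $w = \bar\partial g$, implies $\bar\partial^*\bar\partial g \perp f$ (since $\bar\partial f = 0$), so by the
orthogonality of the pieces this term vanishes, while the second hypothesis
in (10.38) implies $P_h f = 0$. This gives

(10.40)  $f = \bar\partial(\bar\partial^* g),$

so we have the desired result, with $u = \bar\partial^* g$.

Note also by (6.3) that if $u \in \Lambda^{0,q-1}(M)$ and $f = \bar\partial u$, then $f \perp w$
whenever $w \in \mathcal{D}^{0,q}$ and $\bar\partial^* w = 0$, so the condition $f \perp \mathcal{H}^{0,q}(M)$ is necessary for
solvability.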


Since solving $\bar\partial u = f$ is of primary importance, the last result suggests two
objects of study: Determine when $\mathcal{H}^{0,q}(M) = 0$, and work out how to deal with
the requirement that $f \perp \mathcal{H}^{0,q}(M)$ if you cannot show that $\mathcal{H}^{0,q}(M) = 0$.

Here is an example of the first sort. Suppose $J_s$ is a smooth family of integrable,
almost complex structures on a compact Riemannian manifold with boundary,
parameterized by $s \in [0, a]$. We can adjust the metric to depend smoothly on
$s$ and make each $J_s$ an isometry. Denote the resulting object by $M_s$. We have
operators $\bar\partial_s$, $\bar\partial_s^*$, $\mathcal{L}_s$, etc.; we often drop the subscript when $s = 0$. Assume $M_0$
is strongly pseudoconvex; then $M_s$ is strongly pseudoconvex if $|s|$ is sufficiently
small.


Proposition 10.4. If $q \ge 1$ and $\mathcal{H}^{0,q}(M_0) = 0$, then $\mathcal{H}^{0,q}(M_s) = 0$ for $|s|$
sufficiently small.

Proof. The proof of the Morrey-type estimates and consequent derivation of the
1/2-estimate yields

(10.41)  $(I + \mathcal{L}_s)^{-1}$ bounded in $\mathcal{L}\bigl(L^2(M_s, \Lambda^{0,q}), H^{1/2}(M_s, \Lambda^{0,q})\bigr)$

for $|s|$ small. Now, suppose $s_j \to 0$, $\mathcal{H}^{0,q}(M_{s_j}) \ne 0$. Pick

(10.42)  $u_{s_j} \in \mathcal{H}^{0,q}(M_{s_j}), \quad \|u_{s_j}\|_{L^2} = 1.$

Then (10.41) implies $\|u_{s_j}\|_{H^{1/2}} \le A$. Hence, passing to a subsequence, we have

(10.43)  $u_{s_j} \to u_0$ strongly in $L^2(M)$, weakly in $H^{1/2}(M)$.

In particular, $\|u_0\|_{L^2} = 1$. Now, via (10.37), we can say

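(10.44)  $u_{s_j} \perp \bar\partial_{s_j}^*\bigl(\mathcal{D}_{s_j}^{0,q+1}\bigr) \Longrightarrow u_0 \perp \bar\partial^*\bigl(\mathcal{D}^{0,q+1}\bigr),$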

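while, by the remark after (10.40), we can say

(10.45)  $u_{s_j} \perp \bar\partial_{s_j}\bigl(\Lambda^{0,q-1}(M_{s_j})\bigr) \Longrightarrow u_0 \perp \bar\partial\bigl(\Lambda^{0,q-1}(M)\bigr).$

The conclusions of (10.44) and (10.45) imply that $u_0 \in \mathcal{H}^{0,q}(M_0)$. Since
$\|u_0\|_{L^2} = 1$, this means $\mathcal{H}^{0,q}(M_0) \ne 0$, which contradicts the hypothesis
$\mathcal{H}^{0,q}(M_0) = 0$. Hence the assumption that $\mathcal{H}^{0,q}(M_{s_j}) \ne 0$ along a sequence
$s_j \to 0$ cannot hold.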


Corollary 10.5. Under the hypotheses of Proposition 10.4, we have, for each 
gu, 


(10.46)  $\mathcal{L}_s^{-1}$ bounded in $\mathcal{L}\bigl(H^j(M_s, \Lambda^{0,q}), H^{j+1}(M_s, \Lambda^{0,q})\bigr),$


for |s| sufficiently small. 


Proof. A check of the sorts of estimates arising in §4 shows that

(10.47)  $(I + \mathcal{L}_s)^{-1}$ bounded in $\mathcal{L}\bigl(H^j(M_s, \Lambda^{0,q}), H^{j+1}(M_s, \Lambda^{0,q})\bigr),$


for |s| sufficiently small. Passing from this to (10.46) can be done by arguments 
similar to those used in the proof of Proposition 10.4. 


Let us give an example of a situation where Proposition 10.4 and
Corollary 10.5 apply. Let $O$ be a manifold of real dimension $2n$, with an integrable,
almost complex structure $J$, and fix $p \in O$. Without loss of generality, we
identify a neighborhood $U$ of $p$ with an open set in $\mathbb{C}^n$, and suppose $J|_{T_pU}$ coincides
with the "standard" complex structure on $\mathbb{C}^n$, which for now we denote $J_0$.
We may as well suppose $p = 0$. Use the standard metric on $\mathbb{C}^n$, and consider
$B_\varepsilon = \{z \in \mathbb{C}^n : |z| < \varepsilon\}$. For small $\varepsilon$, $B_\varepsilon$ is strongly pseudoconvex both for $J_0$
and for $J$. Now we produce a family $M_s$ of almost complex manifolds as follows.
As a set, $M_s = M = B_1 = \{z \in \mathbb{C}^n : |z| < 1\}$. We have $\varphi_s : M_s \to B_s$, given
by $\varphi_s(z) = sz$, and we pull back $J$ (restricted to $B_s$) to get integrable, almost
complex structures $J_s$ on $M_s$, for $0 < s \le a$ (for some $a > 0$). Clearly, such $J_s$
and $J_0$ fit together smoothly. Also, since $M_0$ is a strongly pseudoconvex domain
in $\mathbb{C}^n$, the results of §§2 and 8 imply $\mathcal{H}^{0,q}(M_0) = 0$ for $q \ge 1$. Using this family,
we will now prove the Newlander–Nirenberg theorem. This proof was given by
Kohn in [K1], following a suggestion of D. Spencer.


Theorem 10.6. If $O$ has an integrable, almost complex structure, then $O$ has
holomorphic coordinate charts, so $O$ is a complex manifold.

Proof. It suffices to show that any point $p \in O$ has a neighborhood $B$ such that
there are smooth complex-valued functions $u_j$ on $B$ (i.e., $u_j \in \Lambda^{0,0}(B)$) that are
holomorphic (i.e., $\bar\partial u_j = 0$), and such that $du_1, \dots, du_n$ are linearly independent
at $p$.

Let us bring in the structure described in the preceding paragraph. Thus we
have a family of small neighborhoods $B_s$ of $p$, blown up to $M_s$, with integrable,
almost complex structures, and Proposition 10.4 and Corollary 10.5 apply to $M_s$
for $|s|$ small. As a set, $M_s = M$ is the unit ball in $\mathbb{C}^n$. We will be done if we
produce some $s_0 > 0$ and $u_1, \dots, u_n \in \Lambda^{0,0}(M_{s_0})$ such that $\bar\partial_{s_0}u_j = 0$ and
$du_j(0)$ are linearly independent.

We write $u_{js} = z_j + v_{js}$, where the functions $z_j$ are the standard coordinate
functions on $\mathbb{C}^n$, and we pick $v_{js}$ to be convenient solutions to

(10.48)  $\bar\partial_s v_{js} = r_{js}$ on $M_s$, $\quad r_{js} = -\bar\partial_s z_j.$

Namely, we take

(10.49)  $v_{js} = \bar\partial_s^*\mathcal{L}_s^{-1}r_{js}.$


It follows from Proposition 10.4 and Corollary 10.5 that, for $|s|$ sufficiently small,
(10.49) is well defined, and, for each $\ell \in \mathbb{Z}^+$,

(10.50)  $\|v_{js}\|_{H^\ell(M)} \le K_\ell\|r_{js}\|_{H^\ell(M)},$

with $K_\ell$ independent of $s$.
On the other hand, since $J_s$ approaches $J_0$ as $s \to 0$, we have

(10.51)  $\|r_{js}\|_{H^\ell(M)} \le C_\ell\,s.$

Now if we pick $\ell > n + 1$, we deduce that

(10.52)  $\bar\partial_s u_{js} = 0, \quad u_{js} = z_j + v_{js}, \quad \|v_{js}\|_{C^1} \le Cs.$

It is thus clear that, for $s = s_0$ sufficiently small, a desired coordinate system is
produced.


The reader can compare this argument with the proof of the existence of 
isothermal coordinates on a two-dimensional Riemannian manifold, given in §10 
of Chap. 5. 


Further work on the Newlander-Nirenberg theorem. 


The treatment of Theorem 10.6 given above was done in the setting of a $C^\infty$
smooth almost complex structure. It would be possible to adapt the argument to
treat the case where the coefficients of $J$ have a finite degree of smoothness, say
$C^k$, but a straightforward attack would require $k$ to be fairly large. The paper
[NW] established Theorem 10.6 in the setting that $J \in C^{1+\varepsilon}$, $\varepsilon > 0$,
assuming the integrability condition, which can also be stated as the vanishing of the
Nijenhuis tensor $\mathcal{N}(X,Y) = 2JN(X,Y)$, where

(10.53)  $N(X,Y) = \frac{1}{4}\bigl([X,Y] - [JX,JY]\bigr) + \frac{1}{4}J\bigl([JX,Y] + [X,JY]\bigr).$

See Exercises 1–7 below for more on $\mathcal{N}$. Note that if $J \in C^1$, then $\mathcal{N}$ and $N$ are
well defined continuous tensor fields.
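
As a numerical sanity check of the formula for $N$, one can use the fact that in real dimension 2 the tensor $N$ vanishes identically (it is antisymmetric and satisfies $N(JX,Y) = -JN(X,Y)$, so $N(X,JX) = 0$, and $X$, $JX$ span the tangent space). The sketch below evaluates $N(X,Y) = \frac14([X,Y]-[JX,JY]) + \frac14 J([JX,Y]+[X,JY])$ at one point of $\mathbb{R}^2$ with central differences. The particular $J$, the vector fields `X` and `Y`, the step size `h`, and all function names are hypothetical choices made for illustration only.

```python
def J(p):
    """A non-constant almost complex structure on R^2 (illustrative choice)."""
    x, y = p
    a = x * y
    c = 1.0 + x * x
    b = -(1.0 + a * a) / c  # forces a^2 + b*c = -1, i.e. J(p)^2 = -I
    return [[a, b], [c, -a]]


def apply_J(F):
    """Return the vector field p -> J(p) F(p)."""
    def JF(p):
        m, v = J(p), F(p)
        return [m[0][0] * v[0] + m[0][1] * v[1],
                m[1][0] * v[0] + m[1][1] * v[1]]
    return JF


def bracket(F, G, h=1e-5):
    """Lie bracket [F, G]^i = F^k d_k G^i - G^k d_k F^i via central differences."""
    def B(p):
        f, g = F(p), G(p)
        out = [0.0, 0.0]
        for k in range(2):
            plus = [p[0] + h * (k == 0), p[1] + h * (k == 1)]
            minus = [p[0] - h * (k == 0), p[1] - h * (k == 1)]
            Gp, Gm, Fp, Fm = G(plus), G(minus), F(plus), F(minus)
            for i in range(2):
                dG = (Gp[i] - Gm[i]) / (2 * h)
                dF = (Fp[i] - Fm[i]) / (2 * h)
                out[i] += f[k] * dG - g[k] * dF
        return out
    return B


def N(X, Y, p):
    """Evaluate (1/4)([X,Y] - [JX,JY]) + (1/4) J([JX,Y] + [X,JY]) at p."""
    JX, JY = apply_J(X), apply_J(Y)
    b1, b2 = bracket(X, Y)(p), bracket(JX, JY)(p)
    b3, b4 = bracket(JX, Y)(p), bracket(X, JY)(p)
    w = [b3[0] + b4[0], b3[1] + b4[1]]
    m = J(p)
    Jw = [m[0][0] * w[0] + m[0][1] * w[1], m[1][0] * w[0] + m[1][1] * w[1]]
    return [0.25 * (b1[i] - b2[i]) + 0.25 * Jw[i] for i in range(2)]


# Two arbitrary smooth vector fields on R^2 (illustrative).
X = lambda p: [1.0 + p[1], p[0] ** 2]
Y = lambda p: [p[0] * p[1], 1.0 - p[0]]

value = N(X, Y, [0.3, -0.7])  # expected to vanish up to discretization error
```

The individual brackets do not vanish here; only their combination does, up to finite-difference error. In real dimension 4 or more the same routine, extended to more coordinates, would generally return a nonzero value for a non-integrable $J$.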

Another proof was given in [Mal], also assuming $J \in C^{1+\varepsilon}$. The approach
there was to construct a holomorphic coordinate chart $F$ of the form $F = G \circ H$
(Malgrange factorization), with separate techniques applied to produce the factors
$H$ and $G$. The construction involves an auxiliary map $B$, for which the vanishing
of $N$ yields an overdetermined elliptic system, forcing $B$ to be real analytic and
allowing one to finish with the version of Newlander–Nirenberg in the real
analytic category, where the Cartan–Kähler theorem (which can be established via the
Cauchy–Kowalewsky theorem) applies.

The scope of the Newlander-Nirenberg theorem was further broadened in [HT], 
which established the following. 


Theorem 10.7. Assume the almost complex structure $J$ on $O$ satisfies

(10.54)  $J \in C^r(O), \quad r > \frac{1}{2}.$

Then the Nijenhuis tensor $\mathcal{N}$ is a well defined distribution on $O$. If this distribution
vanishes, then local holomorphic coordinates exist on $O$.


The proof in [HT] is adapted from that of [Mal], though new analytical tech- 
niques are required to deal with the substantially rougher tensors that arise. This 
is accomplished in [HT] via use of paradifferential calculus, a collection of tools 
for nonlinear analysis that will be treated in Chapter 13 of this work. The results 
of [HT] were developed further in [HT2]. 

It is particularly significant that Theorem 10.7 allows for complex structures J 
that are rougher than Lipschitz. An application of this result arises in [LM]. In that 
work, one encounters an almost complex structure J that is smooth and integrable 
off a submanifold, in a neighborhood of which it is a bit rougher than Lipschitz, 
but smoother than $C^{1/2}$.


An important class of strongly pseudoconvex complex manifolds arises as
follows. Let $X$ be a compact, real analytic manifold; we can regard $X \subset TX$ as the
zero section. Then there is a neighborhood $U$ of $X$ in $TX$ that has the structure
of a complex manifold, and $U$ contains a strongly pseudoconvex neighborhood
$M$ of $X$, diffeomorphic to the unit ball bundle of $X$ (given some Riemannian
metric), so $\partial M$ is diffeomorphic to the unit sphere bundle of $X$. The solution to
the $\bar\partial$-Neumann problem on such $M$ yields the result that there is a real analytic
imbedding of $X$ into Euclidean space $\mathbb{R}^N$. See Chap. 8 of [Mor] for an account
of this. It was in the process of tackling this problem that the "Morrey inequality"
was derived.

In this section we have continued to restrict our attention to the case of strongly
pseudoconvex manifolds. However, as discovered in [Hol], the basic estimate
(10.12) holds for $u \in \mathcal{D}^{p,q}$ under a more general condition, called "condition
$Z(q)$." This condition is that (if $\dim \partial M = 2n - 1$) the Levi form has either at
least $n - q$ positive eigenvalues or at least $q + 1$ negative eigenvalues. A strongly
pseudoconvex manifold satisfies condition $Z(q)$ for all $q \ge 1$, and for a bounded
domain in $\mathbb{C}^n$ this is the only way condition $Z(q)$ can be satisfied, at least over all
of $\partial M$. But there are open domains $M$ with smooth boundary in compact,
complex manifolds (such as complex projective space) which can satisfy condition
$Z(q)$ for some but not all $q$, by virtue of the Levi form on $\partial M$ having some
negative eigenvalues. A proof of the estimate (10.12) under condition $Z(q)$ is given
in [FK]. Also, [BS] analyzes the $\bar\partial$-Neumann problem via reduction to
pseudodifferential operators on $\partial M$, under condition $Z(q)$.




Exercises 


In Exercises 1–7, $M$ is a Riemannian manifold with an almost complex structure $J$
satisfying $\langle X, Y\rangle = \langle JX, JY\rangle$, where $\langle\ ,\ \rangle$ is the Riemannian inner product.
1. For $X, Y \in T_pM$, set

$(X, Y) = \langle X, Y\rangle + i\langle X, JY\rangle.$

Show that

$(X, X) = \langle X, X\rangle, \quad (JX, Y) = i(X, Y), \quad (X, Y) = \overline{(Y, X)},$

so we have a Hermitian metric. Thus we will call such $M$ a Hermitian manifold.
2. Show that if $M$ is actually a complex manifold, the Lie bracket of two vector fields
$X + iJX$ and $Y + iJY$ has the same form, that is,

$N(X,Y) = \frac{1}{4}\bigl([X,Y] - [JX,JY]\bigr) + \frac{1}{4}J\bigl([JX,Y] + [X,JY]\bigr)$

vanishes. Show that $N(fX, gY) = fgN(X,Y)$ for $f, g \in C^\infty(M)$, so $N$ defines a
tensor field of type (1,2). A related tensor $\mathcal{N}$, defined by $\mathcal{N}(X,Y) = 2JN(X,Y)$, is
called the Nijenhuis tensor. Show also that

$N(X,Y) = -N(Y,X), \quad JN(JX,Y) = N(X,Y), \quad N(JX,JY) = -N(X,Y).$

3. Show that on any almost complex manifold, the vanishing of $\mathcal{N}$ is equivalent to the
integrability condition (10.5).
4. Let $\nabla$ be the Levi-Civita connection on $M$, and set

$\omega(X,Y) = \langle X, JY\rangle,$

a 2-form on $M$. Show that

$2\langle(\nabla_XJ)Y, Z\rangle = (d\omega)(X, JY, JZ) - (d\omega)(X, Y, Z) + \langle\mathcal{N}(Y,Z), X\rangle.$

A Riemannian manifold $M$ is called a symmetric space if, for each $p \in M$, there is
an isometry $\iota_p : M \to M$ such that $\iota_p(p) = p$ and $D\iota_p(p) = -I$ on $T_pM$. If $M$ is an
almost complex manifold with metric as above (i.e., a Hermitian manifold), and is also
a symmetric space, and if each isometry $\iota_p$ preserves the almost complex structure $J$,
then $M$ is called a Hermitian symmetric space.

5. Show that if $M$ is a Hermitian symmetric space, then, for all vector fields $X$,

$\nabla_XJ = 0.$

(Hint: Consider the tensor field $F = \nabla J$, of type (1,2). Show that $\iota_p^*F = F$ on $M$ and
that $\iota_p^*F = -F$ at $p$, so that $F = -F$ at $p$, for all $p \in M$.)

6. Show that the almost complex structure of any Hermitian symmetric space is integrable.
(Hint: Show that $\nabla J = 0 \Rightarrow \nabla\omega = 0 \Rightarrow d\omega = 0$, and then use Exercise 4.)

7. More generally, a Hermitian manifold $M$ is said to be a Kähler manifold if $\nabla J = 0$.
Show that $M$ is Kähler if and only if the almost complex structure $J$ is integrable and
$d\omega = 0$.

8. Show that the $\bar\partial$-operator is well defined:

$\bar\partial : C^\infty(M, E \otimes \Lambda^{0,q}) \to C^\infty(M, E \otimes \Lambda^{0,q+1}),$

for any holomorphic vector bundle $E$ over the complex manifold with boundary $M$.
Extend the results of this section to this case.


B. Complements on the Levi form 


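In this appendix we will give further formulas and other results for the Levi form
on a hypersurface in $\mathbb{C}^n$. As a preliminary, we reexamine the formulas (2.7) and
(2.8) in terms of the complex vector fields

(B.1)  $Z = \sum_j u_j\frac{\partial}{\partial z_j}, \quad \overline{Z} = \sum_j \overline{u}_j\frac{\partial}{\partial\overline{z}_j}.$

We assume that at each $p \in \partial\Omega$, $u(p) = (u_1, \dots, u_n)$ belongs to $\mathfrak{H}_p(\partial\Omega)$, defined
by (2.13). As noted in §2, this hypothesis is equivalent both to $Z\rho = 0$ and to
$\overline{Z}\rho = 0$ on $\partial\Omega$. Then (2.7) simply says $Z\overline{Z}\rho = 0$ on $\partial\Omega$. Also, of course, $\overline{Z}Z\rho = 0$
on $\partial\Omega$, and hence

$[Z, \overline{Z}]\rho = 0$ on $\partial\Omega$,

but this is not the content of (2.8). To restate (2.8), note that

(B.2)  $[Z, \overline{Z}] = \sum_{j,k}\Bigl(u_j\frac{\partial\overline{u}_k}{\partial z_j}\frac{\partial}{\partial\overline{z}_k} - \overline{u}_k\frac{\partial u_j}{\partial\overline{z}_k}\frac{\partial}{\partial z_j}\Bigr).$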


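Now let us apply the operator $J$ that gives the complex structure of $\mathbb{C}^n$, so

(B.3)  $J\frac{\partial}{\partial x_j} = \frac{\partial}{\partial y_j}, \quad J\frac{\partial}{\partial y_j} = -\frac{\partial}{\partial x_j},$

and hence

(B.4)  $J\frac{\partial}{\partial z_j} = i\frac{\partial}{\partial z_j}, \quad J\frac{\partial}{\partial\overline{z}_j} = -i\frac{\partial}{\partial\overline{z}_j}.$

We have

(B.5)  $W = J[Z, \overline{Z}] = -i\sum_{j,k}\Bigl(u_j\frac{\partial\overline{u}_k}{\partial z_j}\frac{\partial}{\partial\overline{z}_k} + \overline{u}_j\frac{\partial u_k}{\partial\overline{z}_j}\frac{\partial}{\partial z_k}\Bigr),$

where to get the last term in parentheses from $J$ applied to (B.2), we have
interchanged the roles of $j$ and $k$. Hence

(B.6)  $W\rho = -2i\,\mathrm{Re}\Bigl(\sum_{j,k}\overline{u}_j\frac{\partial u_k}{\partial\overline{z}_j}\frac{\partial\rho}{\partial z_k}\Bigr).$

Now the quantity in parentheses here is precisely the left side of (2.8). Since the
right side of (2.8) is clearly real-valued, we have

(B.7)  $\frac{1}{2i}\bigl(J[Z, \overline{Z}], d\rho\bigr) = \sum_{j,k} L_{jk}u_j\overline{u}_k$ on $\partial\Omega$.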


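Let $\alpha = J^t d\rho$, so the left side of (B.7) is $1/2i$ times $\alpha([Z, \overline{Z}])$. Since

$(d\alpha)(Z, \overline{Z}) = Z\cdot\alpha(\overline{Z}) - \overline{Z}\cdot\alpha(Z) - \alpha([Z, \overline{Z}])$

and $\alpha(\overline{Z}) = d\rho(J\overline{Z}) = -i\overline{Z}\rho = 0$, and similarly $\alpha(Z) = 0$, we have
$(d\alpha)(Z, \overline{Z}) = -\alpha([Z, \overline{Z}])$, so (B.7) implies

(B.8)  $\sum_{j,k} L_{jk}u_j\overline{u}_k = -\frac{1}{2i}(d\alpha)(Z, \overline{Z}),$

another useful formula for the Levi form.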
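It is also useful to write these formulas in terms of

(B.9)  $X = \sum_k\Bigl(f_k\frac{\partial}{\partial x_k} + g_k\frac{\partial}{\partial y_k}\Bigr), \quad u_k = f_k + ig_k,$

where $f_k = \mathrm{Re}\,u_k$, $g_k = \mathrm{Im}\,u_k$. The hypothesis $Z\rho = 0$ is still in effect, so, for
$p \in \partial\Omega$, $X(p) \in \mathfrak{H}_p(\partial\Omega) \subset T_p\partial\Omega \subset \mathbb{R}^{2n}$. Note that

(B.10)  $2Z = X - iJX, \quad 2\overline{Z} = X + iJX.$

Thus (B.8) implies

(B.11)  $4\sum_{j,k} L_{jk}u_j\overline{u}_k = -(d\alpha)(X, JX).$

Following the trail (B.7)–(B.8) backward, we note that $(d\alpha)(X, JX) =
X\cdot\alpha(JX) - (JX)\cdot\alpha(X) - \alpha([X, JX])$ and $\alpha(X) = 0 = \alpha(JX)$, so
$(d\alpha)(X, JX) = -\alpha([X, JX])$, and hence

(B.12)  $4\sum_{j,k} L_{jk}u_j\overline{u}_k = \alpha([X, JX]) = \bigl(J[X, JX], d\rho\bigr).$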
jk 


Note also that, by direct calculation,

(B.13)  $4\sum_{j,k} L_{jk}u_j\overline{u}_k = H(X, X) + H(JX, JX),$

where $H$ is the $(2n) \times (2n)$ real Hessian matrix of second-order partial derivatives
of $\rho$ with respect to $(x_1, \dots, x_n, y_1, \dots, y_n)$.
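
The identity relating the Levi form to the real Hessian can be checked numerically for a quadratic $\rho$, whose Hessian is a constant symmetric matrix. The sketch below assumes $L_{jk} = \partial^2\rho/\partial z_j\partial\overline{z}_k$ and computes it from the real Hessian with the Wirtinger derivatives $\partial_{z_j} = \frac12(\partial_{x_j} - i\partial_{y_j})$ and $\partial_{\overline{z}_k} = \frac12(\partial_{x_k} + i\partial_{y_k})$. The matrix `H`, the vector `u`, and the function names are hypothetical sample data, not taken from the text.

```python
def levi_quadratic(H, u):
    """sum_{j,k} L_{jk} u_j conj(u_k), with L_{jk} built from the real Hessian H
    taken in the coordinate order (x_1..x_n, y_1..y_n)."""
    n = len(u)
    total = 0j
    for j in range(n):
        for k in range(n):
            # (1/4)(d_{x_j} - i d_{y_j})(d_{x_k} + i d_{y_k}) applied to rho
            L_jk = 0.25 * (H[j][k] + H[n + j][n + k]
                           + 1j * (H[j][n + k] - H[n + j][k]))
            total += L_jk * u[j] * u[k].conjugate()
    return total


def quad(H, v):
    """Real quadratic form H(v, v)."""
    return sum(H[a][b] * v[a] * v[b] for a in range(len(v)) for b in range(len(v)))


# Illustrative symmetric Hessian for n = 2 and a sample vector u = f + i g.
H = [[2.0, 1.0, 0.0, 3.0],
     [1.0, 4.0, -1.0, 2.0],
     [0.0, -1.0, 6.0, 1.0],
     [3.0, 2.0, 1.0, 5.0]]
u = [1.0 + 2.0j, -1.0 + 0.5j]

X = [z.real for z in u] + [z.imag for z in u]    # X = sum f_k d/dx_k + g_k d/dy_k
JX = [-z.imag for z in u] + [z.real for z in u]  # J d/dx = d/dy, J d/dy = -d/dx

lhs = 4 * levi_quadratic(H, u)
rhs = quad(H, X) + quad(H, JX)
```

The two sides agree for any symmetric `H` and any `u`; no tangency condition is needed for this algebraic identity. Since the right side is quadratic in $JX$, the check does not depend on the sign convention chosen for $J$.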

We can recast the Levi form in the following more invariant way, as done in
[HN]. For a local section $X$ of $\mathfrak{H}(\partial\Omega)$, we will define


(B.14) Ly(X,X) € #7 (AQ), 


where $\mathfrak{H}_p^0(\partial\Omega) \subset T_p^*(\partial\Omega)$ is the annihilator of $\mathfrak{H}_p(\partial\Omega) \subset T_p(\partial\Omega)$. To do this,
we set

(B.15)  $4\mathcal{L}(X, X)(\alpha) = ([X, JX], \alpha) = -(d\alpha)(X, JX).$

When $\alpha = J^t d\rho$, this coincides with (B.11)–(B.12). This object is clearly
invariant under conjugation by biholomorphic maps (i.e., under biholomorphic changes
of coordinates). The property of positivity of $\mathcal{L}$ is invariantly defined, since the
real line bundle $\mathfrak{H}^0(\partial\Omega)$ has a natural orientation, defined by declaring that
$J^t d\rho > 0$. Thus we have the following:

Proposition B.1. If $\Omega$ is strongly pseudoconvex at $p \in \partial\Omega$ and if $F : O \to U
\subset \mathbb{C}^n$ is a biholomorphic map defined on a neighborhood $O$ of $p$, then $F(O \cap \Omega) =
\widetilde{\Omega}$ is strongly pseudoconvex at $\tilde{p} = F(p)$.


It follows readily from (B.13) that $\Omega$ is strongly pseudoconvex at any $p \in \partial\Omega$
at which $\partial\Omega$ is strongly convex. By Proposition B.1 we see then that any (local)
biholomorphic image of a strongly convex $\Omega \subset \mathbb{C}^n$ is (locally) strongly
pseudoconvex.

We can also relate the Levi form to the second fundamental form of $\partial\Omega$ as a
hypersurface of $\mathbb{R}^{2n}$, using the following:

Lemma B.2. If $II$ is the second fundamental form of $\partial\Omega \subset \mathbb{R}^{2n}$, and if $X$ is a
section of $\mathfrak{H}(\partial\Omega)$, then

(B.16)  $II(X, X) = -P_NJ\,\nabla_X(JX) = -JP_{JN}\,\nabla_X(JX).$

Here, $\nabla$ is the Levi-Civita connection on $\partial\Omega$, $P_N$ is the orthogonal projection
of $\mathbb{R}^{2n}$ onto the span of $N = -\nabla\rho$ (the sign chosen so $N$ points inward), and
$P_{JN}$ is the orthogonal projection of $\mathbb{R}^{2n}$ onto the span of $JN$. We denote the
span of $JN$ by $\mathfrak{H}^\perp(\partial\Omega)$, which is isomorphic to $\mathfrak{H}^0(\partial\Omega)$, via the Riemannian
metric on $\partial\Omega$.

To prove the lemma, recall from §4 of Appendix C (Connections and
Curvature) that if $X$ and $Y$ are tangent to $\partial\Omega$, then $II(X, Y) = P_ND_XY$, where
$D_X$ denotes the standard flat connection on $\mathbb{R}^{2n}$. Of course, also $II(X, Y) =
D_XY - \nabla_XY$. Note that $D_X(JY) = JD_XY$, so $II(JX, X) = II(X, JX) =
P_ND_X(JX) = P_NJ(D_XX)$. Hence

(B.17)  $II(JX, X) = P_NJ\,II(X, X) + P_NJ\,\nabla_XX.$

Similarly, $II(JX, JX) = P_NJD_{JX}X = P_NJ\,II(JX, X) + P_NJ\,\nabla_{JX}X$, and
substituting (B.17) yields

$II(JX, JX) = P_NJP_NJ\,II(X, X) + P_NJP_NJ\,\nabla_XX + P_NJ\,\nabla_{JX}X.$

Now, $JP_NJ = -P_{JN}$, which is orthogonal to $P_N$, so $P_NJP_NJ = 0$, and we
have

(B.18)  $II(JX, JX) = P_NJ\,\nabla_{JX}X = JP_{JN}\,\nabla_{JX}X.$

Replacing $X$ by $JX$ hence yields (B.16) and proves the lemma.
We can add (B.16) and (B.18), obtaining

(B.19)  $II(X, X) + II(JX, JX) = P_NJ\bigl[\nabla_{JX}X - \nabla_X(JX)\bigr] = P_NJ[JX, X].$

Comparing this with (B.12) and using the notation $II(X, Y) = \widetilde{II}(X, Y)N$, as
in (4.15) of Appendix C, we see that

(B.20)  $4\sum_{j,k} L_{jk}u_j\overline{u}_k = \widetilde{II}(X, X) + \widetilde{II}(JX, JX).$

This can also be obtained from (B.13), plus formula (4.25) of Appendix C.

We will consider one more formula for the Levi form, in terms of the geometry
of $\mathfrak{H}(\partial\Omega)$ as a subbundle of the trivial bundle $\partial\Omega \times \mathbb{R}^{2n} \approx \partial\Omega \times \mathbb{C}^n$. Associated
to this subbundle there is a second fundamental form $II_{\mathfrak{H}}$, defined as in (4.40) of
Appendix C. A formula for $II_{\mathfrak{H}}$ can be given as follows. Let $\mathfrak{K}(\partial\Omega)$ denote the
orthogonal complement of $\mathfrak{H}(\partial\Omega)$; this can be viewed as a real vector bundle of
rank 2, generated by $N$ and $JN$, or as a complex line bundle generated by $N$. If
$P_{\mathfrak{K}}$ denotes the orthogonal projection of $\mathbb{R}^{2n}$ onto $\mathfrak{K}$, then we have


(B.21) $II_{\mathfrak{H}}(X,Y) = P_{\mathfrak{K}} D_X Y,$

when $X$ and $Y$ are sections of $\mathfrak{H}(\partial\Omega)$.

We want to relate $II_{\mathfrak{H}}$ to the Levi form. It is convenient to use the previous
analysis of $II$. Since $P_{\mathfrak{K}} = P_N + P_{JN}$, we have


$II_{\mathfrak{H}}(X,X) = II(X,X) + P_{JN} D_X X.$


As noted in the proof of (B.16), $II(JX,X) = P_N J D_X X$, which is equal to
$J P_{JN} D_X X$, so we have

(B.22) $II_{\mathfrak{H}}(X,X) = II(X,X) - J\, II(JX,X).$

Substituting $JX$ for $X$, we have

(B.23) $II_{\mathfrak{H}}(JX,JX) = II(JX,JX) + J\, II(JX,X),$

and adding this to (B.22) and using (B.20), we obtain


(B.24) $II_{\mathfrak{H}}(X,X) + II_{\mathfrak{H}}(JX,JX) = 4\Bigl(\sum_{j,k} \mathcal{L}_{jk} u_j \bar{u}_k\Bigr) N.$


As seen in §10, the notion of the Levi form extends naturally to a hypersurface
in a complex manifold. See [BHLN] for a neat general treatment of the Levi form 
in the broader context of CR-manifolds. 


C. The Neumann operator for the Dirichlet problem 


Let $M$ be a compact Riemannian manifold with boundary $\partial M = X$. Then $X$
has an induced Riemannian metric, and $X \subset M$ has a second fundamental form,
with associated Weingarten map


(C.1) $A_N : T_x X \to T_x X,$


defined as in §4 of Appendix C, Connections and Curvature. We take N to be the 
unit normal to X, pointing into M. 

Both $M$ and $X$ have Laplace operators, which we denote $\Delta$ and $\Delta_X$, respectively.
The Neumann operator $\mathcal{N}$ is an operator on $\mathcal{D}'(X)$ defined as follows:


(C.2) $\mathcal{N} f = \dfrac{\partial u}{\partial N}, \quad u = \mathrm{PI}\, f,$

where to say $u = \mathrm{PI}\, f$ is to say

(C.3) $\Delta u = 0$ on $M$, $\quad u\big|_{\partial M} = f.$


As shown in §§11 and 12 of Chap. 7, $\mathcal{N}$ is a negative-semidefinite, self-adjoint
operator, and also an elliptic operator in $OPS^1(X)$. It is fairly easy to see that


(C.4) $\mathcal{N} = -\sqrt{-\Delta_X}$ mod $OPS^0(X)$.


Our main purpose here is to capture the principal part of the difference. 


Proposition C.1. The Neumann operator $\mathcal{N}$ is given by

(C.5) $\mathcal{N} = -\sqrt{-\Delta_X} + B$ mod $OPS^{-1}(X)$,

where $B \in OPS^0(X)$ has principal symbol


(C.6) $\sigma_B(x,\xi) = \dfrac{1}{2}\Bigl(\operatorname{Tr} A_N - \dfrac{\langle A_N^* \xi, \xi\rangle}{\langle \xi, \xi\rangle}\Bigr).$


Here, $A_N^* : T_x^* X \to T_x^* X$ is the adjoint of (C.1), and $\langle\ ,\ \rangle$ is the inner product on
$T_x^* X$ arising from the given Riemannian metric.

To prove this, we choose coordinates $x = (x_1, \dots, x_{m-1})$ on an open set in $X$
(if $\dim M = m$) and then coordinates $(x,y)$ on a neighborhood in $M$ such that
$y = 0$ on $X$ and $|\nabla y| = 1$ near $X$ while $y > 0$ on $M$ and such that $x$ is constant
on each geodesic segment in $M$ normal to $X$. Then the metric tensor on $M$ has
the form


(C.7) $\bigl(g_{jk}(x,y)\bigr) = \begin{pmatrix} h_{jk}(x,y) & 0 \\ 0 & 1 \end{pmatrix},$


where, in the first matrix, $1 \le j,k \le m$, and in the second, $1 \le j,k \le m-1$.
Thus the Laplace operator $\Delta$ on $M$ is given in local coordinates by


(C.8) $\Delta u = g^{-1/2}\,\partial_j\bigl(g^{1/2} g^{jk}\,\partial_k u\bigr)$
$\quad = h^{-1/2}\,\partial_y\bigl(h^{1/2}\,\partial_y u\bigr) + h^{-1/2}\,\partial_j\bigl(h^{1/2} h^{jk}\,\partial_k u\bigr)$
$\quad = \partial_y^2 u + \dfrac{h_y}{2h}\,\partial_y u + L(y, x, D_x) u,$

where, as usual,


(C.9) $g = \det(g_{jk}), \quad h = \det(h_{jk});$


we set $h_y = \partial h/\partial y$, and $L(y) = L(y, x, D_x)$ is a family of Laplace operators on
$X$, associated to the family of metrics $(h_{jk}(y))$ on $X$, so $L(0) = \Delta_X$. In other
words,

(C.10) $\Delta u = \partial_y^2 u + a(y)\,\partial_y u + L(y) u, \quad a(y) = \dfrac{h_y}{2h}.$


We will construct smooth families of operators $A_j(y) \in OPS^1(X)$ such that


(C.11) $\partial_y^2 + a(y)\,\partial_y + L(y) = \bigl(\partial_y - A_1(y)\bigr)\bigl(\partial_y + A_2(y)\bigr),$


modulo a smoothing operator. The principal parts of $A_1(y)$ and $A_2(y)$ will be
$\sqrt{-L(y)}$. It will follow that


(C.12) $\mathcal{N} = -A_2(0)$ mod $OPS^{-\infty}(X)$,


and we can then read off (C.5)-(C.6).
To construct $A_j(y)$, we compute that the right side of (C.11) is equal to


(C.13) $\partial_y^2 - A_1(y)\,\partial_y + A_2(y)\,\partial_y + A_2'(y) - A_1(y) A_2(y),$


so we need


(C.14) $A_2(y) - A_1(y) = a(y), \quad -A_1(y) A_2(y) + A_2'(y) = L(y).$


Substituting $A_2(y) = A_1(y) + a(y)$ into the second identity, we get an equation
for $A_1(y)$:


(C.15) $A_1(y)^2 + A_1(y) a(y) - A_1'(y) = -L(y) + a'(y).$

Now set

(C.16) $A_1(y) = \Lambda(y) + B(y), \quad \Lambda(y) = \sqrt{-L(y)}.$

Since $\Lambda(y)^2 = -L(y)$, substituting (C.16) into (C.15) gives an equation for $B(y)$:

(C.17) $\Lambda B + B \Lambda + B^2 + (\Lambda + B)\,a - \Lambda' - B' = a'.$


Granted that $B(y)$ is a smooth family in $OPS^0(X)$, the principal part $B_0(y)$ must
satisfy $2 B_0(y) \Lambda(y) = \Lambda'(y) - a(y) \Lambda(y)$, or


(C.18) $B_0(y) = \dfrac{1}{2} \Lambda'(y) \Lambda(y)^{-1} - \dfrac{1}{2} a(y)$ mod $OPS^{-1}(X)$.


We can inductively obtain further terms $B_j(y) \in OPS^{-j}(X)$ and establish that,
with $B(y) \sim \sum_{j \ge 0} B_j(y)$, the operators


$A_1(y) = \sqrt{-L(y)} + B(y), \quad A_2(y) = \sqrt{-L(y)} + B(y) + a(y)$


do yield (C.11) modulo a smoothing operator. Details are similar to those arising
in the decoupling procedure in §12 of Chap. 7.
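Given this, we have (C.5) with


(C.19) $-B = B_0(0) + a(0) = \dfrac{1}{2} \Lambda'(0) \Lambda(0)^{-1} + \dfrac{1}{2} a(0)$ mod $OPS^{-1}(X)$.


In turn, since $\Lambda(y) = \sqrt{-L(y)}$, we have

(C.20) $\Lambda'(0) \Lambda(0)^{-1} = \dfrac{1}{2} L'(0) L(0)^{-1}$ mod $OPS^{-1}(X)$.

Hence

(C.21) $B = -\dfrac{1}{4}\Bigl(L'(0) L(0)^{-1} + \dfrac{h_y}{h}(0,x)\Bigr)$ mod $OPS^{-1}(X)$.


To compute the symbol of $B$, note that


(C.22) $\sigma_{L'(0)}(x,\xi) = -\sum \partial_y h^{jk}(0,x)\, \xi_j \xi_k,$

while, of course, $\sigma_{L(0)}(x,\xi) = -\sum h^{jk}(0,x)\, \xi_j \xi_k = -\langle \xi, \xi\rangle$. Now, by (4.68)-(4.70)
of Appendix C, Connections and Curvature, we have

(C.23) $\sum \partial_y h_{jk}(0,x)\, U_j V_k = -2 \langle A_N U, V\rangle,$

so

(C.24) $\sum \partial_y h^{jk}(0,x)\, \xi_j \xi_k = 2 \langle A_N^* \xi, \xi\rangle.$

Thus,

(C.25) $\sigma_{L'(0) L(0)^{-1}}(x,\xi) = 2\, \dfrac{\langle A_N^* \xi, \xi\rangle}{\langle \xi, \xi\rangle}.$


Next, for $h = \operatorname{Det}(h_{jk}) = \operatorname{Det} H$, we have $h_y = h \operatorname{Tr}(H^{-1} H_y)$. Looking in a
normal coordinate system on $X$, centered at $x_0$, we have


(C.26) $\dfrac{h_y}{h}(0, x_0) = \sum_j \partial_y h_{jj}(0, x_0) = -2 \operatorname{Tr} A_N,$


the last identity by (C.23). Combining (C.25) and (C.26) yields the desired formula (C.6).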
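The determinant identity $h_y = h \operatorname{Tr}(H^{-1} H_y)$ used above (Jacobi's formula) can be illustrated numerically. The sketch below is only an illustration: the $2 \times 2$ family $H(y)$ is an arbitrary choice made here, not a metric taken from the text, and the function names are hypothetical.

```python
# Check h_y = h * Tr(H^{-1} H_y) for a sample 2x2 family H(y).
# Assumption: H(y) below is an arbitrary smooth symmetric family chosen only
# for illustration; it is positive definite for small |y|.

def H(y):
    return [[1.0 + y, y * y], [y * y, 2.0 + 3.0 * y]]

def H_y(y):
    """Entrywise derivative of H with respect to y."""
    return [[1.0, 2.0 * y], [2.0 * y, 3.0]]

def det2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def trace_inverse_times(m, dm):
    """Tr(m^{-1} dm) for 2x2 matrices, using the explicit inverse of m."""
    d = det2(m)
    inv = [[m[1][1] / d, -m[0][1] / d], [-m[1][0] / d, m[0][0] / d]]
    return sum(inv[i][k] * dm[k][i] for i in range(2) for k in range(2))

def jacobi_gap(y, step=1e-6):
    """|central-difference derivative of det H - det H * Tr(H^{-1} H_y)| at y."""
    numeric = (det2(H(y + step)) - det2(H(y - step))) / (2.0 * step)
    exact = det2(H(y)) * trace_inverse_times(H(y), H_y(y))
    return abs(numeric - exact)
```

The gap should be at the level of the finite-difference error, for instance when evaluated as `jacobi_gap(0.3)`.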
The following alternative way of writing (C.6) is useful. We have


(C.27) $\sigma_B(x,\xi) = \dfrac{1}{2} \operatorname{Tr}(A_N^* P_\xi),$


where, for nonzero $\xi \in T_x^* X$, $P_\xi$ is the orthogonal projection of $T_x^* X$ onto the
orthogonal complement of the linear span of $\xi$. Another equivalent formula is


(C.28) $\sigma_B(x,\xi) = \dfrac{1}{2} \operatorname{Tr}(A_N \widetilde{P}_\xi),$


where $\widetilde{P}_\xi$ is the orthogonal projection of $T_x X$ onto the subspace annihilated by $\xi$.
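The agreement between the quotient form $\frac{1}{2}\bigl(\operatorname{Tr} A - \langle A\xi, \xi\rangle / \langle \xi, \xi\rangle\bigr)$ of the symbol and the trace form $\frac{1}{2}\operatorname{Tr}(A P)$, with $P$ the orthogonal projection onto the complement of the span of $\xi$, can be checked in a few lines. This is a minimal sketch, assuming an orthonormal basis at a fixed point so that the Weingarten map is a matrix `A` and its adjoint is the transpose; the function names are hypothetical.

```python
# Compare two expressions for the principal symbol at a fixed point.
# Assumption: we work in an orthonormal basis, so the inner product is the
# Euclidean one and the adjoint of A is its transpose.

def symbol_quotient_form(A, xi):
    """(1/2) * (Tr A - <A xi, xi> / <xi, xi>)."""
    n = len(xi)
    trace = sum(A[i][i] for i in range(n))
    quad = sum(xi[i] * A[i][j] * xi[j] for i in range(n) for j in range(n))
    norm2 = sum(c * c for c in xi)
    return 0.5 * (trace - quad / norm2)

def symbol_trace_form(A, xi):
    """(1/2) * Tr(A P), P = projection onto the orthogonal complement of xi."""
    n = len(xi)
    norm2 = sum(c * c for c in xi)
    P = [[(1.0 if i == j else 0.0) - xi[i] * xi[j] / norm2 for j in range(n)]
         for i in range(n)]
    return 0.5 * sum(A[i][k] * P[k][i] for i in range(n) for k in range(n))
```

For `A` equal to the identity on a space of dimension $m - 1$, both expressions reduce to $(m-2)/2$, independent of $\xi$.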


To close, we mention the special case where $M$ is the closed unit ball in $\mathbb{R}^m$,
so $\partial M = S^{m-1}$. It follows from (4.5)-(4.6) of Chap. 8 that


(C.29) $\mathcal{N} = -\sqrt{-\Delta_X + c_m^2} + c_m, \quad c_m = \dfrac{m-2}{2},$


in this case. Note that, in this case, $A_N = I$, so this formula is consistent with
(C.5)-(C.6).
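The unit ball case can be checked on spherical harmonics. The sketch below assumes two standard facts not derived in this appendix: a spherical harmonic of degree $k$ on $S^{m-1}$ is an eigenfunction of $-\Delta_X$ with eigenvalue $k(k+m-2)$, and its harmonic extension to the ball is $r^k$ times the harmonic, whose derivative in the inward normal direction at $r = 1$ is $-k$ times it. It reads the ball formula as $\mathcal{N} = -\sqrt{-\Delta_X + c_m^2} + c_m$ with $c_m = (m-2)/2$; the function name is hypothetical.

```python
import math

def ball_neumann_eigenvalue(k, m):
    """Value of -sqrt(lambda_k + c_m^2) + c_m on degree-k spherical harmonics.

    lambda_k = k (k + m - 2) is the eigenvalue of -Delta on S^{m-1}
    (a standard fact, assumed here), and c_m = (m - 2) / 2.
    """
    c = (m - 2) / 2.0
    lam = k * (k + m - 2)
    return -math.sqrt(lam + c * c) + c
```

Since $k(k+m-2) + c_m^2 = (k + c_m)^2$, the function returns $-k$, matching the inward normal derivative of $r^k$ at $r = 1$.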

We mention that calculations of the symbol of $\mathcal{N}$ in a similar spirit (but for a
different purpose) were done in [LU]. Another approach was taken in [CNS].


References 


[AHLM] A. Andreotti, C. D. Hill, S. Lojasiewicz, and B. MacKichan, Complexes of 
differential operators, The Mayer—Vietoris sequence, Invent. Math. 35(1976), 
43-86. 

[AV] A. Andreotti and E. Vesentini, Carleman estimates for the Laplace—Beltrami 
equation on complex manifolds, Publ. Math. Inst. Hautes Etudes Sci. 25(1965), 
81-130. 

[BDT] P. Baum, R. Douglas, and M. Taylor, Cycles and relative cycles in analytic 
K-homology, J. Differential Geom. 30(1989), 761-804. 

[BvE] P. Baum and E. van Erp, The index theorem for Toeplitz operators as a corollary
of Bott periodicity, Q. J. Math. 72 (2021), 547-569.

[BFG] M. Beals, C. Fefferman, and R. Grossman, Strictly pseudoconvex domains in
$\mathbb{C}^n$, Bull. AMS 8(1983), 125-322.

[BG] R. Beals and P. Greiner, Calculus on Heisenberg Manifolds, Princeton 
University Press, Princeton, NJ, 1988. 

[BGS] R. Beals, P. Greiner, and N. Stanton, $L^p$ and Lipschitz estimates for the
$\overline{\partial}$-equation and the $\overline{\partial}$-Neumann problem, Math. Ann. 277(1987), 185-196.

[BS] R. Beals and N. Stanton, Estimates on kernels for the $\overline{\partial}$-equation and the
$\overline{\partial}$-Neumann problem, Math. Ann. 289(1991), 73-83.

[BL] S. Bell and E. Ligocka, A simplification and extension of Fefferman’s theorem 
on holomorphic mappings, Invent. Math. 57(1980), 283-289. 

[Bog] A. Boggess, CR Manifolds and the Tangential Cauchy—Riemann Complex, CRC, 
Boca Raton, FL, 1991. 

[B1] L. Boutet de Monvel, Hypoelliptic operators with double characteristics and 
related pseudodifferential operators, CPAM 27(1974), 585-639. 

[B2] L. Boutet de Monvel, On the index of Toeplitz operators in several complex 
variables, Invent. Math. 50(1979), 249-272. 

[BGu] L. Boutet de Monvel and V. Guillemin, The Spectral Theory of Toeplitz Opera- 
tors, Princeton University Press, Princeton, NJ, 1981. 

[BSj] L. Boutet de Monvel and J. Sjöstrand, Sur la singularité des noyaux de Bergman
et de Szegö, Astérisque 34-36(1976), 123-164.


[BT] L. Boutet de Monvel and F. Treves, On a class of pseudodifferential operators
with double characteristics, CPAM 27(1974), 59-89.

[BHLN] J. Brinkschulte, C. D. Hill, J. Leiterer, and M. Nacinovich, Aspects of the Levi
form, Boll. Unione Mat. Ital. 13 (2020), 71-89.

[BDF] L. Brown, R. Douglas, and P. Fillmore, Extensions of $C^*$-algebras and
K-homology, Ann. Math. 105(1977), 265-324.

[Cat] D. Catlin, Subelliptic estimates for the $\overline{\partial}$-Neumann problem on pseudoconvex
domains, Ann. Math. 126(1987), 131-191.

[Cha] D. Chang, On $L^p$ and Hölder estimates for the $\overline{\partial}$-Neumann problem on strongly
pseudoconvex domains, Math. Ann. 282(1988), 267-297.

[CNS] D. Chang, A. Nagel, and E. Stein, Estimates for the $\overline{\partial}$-Neumann problem in
pseudoconvex domains of finite type in $\mathbb{C}^2$, Acta Math. 169(1992), 153-228.

[Chr] M. Christ, Regularity properties of the $\overline{\partial}_b$-equation on weakly pseudoconvex CR
manifolds of dimension three, J. AMS 1(1988), 587-646.

[D] A. Dynin, An algebra of pseudodifferential operators on the Heisenberg group;
symbolic calculus, Soviet Math. Dokl. 17(1976), 508-512.

[EMM] C. Epstein, R. Melrose, and G. Mendoza, Resolvent of the Laplacian on strictly
pseudoconvex domains, Acta Math. 167(1991), 1-106.

[F] C. Fefferman, The Bergman kernel and biholomorphic mappings of pseudoconvex
domains, Invent. Math. 26(1974), 1-65.

[FeK] C. Fefferman and J. J. Kohn, Hölder estimates on domains of complex dimension
2 and on 3 dimensional CR manifolds, Advances Math. 69(1988), 223-303.

[FK] G. Folland and J. J. Kohn, The Neumann Problem for the Cauchy-Riemann
Complex, Princeton University Press, Princeton, NJ, 1972.

[FS] G. Folland and E. Stein, Estimates for the $\overline{\partial}_b$ complex and analysis on the
Heisenberg group, CPAM 27(1974), 429-522.

[GaS] P. Garabedian and D. Spencer, Complex boundary problems, Trans. AMS 73
(1952), 223-242.

[Gel] D. Geller, Analytic Pseudodifferential Operators on the Heisenberg Group and
Local Solvability, Princeton University Press, Princeton, NJ, 1990.

[GS] P. Greiner and E. Stein, Estimates for the $\overline{\partial}$-Neumann Problem, Princeton
University Press, Princeton, NJ, 1977.

[GR] R. Gunning and H. Rossi, Analytic Functions of Several Complex Variables,
Prentice-Hall, Englewood Cliffs, NJ, 1965.

[HP1] R. Harvey and J. Polking, Fundamental solutions in complex analysis, Duke
Math. J. 46(1979), 253-340.

[HP2] R. Harvey and J. Polking, The $\overline{\partial}$-Neumann kernel in the ball in $\mathbb{C}^n$, Proc. Symp.
Pure Math. 41(1984), 117-136.

[HN] C. D. Hill and M. Nacinovich, Pseudoconcave CR manifolds, Complex Analysis
and Geometry (Trento 1993), 275-297.

[HT] C. D. Hill and M. Taylor, Integrability of rough almost complex structures, J.
Geom. Anal. 13 (2003), 163-172.

[HT2] C. D. Hill and M. Taylor, The complex Frobenius theorem for rough involutive
structures, Trans. AMS 359 (2007), 293-322.

[Ho1] L. Hörmander, $L^2$ estimates and existence theorems for the $\overline{\partial}$ operator, Acta
Math. 113(1965), 89-152.

[Ho2] L. Hörmander, The Frobenius-Nirenberg theorem, Arkiv. Mat. 5(1965),
425-432.

[Ho3] L. Hörmander, Introduction to Complex Analysis in Several Variables, Van
Nostrand, Princeton, NJ, 1966.


[Ho4] L. Hörmander, Pseudo-differential operators and non-elliptic boundary problems,
Ann. Math. 83(1966), 129-209.

[Ker] N. Kerzman, Hölder and $L^p$ estimates for solutions to $\overline{\partial} u = f$ in strongly
pseudoconvex domains, CPAM 24(1971), 301-379.

[Ker2] N. Kerzman, The Bergman kernel function: Differentiability at the boundary,
Math. Annalen 195(1972), 149-158.

[KS] N. Kerzman and E. Stein, The Szegö kernel in terms of Cauchy-Fantappiè kernels,
Duke Math. J. 45(1978), 197-224.

[Kh] G. Khenkin, The method of integral representations in complex analysis,
pp. 19-116 in [Vit].

[K1] J. J. Kohn, Harmonic integrals on strictly pseudoconvex manifolds I, Ann. Math.
78(1963), 112-148; II, Ann. Math. 79(1964), 450-472.

[K2] J. J. Kohn, Subellipticity of the $\overline{\partial}$-Neumann problem on pseudoconvex domains;
sufficient conditions, Acta Math. 142(1979), 79-122.

[K3] J. J. Kohn, A survey of the $\overline{\partial}$-Neumann problem, Proc. Symp. Pure Math.
41(1984), 137-145.

[KN] J. J. Kohn and L. Nirenberg, Non-coercive boundary problems, CPAM 18(1965),
443-492.

[KN2] J. J. Kohn and L. Nirenberg, A pseudo-convex domain not admitting a holomorphic
support function, Math. Ann. 201(1973), 265-268.

[KR] J. J. Kohn and H. Rossi, On the extension of holomorphic functions from the
boundary of a complex manifold, Ann. Math. 81(1965), 451-472.

[Kr1] S. Krantz, Function Theory of Several Complex Variables, Wadsworth, Belmont,
CA, 1992.

[Kr2] S. Krantz, Partial Differential Equations and Complex Analysis, CRC, Boca
Raton, FL, 1992.

[LM] C. Lebrun and L. Mason, Zoll manifolds and complex surfaces, J. Diff. Geom.
61 (2002), 453-535.

[LU] J. Lee and G. Uhlmann, Determining anisotropic real-analytic conductivities by
boundary measurements, CPAM 42(1989), 1097-1112.

[Lel] P. Lelong, Plurisubharmonic Functions and Positive Differential Forms, Gordon
and Breach, New York, 1969.

[LR] I. Lieb and R. Range, Integral representations and estimates in the theory of the
$\overline{\partial}$-Neumann problem, Ann. Math. 123(1986), 265-301.

[Mal] B. Malgrange, Sur l'intégrabilité des structures presque-complexes, Symp.
Math. II (INDAM, Rome, 1968), 289-296.

[MMT] I. Mitrea, M. Mitrea, and M. Taylor, Cauchy integrals, Calderon projectors, and
Toeplitz operators on uniformly rectifiable domains, Advances in Math. 268
(2015), 666-757.

[Mor] C. B. Morrey, Multiple Integrals in the Calculus of Variations, Springer,
New York, 1966.

[NRSW] A. Nagel, J. Rosay, E. Stein, and S. Wainger, Estimates for the Bergman and
Szegö kernels in $\mathbb{C}^2$, Ann. Math. 129(1989), 113-149.

[NeNi] A. Newlander and L. Nirenberg, Complex analytic coordinates in almost-complex
manifolds, Ann. Math. 65(1957), 391-404.

[NW] A. Nijenhuis and W. Woolf, Some integration problems in almost-complex
manifolds, Ann. of Math. 77 (1963), 424-489.

[Ni] L. Nirenberg, Lectures on Linear Partial Differential Equations, Reg. Conf. Ser.
in Math., no. 17, AMS, 1972.


[PS] D. Phong and E. Stein, Hilbert integrals, singular integrals, and Radon transforms,
II, Invent. Math. 86(1986), 75-113.

[Pon] R. Ponge, Heisenberg calculus and spectral theory of hypoelliptic operators on
Heisenberg manifolds, Mem. AMS #906 (2008).

[RS] L. Rothschild and E. Stein, Hypoelliptic differential operators and nilpotent
groups, Acta Math. 137(1976), 247-320.

[Sj] J. Sjöstrand, Parametrices for pseudodifferential operators with multiple
characteristics, Ark. Math. 2(1974), 85-130.

[Sm] H. Smith, A calculus for three-dimensional CR manifolds of finite type, J. Func.
Anal. 120(1994), 135-162.

[Sp] D. Spencer, Overdetermined systems of linear partial differential equations, Bull.
AMS 75(1969), 179-239.

[St] E. Stein, Harmonic Analysis, Princeton University Press, Princeton, NJ, 1993.

[Sw] W. Sweeney, The D-Neumann problem, Acta Math. 120(1968), 224-277.

[Tai] G. Taiani, Cauchy-Riemann (CR) Manifolds, Pace University, New York, 1989.

[Tar] D. Tartakoff, The local real analyticity of solutions to $\Box_b$ and the $\overline{\partial}$-Neumann
problem, Acta Math. 145(1980), 177-204.

[Tay] M. Taylor, Noncommutative Microlocal Analysis I, Mem. AMS #313 (1984).

[Tay2] M. Taylor, Microlocal Weyl formula on contact manifolds, Comm. PDE 45
(2020), 392-413.

[Tr1] F. Treves, Analytic hypo-ellipticity of a class of pseudodifferential operators
with double characteristics and applications to the $\overline{\partial}$-Neumann problem, Comm.
PDE 3(1978), 475-642.

[Tr2] F. Treves, Introduction to Pseudodifferential and Fourier Integral Operators,
Vols. 1, 2, Plenum, New York, 1980.

[Ven] U. Venugopalkrishna, Fredholm operators associated with strongly pseudoconvex
domains, J. Func. Anal. 9(1972), 349-373.

[Vit] A. Vitushkin (ed.), Several Complex Variables I, Encycl. of Math. Sci., Vol. 7,
Springer, New York, 1990.


C 


Connections and Curvature 


Introduction 


In this appendix we present results in differential geometry that serve as a 
useful background for material in the main body of the book. Material in §1 on 
connections is somewhat parallel to the study of the natural connection on a Riemannian
manifold made in §11 of Chap. 1, but here we consider connections on
general vector bundles, and also study the curvature of a connection. Material in 
§2 on second covariant derivatives is connected with material in Chap. 2 on the 
Laplace operator. Ideas developed in §§3 and 4, on the curvature of Riemannian 
manifolds and submanifolds, make contact with such material as the existence 
of complex structures on two-dimensional Riemannian manifolds, established in 
Chap. 5, and the uniformization theorem for compact Riemann surfaces and other 
problems involving nonlinear, elliptic PDE, arising from studies of curvature, 
treated in Chap. 14. Section 5 on the Gauss—Bonnet theorem is useful both for 
estimates related to the proof of the uniformization theorem and for applications 
to the Riemann—Roch theorem in Chap. 10. Furthermore, it serves as a transition 
to more advanced material presented in §§6-8. 

In §6 we discuss how constructions involving vector bundles can be derived
from constructions on a principal bundle. In the case of ordinary vector fields, 
tensor fields, and differential forms, one can largely avoid this, but it is a very 
convenient tool for understanding spinors. The principal bundle picture is used to 
construct characteristic classes in §7. The material in these two sections is needed 
in Chap. 10, on the index theory for elliptic operators of Dirac type. In §8 we show 
how one particular characteristic class, arising from the Pfaffian, figures into the 
higher-dimensional version of the Gauss—Bonnet formula. The proof given here 
is geometrical and uses the elements of Morse theory. In Chap. 10 this result is 
derived as a special case of the Atiyah—Singer index formula. 

Section 9 provides an introduction to an important class of complex manifolds 
known as Kähler manifolds, and notes special properties of their Riemann and
Ricci curvature. 


© The Editor(s) (if applicable) and The Author(s), under exclusive license
to Springer Nature Switzerland AG 2023

M. E. Taylor, Partial Differential Equations I, Applied Mathematical Sciences 116,
https://doi.org/10.1007/978-3-031-33700-0




1. Covariant derivatives and curvature on general vector 
bundles 


Let $E \to M$ be a vector bundle, either real or complex. A covariant derivative,
or connection, on $E$ is a map


(1.1) $\nabla_X : C^\infty(M,E) \to C^\infty(M,E)$


assigned to each vector field $X$ on $M$, satisfying the following three conditions:


(1.2) $\nabla_X(u+v) = \nabla_X u + \nabla_X v,$
(1.3) $\nabla_{(fX+Y)} u = f \nabla_X u + \nabla_Y u,$
(1.4) $\nabla_X(fu) = f \nabla_X u + (Xf) u,$


where $u, v$ are sections of $E$, and $f$ is a smooth scalar function. The examples
contained in Chaps. 1 and 2 are the Levi-Civita connection on a Riemannian manifold,
in which case $E$ is the tangent bundle, and associated connections on tensor
bundles, discussed in §2.2.

One general construction of connections is the following. Let $F$ be a vector
space, with an inner product; we have the trivial bundle $M \times F$. Let $E$ be a subbundle
of this trivial bundle; for each $x \in M$, let $P_x$ be the orthogonal projection
of $F$ on $E_x \subset F$. Any $u \in C^\infty(M,E)$ can be regarded as a function from $M$ to
$F$, and for a vector field $X$, we can apply $X$ componentwise to any function on $M$
with values in $F$; call this action $u \mapsto D_X u$. Then a connection on $E$ is given by


(1.5) $\nabla_X u(x) = P_x D_X u(x).$


If $M$ is imbedded in a Euclidean space $\mathbb{R}^N$, then $T_x M$ is naturally identified
with a linear subspace of $\mathbb{R}^N$ for each $x \in M$. In this case it is easy to verify
that the connection defined by (1.5) coincides with the Levi-Civita connection,
where $M$ is given the metric induced from its imbedding in $\mathbb{R}^N$. Compare with
the discussion of submanifolds in §4 below.

Generally, a connection defines the notion of “parallel transport” along a curve
$\gamma$ in $M$. A section $u$ of $E$ over $\gamma$ is obtained from $u(\gamma(t_0))$ by parallel transport
if it satisfies $\nabla_T u = 0$ on $\gamma$, where $T = \dot{\gamma}(t)$.

Formulas for covariant derivatives, involving indices, are produced in terms of
a choice of “local frame” for $E$, that is, a set $e_\alpha$, $1 \le \alpha \le K$, of sections of $E$
over an open set $U$ which forms a basis of $E_x$ for each $x \in U$; $K = \dim E_x$.
Given such a local frame, a smooth section $u$ of $E$ over $U$ is specified by


(1.6) $u = u^\alpha e_\alpha$ (summation convention).

If $D_j = \partial/\partial x_j$ in a coordinate system on $U$, we set


(1.7) $\nabla_{D_j} u = u^\alpha{}_{;j}\, e_\alpha = (\partial_j u^\alpha + u^\beta \Gamma^\alpha{}_{\beta j})\, e_\alpha,$

the connection coefficients $\Gamma^\alpha{}_{\beta j}$ being defined by

(1.8) $\nabla_{D_j} e_\beta = \Gamma^\alpha{}_{\beta j}\, e_\alpha.$


A vector bundle $E \to M$ may have an inner product on its fibers. In that case,
a connection on $E$ is called a metric connection provided that


(1.9) $X \langle u, v\rangle = \langle \nabla_X u, v\rangle + \langle u, \nabla_X v\rangle,$


for any vector field $X$ and smooth sections $u, v$ of $E$.
The curvature of a connection is defined by


(1.10) $R(X,Y) u = [\nabla_X, \nabla_Y] u - \nabla_{[X,Y]} u,$


where $X$ and $Y$ are vector fields and $u$ is a section of $E$. It is easy to verify that
(1.10) is linear in $X$, $Y$, and $u$, over $C^\infty(M)$. With respect to local coordinates,
giving $D_j = \partial/\partial x_j$, and a local frame $\{e_\alpha\}$ on $E$, as in (1.6), we define the
components $R^\alpha{}_{\beta jk}$ of the curvature by


(1.11) $R(D_j, D_k) e_\beta = R^\alpha{}_{\beta jk}\, e_\alpha,$

as usual, using the summation convention. Since $D_j$ and $D_k$ commute,
$R(D_j, D_k) e_\beta = [\nabla_{D_j}, \nabla_{D_k}] e_\beta$. Applying the formulas (1.7) and (1.8), we
can express the components of $R$ in terms of the connection coefficients. The
formula is seen to be


(1.12) $R^\alpha{}_{\beta jk} = \partial_j \Gamma^\alpha{}_{\beta k} - \partial_k \Gamma^\alpha{}_{\beta j} + \Gamma^\alpha{}_{\gamma j} \Gamma^\gamma{}_{\beta k} - \Gamma^\alpha{}_{\gamma k} \Gamma^\gamma{}_{\beta j}.$

The formula (1.12) can be written in a shorter form, as follows. Given a choice
of local frame $\{e_\alpha : 1 \le \alpha \le K\}$, we can define $K \times K$ matrices $\Gamma_j = (\Gamma^\alpha{}_{\beta j})$
and also $R_{jk} = (R^\alpha{}_{\beta jk})$. Then (1.12) is equivalent to

(1.13) $R_{jk} = \partial_j \Gamma_k - \partial_k \Gamma_j + [\Gamma_j, \Gamma_k].$
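The matrix form $R_{jk} = \partial_j \Gamma_k - \partial_k \Gamma_j + [\Gamma_j, \Gamma_k]$ is easy to evaluate numerically. The following is a minimal sketch on a trivial rank 2 bundle over $\mathbb{R}^2$; the connection matrices `example_gamma` are an arbitrary choice made here for illustration, indices are 0-based in the code, and derivatives are approximated by central differences.

```python
# Evaluate R_jk = d_j Gamma_k - d_k Gamma_j + [Gamma_j, Gamma_k] numerically.
# Assumption: example_gamma is an illustrative connection, not one from the text.

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def curvature(gamma, x, j, k, step=1e-5):
    """Curvature matrix R_jk at the point x.

    gamma(i, x) must return the matrix Gamma_i at x as a list of rows.
    """
    def partial(i, l):
        # Central-difference approximation of d/dx_i of Gamma_l at x.
        xp, xm = list(x), list(x)
        xp[i] += step
        xm[i] -= step
        gp, gm = gamma(l, xp), gamma(l, xm)
        return [[(p - q) / (2.0 * step) for p, q in zip(rp, rq)]
                for rp, rq in zip(gp, gm)]

    gj, gk = gamma(j, x), gamma(k, x)
    gjgk, gkgj = matmul(gj, gk), matmul(gk, gj)
    dj_gk, dk_gj = partial(j, k), partial(k, j)
    n = len(gj)
    return [[dj_gk[a][b] - dk_gj[a][b] + gjgk[a][b] - gkgj[a][b]
             for b in range(n)] for a in range(n)]

A = [[0.0, 1.0], [0.0, 0.0]]
B = [[0.0, 0.0], [1.0, 0.0]]

def example_gamma(i, x):
    """Gamma_0 = x[1] * A and Gamma_1 = x[0] * B."""
    if i == 0:
        return [[x[1] * a for a in row] for row in A]
    return [[x[0] * b for b in row] for row in B]
```

For this choice, $R_{01} = B - A + x_0 x_1 [A, B]$, so at the point $(2, 3)$ the matrix has rows $(6, -1)$ and $(1, -6)$, and `curvature(example_gamma, x, 1, 0)` is its negative, in line with the antisymmetry in $j$ and $k$.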


Note that $R_{jk}$ is antisymmetric in $j$ and $k$. Now we can define a “connection
1-form” $\Gamma$ and a “curvature 2-form” $\Omega$ by


(1.14) $\Gamma = \sum_j \Gamma_j\, dx_j, \quad \Omega = \dfrac{1}{2} \sum_{j,k} R_{jk}\, dx_j \wedge dx_k.$


Then the formula (1.12) is equivalent to


(1.15) $\Omega = d\Gamma + \Gamma \wedge \Gamma.$


The curvature has symmetries, which we record here, for the case of general
vector bundles. The Riemann curvature tensor, associated with the Levi-Civita
connection, has additional symmetries, which will be described in §3.


Proposition 1.1. For any connection $\nabla$ on $E \to M$, we have

(1.16) $R(X,Y) u = -R(Y,X) u.$

If $\nabla$ is a metric connection, then

(1.17) $\langle R(X,Y) u, v\rangle = -\langle u, R(X,Y) v\rangle.$


Proof. Equation (1.16) is obvious from the definition (1.10); this is equivalent to
the antisymmetry of $R^\alpha{}_{\beta jk}$ in $j$ and $k$ noted above. If $\nabla$ is a metric connection,
we can use (1.9) to deduce


$0 = (XY - YX - [X,Y]) \langle u, v\rangle$
$\quad = \langle R(X,Y) u, v\rangle + \langle u, R(X,Y) v\rangle,$


which gives (1.17).


Next we record the following implication of a connection having zero curvature.
A section $u$ of $E$ is said to be “parallel” if $\nabla_X u = 0$ for all vector fields $X$.


Proposition 1.2. If $E \to M$ has a connection $\nabla$ whose curvature is zero, then
any $p \in M$ has a neighborhood $U$ on which there is a frame $\{e_\alpha\}$ for $E$ consisting
of parallel sections: $\nabla_X e_\alpha = 0$ for all $X$.


Proof. If $U$ is a coordinate neighborhood, then $e_\alpha$ is parallel provided $\nabla_{D_j} e_\alpha = 0$
for $j = 1, \dots, n = \dim M$. The condition that $R = 0$ is equivalent to the
condition that the operators $\nabla_{D_j}$ all commute with each other, for $1 \le j \le n$.
Consequently, Frobenius's theorem (as expanded in Exercise 5 in §9 of Chap. 1)
allows us to solve the system of equations


(1.18) $\nabla_{D_j} e_\alpha = 0, \quad j = 1, \dots, n,$


on a neighborhood of $p$, with $e_\alpha$ prescribed at the point $p$. If we pick $e_\alpha(p)$, $1 \le
\alpha \le K$, to be a basis of $E_p$, then $e_\alpha(x)$, $1 \le \alpha \le K$, will be linearly independent
in $E_x$ for $x$ close to $p$, so the local frame of parallel sections is constructed.


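It is useful to note, in general, several formulas that result from choosing a
local frame $\{e_\alpha\}$ by parallel translation along rays through a point $p \in M$, the
origin in some coordinate system $(x_1, \dots, x_n)$, so


(1.19) $\nabla_{r \partial/\partial r}\, e_\alpha = 0, \quad 1 \le \alpha \le K.$


This means $\sum x_j \nabla_{D_j} e_\alpha = 0$. Consequently, the connection coefficients (1.8)
satisfy

(1.20) $x_1 \Gamma^\alpha{}_{\beta 1} + \cdots + x_n \Gamma^\alpha{}_{\beta n} = 0.$


Differentiation with respect to $x_j$ gives

(1.21) $\Gamma^\alpha{}_{\beta j} = -x_1 \partial_j \Gamma^\alpha{}_{\beta 1} - \cdots - x_n \partial_j \Gamma^\alpha{}_{\beta n}.$

In particular,

(1.22) $\Gamma^\alpha{}_{\beta j}(p) = 0.$

Comparison of (1.21) with

(1.23) $\Gamma^\alpha{}_{\beta j} = x_1 \partial_1 \Gamma^\alpha{}_{\beta j}(p) + \cdots + x_n \partial_n \Gamma^\alpha{}_{\beta j}(p) + O(|x|^2)$

gives

(1.24) $\partial_k \Gamma^\alpha{}_{\beta j} = -\partial_j \Gamma^\alpha{}_{\beta k}$, at $p$.

Consequently, the formula (1.12) for curvature becomes

(1.25) $R^\alpha{}_{\beta jk} = 2\, \partial_j \Gamma^\alpha{}_{\beta k}$, at $p$,

with respect to such a local frame. Note that, near $p$,

(1.26) $R^\alpha{}_{\beta jk} = \partial_j \Gamma^\alpha{}_{\beta k} - \partial_k \Gamma^\alpha{}_{\beta j} + O(|x|^2).$

Given vector bundles $E_j \to M$ with connections $\nabla^j$, there is a natural covariant
derivative on the tensor-product bundle $E_1 \otimes E_2 \to M$, defined by the
derivation property

(1.27) $\nabla_X(u \otimes v) = (\nabla^1_X u) \otimes v + u \otimes (\nabla^2_X v).$

Also, if $A$ is a section of $\mathrm{Hom}(E_1, E_2)$, the formula

(1.28) $(\nabla^{\#}_X A) u = \nabla^2_X(Au) - A(\nabla^1_X u)$

defines a connection on $\mathrm{Hom}(E_1, E_2)$.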

Regarding the curvature tensor $R$ as a section of $(\otimes^2 T^*) \otimes \mathrm{End}(E)$ is natural
in view of the linearity properties of $R$ given after (1.10). Thus if $E \to M$ has
a connection with curvature $R$, and if $M$ also has a Riemannian metric, yielding
a connection on $T^* M$, then we can consider $\nabla_X R$. The following, known as
Bianchi's identity, is an important result involving the covariant derivative of $R$.


Proposition 1.3. For any connection on $E \to M$, the curvature satisfies

(1.29) $(\nabla_Z R)(X,Y) + (\nabla_X R)(Y,Z) + (\nabla_Y R)(Z,X) = 0,$

or equivalently

(1.30) $R^\alpha{}_{\beta ij;k} + R^\alpha{}_{\beta jk;i} + R^\alpha{}_{\beta ki;j} = 0.$


Proof. Pick any $p \in M$. Choose normal coordinates centered at $p$, and choose
a local frame field for $E$ by radial parallel translation, as above. Then, by (1.22)
and (1.26),


(1.31) $R^\alpha{}_{\beta ij;k} = \partial_k \partial_i \Gamma^\alpha{}_{\beta j} - \partial_k \partial_j \Gamma^\alpha{}_{\beta i}$, at $p$.


Cyclically permuting $(i,j,k)$ here and summing clearly give 0, proving the
proposition.


Note that we can regard a connection on $E$ as defining an operator

(1.32) $\nabla : C^\infty(M,E) \to C^\infty(M, T^* \otimes E),$


in view of the linear dependence of $\nabla_X$ on $X$. If $M$ has a Riemannian metric and
$E$ a Hermitian metric, it is natural to study the adjoint operator


(1.33) $\nabla^* : C^\infty(M, T^* \otimes E) \to C^\infty(M,E).$

If $u$ and $v$ are sections of $E$, $\xi$ a section of $T^*$, we have


(1.34) $(v, \nabla^*(\xi \otimes u)) = (\nabla v, \xi \otimes u) = (\nabla_X v, u) = (v, \nabla_X^* u),$


where $X$ is the vector field corresponding to $\xi$ via the Riemannian metric. Using
the divergence theorem we can establish:


Proposition 1.4. If $E$ has a metric connection, then


(1.35) $\nabla^*(\xi \otimes u) = \nabla_X^* u = -\nabla_X u - (\operatorname{div} X) u.$


Proof. The first identity follows from (1.34) and does not require $E$ to have a
metric connection. If $E$ does have a metric connection, integrating


$\langle \nabla_X v, u\rangle = -\langle v, \nabla_X u\rangle + X \langle v, u\rangle$


and using the identity


(1.36) $\int_M X f \, dV = -\int_M (\operatorname{div} X) f \, dV, \quad f \in C_0^\infty(M),$


give the second identity in (1.35) and complete the proof.


Exercises 


1. If $\nabla$ and $\widetilde{\nabla}$ are two connections on a vector bundle $E \to M$, show that

(1.37) $\widetilde{\nabla}_X u = \nabla_X u + C(X,u),$

where $C$ is a smooth section of $\mathrm{Hom}(T \otimes E, E) \approx T^* \otimes \mathrm{End}(E)$. Show that conversely,
if $C$ is such a section and $\nabla$ a connection, then (1.37) defines $\widetilde{\nabla}$ as a connection.

2. If $\nabla$ and $\widetilde{\nabla}$ are related as in Exercise 1, show that their curvatures $R$ and $\widetilde{R}$ are related
by

(1.38) $(\widetilde{R} - R)(X,Y) u = [C_X, \nabla_Y] u - [C_Y, \nabla_X] u - C_{[X,Y]} u + [C_X, C_Y] u,$

where $C_X$ is the section of $\mathrm{End}(E)$ defined by $C_X u = C(X,u)$.

In Exercises 3-5, let $P(x)$, $x \in M$, be a smooth family of projections on a vector
space $F$, with range $E_x$, forming a vector bundle $E \to M$; $E$ gets a natural connection
via (1.5).

3. Let $\gamma : I \to M$ be a smooth curve through $x_0 \in M$. Show that parallel transport of
$u(x_0) \in E_{x_0}$ along $\gamma$ is characterized by the following (with $P'(t) = dP(\gamma(t))/dt$):

$\dfrac{du}{dt} = P'(t) u.$

4. If each $P(x)$ is an orthogonal projection of the inner-product space $F$ onto $E_x$, show
that you get a metric connection. (Hint: Show that $du/dt \perp u(\gamma(t))$ via $P'P = (I - P)P'$.)

5. In what sense can $\Gamma = -dP\, P = -(I - P)\, dP$ be considered the connection 1-form,
as in (1.13)? Show that the curvature form (1.15) is given by

(1.39) $\Omega = P\, dP \wedge dP\, P.$

For more on this, see (4.50)-(4.53).

6. Show that the formula

(1.40) $d\Omega = \Omega \wedge \Gamma - \Gamma \wedge \Omega$

follows from (1.15). Relate this to the Bianchi identity. Compare with (2.13) in the next
section.

7. Let $E \to M$ be a vector bundle with connection $\nabla$, with two local frame fields $\{e_\alpha\}$
and $\{f_\alpha\}$, defined over $U \subset M$. Suppose

$f_\alpha(x) = g^\beta{}_\alpha(x)\, e_\beta(x), \quad e_\alpha(x) = h^\beta{}_\alpha(x)\, f_\beta(x);$

note that $g^\beta{}_\gamma(x)\, h^\gamma{}_\alpha(x) = \delta^\beta{}_\alpha$. Let $\Gamma^\alpha{}_{\beta j}$ be the connection coefficients for the frame
field $\{e_\alpha\}$, as in (1.7) and (1.8), and let $\widetilde{\Gamma}^\alpha{}_{\beta j}$ be the connection coefficients for the
frame field $\{f_\alpha\}$. Show that

(1.41) $\widetilde{\Gamma}^\alpha{}_{\beta j} = h^\alpha{}_\mu\, \Gamma^\mu{}_{\gamma j}\, g^\gamma{}_\beta + h^\alpha{}_\gamma\, (\partial_j g^\gamma{}_\beta).$


2. Second covariant derivatives and covariant-exterior 
derivatives 


Let $M$ be a Riemannian manifold, with Levi-Civita connection, and let $E \to M$
be a vector bundle with connection. In §1 we saw that the covariant derivative
acting on sections of $E$ yields an operator

(2.1) $\nabla : C^\infty(M,E) \to C^\infty(M, T^* \otimes E).$

Now on $T^* \otimes E$ we have the product connection, defined by (1.27), yielding

(2.2) $\nabla : C^\infty(M, T^* \otimes E) \to C^\infty(M, T^* \otimes T^* \otimes E).$


If we compose (2.1) and (2.2), we get a second-order differential operator called
the Hessian:


(2.3) $\nabla^2 : C^\infty(M,E) \to C^\infty(M, T^* \otimes T^* \otimes E).$


If $u$ is a section of $E$ and $X$ and $Y$ are vector fields, (2.3) defines $\nabla^2_{X,Y} u$ as a
section of $E$; using the derivation properties, we have the formula


(2.4) $\nabla^2_{X,Y} u = \nabla_X \nabla_Y u - \nabla_{(\nabla_X Y)} u.$

Note that the antisymmetric part is given by the curvature of the connection on $E$:

(2.5) $\nabla^2_{X,Y} u - \nabla^2_{Y,X} u = R(X,Y) u.$

Now the metric tensor on $M$ gives a linear map $T^* \otimes T^* \to \mathbb{R}$, hence a linear
bundle map $\gamma : T^* \otimes T^* \otimes E \to E$. We can consider the composition of this with
$\nabla^2$ in (2.3):

(2.6) $\gamma \circ \nabla^2 : C^\infty(M,E) \to C^\infty(M,E).$


We want to compare $\gamma \circ \nabla^2$ and $\nabla^* \nabla$, in the case when $E$ has a Hermitian metric
and a metric connection.


Proposition 2.1. If $\nabla$ is a metric connection on $E$, then


(2.7) $\nabla^* \nabla = -\gamma \circ \nabla^2$ on $C^\infty(M,E)$.


Proof. Pick a local orthonormal frame of vector fields $\{e_j\}$, with dual frame $\{v_j\}$.
Then, for $u \in C^\infty(M,E)$, $\nabla u = \sum v_j \otimes \nabla_{e_j} u$, so (1.35) implies


(2.8) $\nabla^* \nabla u = \sum \bigl[-\nabla_{e_j} \nabla_{e_j} u - (\operatorname{div} e_j) \nabla_{e_j} u\bigr].$

Using (2.4), we have

(2.9) $\nabla^* \nabla u = -\sum \nabla^2_{e_j, e_j} u - \sum \bigl[\nabla_{\nabla_{e_j} e_j} u + (\operatorname{div} e_j) \nabla_{e_j} u\bigr].$


The first term on the right is equal to $-\gamma \circ \nabla^2 u$. Now, given $p \in M$, if we choose
the local frame $\{e_j\}$ such that $\nabla_{e_j} e_k = 0$ at $p$, the rest of the right side vanishes
at $p$. This establishes the identity (2.7).


We next define a “covariant-exterior derivative” operator 
(2.10) dY :O@(M,A*T* @ B) — C™(M, A**™1T* @ E) 
as follows. For k = 0, dv = V, given by (2.1), and we require 
(2.11) d’(8 Au) = (dB) Au—BAdvu 


whenever (3 is a 1-form and u is a section of A*T* @ E. The operator dV is also 
called the “gauge exterior derivative.” Unlike the case of the ordinary exterior 
derivative, 


dv odY : C® (M, A*T* @ E) —+ C™ (M, A*t?T* @ E) 
is not necessarily zero, but rather 
(2.12) dv’ dYu=QAu, 


where {? is the curvature, and we use the antisymmetry (1.16) to regard Q as a 
section of A?T* @ End(F), as in (1.15). The verification of (2.12) is a straight- 
forward calculation; (2.5) is in fact the special case of this, for k = 0. 

The following is an alternative form of Bianchi’s identity (1.29): 


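(2.13) $d^\nabla \Omega = 0,$

where the left side is a priori a section of $\Lambda^3 T^* \otimes \operatorname{End}(E)$. This can also be
deduced from (2.12), the associative law $d^\nabla(d^\nabla d^\nabla) = (d^\nabla d^\nabla)d^\nabla$, and the
natural derivation property generalizing (2.11):


(2.14) $d^\nabla(A \wedge u) = (d^\nabla A) \wedge u + (-1)^j A \wedge d^\nabla u,$


where $u$ is a section of $\Lambda^k T^* \otimes E$ and $A$ is a section of $\Lambda^j T^* \otimes \operatorname{End}(E)$.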




Exercises 


1. Let $E \to M$ be a vector bundle with connection $\nabla$, $u \in C^\infty(M, E)$. Fix $p \in M$.
Show that if $\nabla u(p) = 0$, then $\nabla^2_{X,Y}u(p)$ is independent of the choice of connection
on $M$.

2. In particular, Exercise 1 applies to the trivial bundle $\mathbb{R} \times M$, with trivial flat
connection, for which $\nabla_X u = \langle X, du\rangle = Xu$. Thus, if $u \in C^\infty(M)$ is real-valued and
$du(p) = 0$, then $D^2u(p)$ is well defined as a symmetric bilinear form on $T_pM$. If, in a
coordinate system, $X = \sum X_j\,\partial/\partial x_j$, $Y = \sum Y_j\,\partial/\partial x_j$, show that


(2.15) $D^2_{X,Y}u(p) = \sum_{j,k} \dfrac{\partial^2 u}{\partial x_j\,\partial x_k}(p)\,X_j Y_k.$


Show that this invariance fails if $du(p) \neq 0$.
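3. If $u$ is a smooth section of $\Lambda^k T^* \otimes E$, show that


(2.16) $$\begin{aligned} d^\nabla u(X_0,\dots,X_k) = {}& \sum_{j=0}^{k} (-1)^j\,\nabla_{X_j}u(X_0,\dots,\widehat{X}_j,\dots,X_k) \\ &+ \sum_{j<\ell} (-1)^{j+\ell}\,u([X_j,X_\ell],X_0,\dots,\widehat{X}_j,\dots,\widehat{X}_\ell,\dots,X_k). \end{aligned}$$

Compare with formula (13.56) of Chap. 1 and Exercises 2 and 3 in §3 of Chap. 2.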

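4. Verify the identity (2.12), namely, $d^\nabla d^\nabla u = \Omega \wedge u$.

5. If $\nabla$ and $\widetilde{\nabla}$ are connections on $E \to M$, related by $\widetilde{\nabla}_X u = \nabla_X u + C(X,u)$,
$C \in C^\infty(M, T^* \otimes \operatorname{End}(E))$, with curvatures $R$ and $\widetilde{R}$, and curvature forms $\Omega$ and $\widetilde{\Omega}$,
show that


(2.17) $\widetilde{\Omega} - \Omega = d^\nabla C + C \wedge C.$


Here the wedge product of two sections of $T^* \otimes \operatorname{End}(E)$ is a section of the
bundle $\Lambda^2 T^* \otimes \operatorname{End}(E)$, produced in a natural fashion, as in (1.15). Show that (2.17) is
equivalent to (1.38).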


3. The curvature tensor of a Riemannian manifold 


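The Levi-Civita connection, which was introduced in §11 of Chap. 1, is a metric
connection on the tangent bundle $TM$ of a manifold $M$ with a Riemannian metric,
uniquely specified among all such connections by the zero-torsion condition


(3.1) $\nabla_Y X - \nabla_X Y = [Y, X].$
We recall the defining formula


(3.2) $$\begin{aligned} 2\langle\nabla_X Y, Z\rangle = {}& X\langle Y,Z\rangle + Y\langle X,Z\rangle - Z\langle X,Y\rangle \\ &+ \langle[X,Y],Z\rangle - \langle[X,Z],Y\rangle - \langle[Y,Z],X\rangle, \end{aligned}$$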




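derived in (11.22) of Chap. 1. Thus, in a local coordinate system with the naturally
associated frame field on the tangent bundle, the connection coefficients (1.8) are
given by


(3.3) $\Gamma^\ell{}_{jk} = \dfrac12\,g^{\ell\mu}\left[\dfrac{\partial g_{j\mu}}{\partial x_k} + \dfrac{\partial g_{k\mu}}{\partial x_j} - \dfrac{\partial g_{jk}}{\partial x_\mu}\right].$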
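As a sanity check on (3.3), the following minimal Python sketch evaluates the connection coefficients by central finite differences. The function names (`polar_metric`, `christoffel`, and so on), the step size, and the choice of the flat plane in polar coordinates as a test metric are illustrative assumptions, not part of the text; for that metric the nonzero coefficients should be $\Gamma^r{}_{\theta\theta} = -r$ and $\Gamma^\theta{}_{r\theta} = \Gamma^\theta{}_{\theta r} = 1/r$.

```python
# Sketch: connection coefficients (3.3) by central finite differences.
# All names and the test metric are illustrative assumptions.

def polar_metric(x):
    """Metric g_jk of the flat plane in polar coordinates x = (r, theta)."""
    r = x[0]
    return [[1.0, 0.0], [0.0, r * r]]

def inverse_2x2(g):
    """Inverse of a 2x2 matrix (this sketch is restricted to dimension 2)."""
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    return [[g[1][1] / det, -g[0][1] / det],
            [-g[1][0] / det, g[0][0] / det]]

def metric_derivative(metric, x, k, h=1e-5):
    """Central difference approximation of the matrix d g / d x_k at x."""
    xp, xm = list(x), list(x)
    xp[k] += h
    xm[k] -= h
    gp, gm = metric(xp), metric(xm)
    n = len(x)
    return [[(gp[i][j] - gm[i][j]) / (2 * h) for j in range(n)]
            for i in range(n)]

def christoffel(metric, x):
    """gamma[l][j][k] = (1/2) g^{l mu} (d_k g_{j mu} + d_j g_{k mu} - d_mu g_{jk})."""
    n = len(x)
    ginv = inverse_2x2(metric(x))
    dg = [metric_derivative(metric, x, k) for k in range(n)]  # dg[k][i][j] = d_k g_ij
    gamma = [[[0.0] * n for _ in range(n)] for _ in range(n)]
    for l in range(n):
        for j in range(n):
            for k in range(n):
                gamma[l][j][k] = 0.5 * sum(
                    ginv[l][mu] * (dg[k][j][mu] + dg[j][k][mu] - dg[mu][j][k])
                    for mu in range(n))
    return gamma

# Evaluate at r = 2, theta = 0.3 (index 0 is r, index 1 is theta).
gamma = christoffel(polar_metric, [2.0, 0.3])
print(gamma[0][1][1], gamma[1][0][1])
```

The same routine applies to any two-dimensional metric supplied as a function returning a 2x2 list; only `inverse_2x2` ties the sketch to dimension 2.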


The associated curvature tensor is the Riemann curvature tensor:
(3.4) $R(X,Y)Z = [\nabla_X, \nabla_Y]Z - \nabla_{[X,Y]}Z.$


In a local coordinate system such as that discussed above, the expression for the
Riemann curvature is a special case of (1.12), namely,


(3.5) $R^j{}_{k\ell m} = \partial_\ell\Gamma^j{}_{km} - \partial_m\Gamma^j{}_{k\ell} + \Gamma^j{}_{\nu\ell}\Gamma^\nu{}_{km} - \Gamma^j{}_{\nu m}\Gamma^\nu{}_{k\ell}.$
Consequently, we have an expression of the form
(3.6) $R^j{}_{k\ell m} = L(g_{\alpha\beta}, \partial_\mu\partial_\nu g_{\gamma\delta}) + Q(g_{\alpha\beta}, \partial_\mu g_{\gamma\delta}),$


where $L$ is linear in the second-order derivatives of $g_{\alpha\beta}(x)$ and $Q$ is quadratic in
the first-order derivatives of $g_{\alpha\beta}(x)$, each with coefficients depending on $g_{\alpha\beta}(x)$.

Building on Proposition 1.2, we have the following result on metrics whose 
Riemannian curvature is zero. 


Proposition 3.1. If $(M, g)$ is a Riemannian manifold whose curvature tensor
vanishes, then the metric $g$ is flat; that is, there is a coordinate system about each
$p \in M$ in which $g_{jk}(x)$ is constant.


Proof. It follows from Proposition 1.2 that on a neighborhood $U$ of $p$ there are
parallel vector fields $V_{(j)}$, $j = 1,\dots,n = \dim M$, namely, in a given coordinate
system
(3.7) $\nabla_{D_k}V_{(j)} = 0, \quad 1 \le j, k \le n,$


such that $V_{(j)}(p)$ form a basis of $T_pM$. Let $v_{(j)}$ be the 1-forms associated to $V_{(j)}$
by the metric $g$, so


(3.8) $v_{(j)}(X) = g(X, V_{(j)}),$
for all vector fields $X$. Hence
(3.9) $\nabla_{D_k}v_{(j)} = 0, \quad 1 \le j, k \le n.$


We have $v_{(j)} = \sum_k v_{(j)}^k\,dx_k$, with $v_{(j)}^k = v_{(j)}(D_k) = \langle D_k, v_{(j)}\rangle$. The zero-torsion
condition (3.1), in concert with (3.8), gives


(3.10) $\partial_\ell\langle v_{(j)}, D_k\rangle - \partial_k\langle v_{(j)}, D_\ell\rangle = \langle v_{(j)}, \nabla_{D_\ell}D_k\rangle - \langle v_{(j)}, \nabla_{D_k}D_\ell\rangle = 0,$


which is equivalent to


(3.11) $dv_{(j)} = 0, \quad j = 1,\dots,n.$

Hence, locally, there exist functions $x_j$, $j = 1,\dots,n$, such that

(3.12) $v_{(j)} = dx_j.$

The functions $(x_1,\dots,x_n)$ give a coordinate system near $p$. In this coordinate
system the inverse of the matrix $(g_{jk}(x))$ has entries $g^{jk}(x) = \langle dx_j, dx_k\rangle$. Now,
by (1.9),


(3.13) $\partial_\ell g^{jk}(x) = \langle\nabla_{D_\ell}dx_j, dx_k\rangle + \langle dx_j, \nabla_{D_\ell}dx_k\rangle = 0,$


so the proof is complete. 


We have seen in Proposition 1.1 that $R$ has the following symmetries:


(3.14) $R(X,Y) = -R(Y,X),$
(3.15) $\langle R(X,Y)Z, W\rangle = -\langle Z, R(X,Y)W\rangle.$


In other words, in terms of


(3.16) $R_{jk\ell m} = \langle R(D_\ell, D_m)D_k, D_j\rangle,$
we have

(3.17) $R_{jk\ell m} = -R_{jkm\ell}$

and

(3.18) $R_{jk\ell m} = -R_{kj\ell m}.$


The Riemann tensor has additional symmetries: 


Proposition 3.2. The Riemann tensor satisfies 


(3.19) $R(X,Y)Z + R(Y,Z)X + R(Z,X)Y = 0$
and
(3.20) $\langle R(X,Y)Z, W\rangle = \langle R(Z,W)X, Y\rangle,$


or, in index notation,


(3.21) $R_{ijk\ell} + R_{ik\ell j} + R_{i\ell jk} = 0$
and
(3.22) $R_{ijk\ell} = R_{k\ell ij}.$


Proof. Plugging in the definition of each of the three terms of (3.19), one gets a 
sum that is seen to cancel out by virtue of the zero-torsion condition (3.1). This 
gives (3.19) and hence (3.21). The identity (3.22) is an automatic consequence of 
(3.17), (3.18), and (3.21), by elementary algebraic manipulations, which we leave 
as an exercise, to complete the proof. Also, (3.22) follows from (3.50) below. 


The identity (3.19) is sometimes called Bianchi’s first identity, with (1.29) then 
called Bianchi’s second identity. 

There are important contractions of the Riemann tensor. The Ricci tensor is
defined by
(3.23) $\operatorname{Ric}_{jk} = R^i{}_{jik} = g^{\ell m}R_{\ell jmk},$


where the summation convention is understood. By (3.22), this is symmetric in
$j, k$. We can also raise indices:


(3.24) $\operatorname{Ric}^j{}_k = g^{j\ell}\operatorname{Ric}_{\ell k}; \quad \operatorname{Ric}^{jk} = g^{k\ell}\operatorname{Ric}^j{}_\ell.$

Contracting again defines the scalar curvature:

(3.25) $S = \operatorname{Ric}^j{}_j.$


As we will see below, the special nature of $R_{ijk\ell}$ for $\dim M = 2$ implies
(3.26) $\operatorname{Ric}_{jk} = \tfrac12\,S\,g_{jk}$ if $\dim M = 2$.


The Bianchi identity (1.29) yields an important identity for the Ricci tensor.
Specializing (1.30) to $\alpha = i$, $\beta = j$ and raising the second index give


(3.27) $R^{ij}{}_{ij;k} + R^{ij}{}_{jk;i} + R^{ij}{}_{ki;j} = 0,$
hence, $S_{;k} - \operatorname{Ric}^i{}_{k;i} - \operatorname{Ric}^j{}_{k;j} = 0$, or
(3.28) $S_{;k} = 2\operatorname{Ric}^j{}_{k;j}.$


This is called the Ricci identity. An equivalent form is


(3.29) $\operatorname{Ric}^{jk}{}_{;j} = \tfrac12\,(S\,g^{jk})_{;j}.$




The identity in this form leads us naturally to a tensor known as the Einstein
tensor:


(3.30) $G^{jk} = \operatorname{Ric}^{jk} - \tfrac12\,S\,g^{jk}.$


The Ricci identity is equivalent to
(3.31) $G^{jk}{}_{;j} = 0.$


As shown in Chap. 2, this means the Einstein tensor has zero divergence. This fact 
plays an important role in Einstein’s equation for the gravitational field. Note that 
by (3.26) the Einstein tensor always vanishes when dim M = 2. On the other 
hand, the identity (3.31) has the following implication when dim M > 2. 


Proposition 3.3. If $\dim M = n > 2$ and the Ricci tensor is a scalar multiple of
the metric tensor, the factor necessarily being $1/n$ times the scalar curvature:


(3.32) $\operatorname{Ric}_{jk} = \dfrac1n\,S\,g_{jk},$


then $S$ must be a constant.


Proof. Equation (3.32) is equivalent to
(3.33) $G_{jk} = \left(\dfrac1n - \dfrac12\right)S\,g_{jk}.$


By (3.31) and the fact that the covariant derivative of the metric tensor is 0, we
have
$0 = \left(\dfrac1n - \dfrac12\right)S_{;k},$


or $S_{;k} = 0$, which proves the proposition.


We now make some comments on the curvature of Riemannian manifolds
of dimension 2. By (3.17) and (3.18), in this case each component $R_{jk\ell m}$ of the
curvature tensor is either 0 or $\pm$ the quantity


(3.34) $R_{1212} = R_{2121} = gK, \quad g = \det(g_{jk}).$


One calls $K$ the Gauss curvature of $M$ when $\dim M = 2$.
Suppose we pick normal coordinates centered at $p \in M$, so $g_{jk}(p) = \delta_{jk}$. We
see that if $\dim M = 2$,


$\operatorname{Ric}_{jk}(p) = R_{1j1k} + R_{2j2k}.$
Now, the first term on the right is zero unless $j = k = 2$, and the second term is
zero unless $j = k = 1$. Hence, $\operatorname{Ric}_{jk}(p) = K(p)\,\delta_{jk}$, in normal coordinates, so in
arbitrary coordinates


(3.35) $\operatorname{Ric}_{jk} = K g_{jk}$; hence $K = \tfrac12\,S$ if $\dim M = 2$.
Explicit formulas for $K$ when $M$ is a surface in $\mathbb{R}^3$ are given by (4.22) and
(4.29), in the next section. (See also Exercises 2 and 5-7 below.) The following
is a fundamental calculation of the Gauss curvature of a two-dimensional surface
whose metric tensor is expressed in orthogonal coordinates:


(3.36) $ds^2 = E(x)\,dx_1^2 + G(x)\,dx_2^2.$


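Proposition 3.4. Suppose $\dim M = 2$ and the metric is given in coordinates by
(3.36). Then the Gauss curvature $k(x)$ is given by


(3.37) $k(x) = -\dfrac{1}{2\sqrt{EG}}\left[\partial_1\!\left(\dfrac{\partial_1 G}{\sqrt{EG}}\right) + \partial_2\!\left(\dfrac{\partial_2 E}{\sqrt{EG}}\right)\right].$


To establish (3.37), one can first compute that


$$\Gamma_1 = (\Gamma^j{}_{k1}) = \frac12\begin{pmatrix} E^{-1}\partial_1 E & E^{-1}\partial_2 E \\ -G^{-1}\partial_2 E & G^{-1}\partial_1 G \end{pmatrix}, \qquad \Gamma_2 = (\Gamma^j{}_{k2}) = \frac12\begin{pmatrix} E^{-1}\partial_2 E & -E^{-1}\partial_1 G \\ G^{-1}\partial_1 G & G^{-1}\partial_2 G \end{pmatrix}.$$

Then, computing $R_{12} = (R^j{}_{k12}) = \partial_1\Gamma_2 - \partial_2\Gamma_1 + \Gamma_1\Gamma_2 - \Gamma_2\Gamma_1$, we have


(3.38) $$\begin{aligned} R^1{}_{212} = {}& -\frac12\,\partial_1\!\left(\frac{\partial_1 G}{E}\right) - \frac12\,\partial_2\!\left(\frac{\partial_2 E}{E}\right) \\ &+ \frac14\left(-\frac{\partial_1 E}{E}\frac{\partial_1 G}{E} + \frac{\partial_2 E}{E}\frac{\partial_2 G}{G}\right) - \frac14\left(\frac{\partial_2 E}{E}\frac{\partial_2 E}{E} - \frac{\partial_1 G}{E}\frac{\partial_1 G}{G}\right). \end{aligned}$$


Now $R_{1212} = E\,R^1{}_{212}$ in this case, and (3.34) yields


(3.39) $k(x) = \dfrac{1}{EG}\,R_{1212} = \dfrac1G\,R^1{}_{212}.$
If we divide (3.38) by $G$ and then in the resulting formula for $k(x)$ interchange $E$
and $G$, and $\partial_1$ and $\partial_2$, and sum the two formulas for $k(x)$, we get


$$k(x) = -\frac14\left[\frac1G\,\partial_1\!\left(\frac{\partial_1 G}{E}\right) + \frac1E\,\partial_1\!\left(\frac{\partial_1 G}{G}\right)\right] - \frac14\left[\frac1E\,\partial_2\!\left(\frac{\partial_2 E}{G}\right) + \frac1G\,\partial_2\!\left(\frac{\partial_2 E}{E}\right)\right],$$


which is easily transformed into (3.37).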
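To see (3.37) in action, here is a small numerical sketch in Python. It assumes nothing beyond the formula itself; the function name `gauss_curvature_orthogonal`, the step size `h`, and the test metric (the unit sphere in colatitude/longitude coordinates, $E = 1$, $G = \sin^2 x_1$, whose curvature should come out close to 1) are illustrative choices, not taken from the text.

```python
import math

def gauss_curvature_orthogonal(E, G, x1, x2, h=1e-4):
    """Approximate (3.37) at (x1, x2) for ds^2 = E dx1^2 + G dx2^2.

    E and G are callables of (x1, x2); all derivatives are central differences.
    """
    def root(a, b):
        return math.sqrt(E(a, b) * G(a, b))

    def p(a, b):  # (d_1 G) / sqrt(EG)
        return (G(a + h, b) - G(a - h, b)) / (2 * h) / root(a, b)

    def q(a, b):  # (d_2 E) / sqrt(EG)
        return (E(a, b + h) - E(a, b - h)) / (2 * h) / root(a, b)

    d1p = (p(x1 + h, x2) - p(x1 - h, x2)) / (2 * h)
    d2q = (q(x1, x2 + h) - q(x1, x2 - h)) / (2 * h)
    return -(d1p + d2q) / (2 * root(x1, x2))

# Unit sphere: x1 = colatitude, x2 = longitude (hypothetical test case).
def E_sphere(a, b):
    return 1.0

def G_sphere(a, b):
    return math.sin(a) ** 2

k_sphere = gauss_curvature_orthogonal(E_sphere, G_sphere, 1.0, 0.5)
print(k_sphere)
```

The evaluation point must stay away from the coordinate singularities $x_1 = 0, \pi$, where $G$ vanishes and the finite differences degrade.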
If $E = G = e^{2v}$, we obtain a formula for the Gauss curvature of a surface
whose metric is a conformal multiple of the flat metric:


Corollary 3.5. Suppose $\dim M = 2$ and the metric is given in coordinates by
(3.40) $g_{jk}(x) = e^{2v}\,\delta_{jk},$

for a smooth $v$. Then the Gauss curvature $k(x)$ is given by

(3.41) $k(x) = -(\Delta_0 v)\,e^{-2v},$

where $\Delta_0$ is the flat Laplacian in these coordinates:


(3.42) $\Delta_0 v = \dfrac{\partial^2 v}{\partial x_1^2} + \dfrac{\partial^2 v}{\partial x_2^2}.$


For an alternative formulation of (3.41), note that the Laplace operator for the
metric $g_{jk}$ is given by


$\Delta f = g^{-1/2}\,\partial_j\bigl(g^{jk}g^{1/2}\,\partial_k f\bigr),$
and in the case (3.40), $g^{jk} = e^{-2v}\delta^{jk}$ and $g^{1/2} = e^{2v}$, so we have
(3.43) $\Delta f = e^{-2v}\Delta_0 f,$
and hence (3.41) is equivalent to
(3.44) $k(x) = -\Delta v.$
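The formula (3.41) is easy to test numerically. The sketch below is an illustration under stated assumptions: the function names, the five-point stencil for $\Delta_0$, and the test metric $g_{jk} = 4(1+|x|^2)^{-2}\delta_{jk}$ (the unit sphere in stereographic coordinates, for which the curvature should be 1) are choices made here, not taken from the text.

```python
import math

def conformal_gauss_curvature(v, x1, x2, h=1e-4):
    """Approximate k = -(Delta_0 v) e^{-2v} from (3.41) at the point (x1, x2).

    The flat Laplacian is replaced by the standard five-point stencil.
    """
    lap = (v(x1 + h, x2) + v(x1 - h, x2) + v(x1, x2 + h) + v(x1, x2 - h)
           - 4.0 * v(x1, x2)) / (h * h)
    return -lap * math.exp(-2.0 * v(x1, x2))

def v_sphere(x1, x2):
    """Conformal exponent with e^{2v} = 4 / (1 + |x|^2)^2 (assumed test case)."""
    return math.log(2.0 / (1.0 + x1 * x1 + x2 * x2))

k_stereo = conformal_gauss_curvature(v_sphere, 0.3, -0.7)
print(k_stereo)
```

A constant $v$ gives $k = 0$, as it should for a rescaled flat metric.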
The comparison of the Gauss curvature of two surfaces that are conformally 


equivalent is a source of a number of interesting results. The following general- 
ization of Corollary 3.5 is useful. 


Proposition 3.6. Let $M$ be a two-dimensional manifold with metric $g$, whose
Gauss curvature is $k(x)$. Suppose there is a conformally related metric


(3.45) $g' = e^{2u}g.$
Then the Gauss curvature $K(x)$ of $g'$ is given by
(3.46) $K(x) = \bigl(-\Delta u + k(x)\bigr)e^{-2u},$


where $\Delta$ is the Laplace operator for the metric $g$.


Proof. We will use Corollary 3.5 as a tool in this proof. It is shown in Chap. 5,
§11, that $(M, g)$ is locally conformally flat, so we can assume without loss of
generality that (3.40) holds; hence $k(x)$ is given by (3.41). Then
(3.47) $(g')_{jk} = e^{2w}\,\delta_{jk}, \quad w = u + v,$
and (3.41) gives
(3.48) $K(x) = -(\Delta_0 w)e^{-2w} = \bigl[-(\Delta_0 u)e^{-2v} - (\Delta_0 v)e^{-2v}\bigr]e^{-2u}.$


By (3.43) we have $(\Delta_0 u)e^{-2v} = \Delta u$, and applying (3.41) for $k(x)$ gives (3.46).

We end this section with a study of $\partial_j\partial_k g_{\ell m}(p_0)$ when one uses a geodesic
normal coordinate system centered at $p_0$. We know from §11 of Chap. 1 that in
such a coordinate system, $\Gamma^\ell{}_{jk}(p_0) = 0$ and hence $\partial_j g_{k\ell}(p_0) = 0$. Thus, in such
a coordinate system, we have


(3.49) $R^j{}_{k\ell m}(p_0) = \partial_\ell\Gamma^j{}_{km}(p_0) - \partial_m\Gamma^j{}_{k\ell}(p_0),$


and hence (3.3) yields
(3.50) $R_{jk\ell m}(p_0) = \tfrac12\bigl(\partial_j\partial_m g_{k\ell} + \partial_k\partial_\ell g_{jm} - \partial_j\partial_\ell g_{km} - \partial_k\partial_m g_{j\ell}\bigr).$


In light of the complexity of this formula, the following may be somewhat
surprising. Namely, as Riemann showed, one has


(3.51) $\partial_j\partial_k g_{\ell m}(p_0) = -\tfrac13\,R_{\ell jmk} - \tfrac13\,R_{\ell kmj}.$
This is related to the existence of nonobvious symmetries at the center of a
geodesic normal coordinate system, such as $\partial_j\partial_k g_{\ell m}(p_0) = \partial_\ell\partial_m g_{jk}(p_0)$. To
prove (3.51), by polarization it suffices to establish


(3.52) $\partial_j^2 g_{\ell\ell}(p_0) = -\tfrac23\,R_{\ell j\ell j}, \quad \forall\, j, \ell.$
Proving this is a two-dimensional problem, since (by (3.50)) both sides of the
asserted identity in (3.52) are unchanged if $M$ is replaced by the image under
$\operatorname{Exp}_p$ of the two-dimensional linear span of $D_j$ and $D_\ell$. All one needs to show is
that if $\dim M = 2$,


(3.53) $\partial_1^2 g_{22}(p_0) = -\tfrac23\,K(p_0)$ and $\partial_1^2 g_{11}(p_0) = 0,$


where $K(p_0)$ is the Gauss curvature of $M$ at $p_0$. Of these, the second part is trivial,
since $g_{11}(x) = 1$ on the horizontal line through $p_0$. To establish the first part of
(3.53), it is convenient to use geodesic polar coordinates, $(r, \theta)$, in which


(3.54) $ds^2 = dr^2 + G(r, \theta)\,d\theta^2.$


It is not hard to show that $G(r, \theta) = r^2 H(r, \theta)$, with $H(r, \theta) = 1 + O(r^2)$. For
the metric (3.54), the formula (3.37) implies that the Gauss curvature is


(3.55) $K = -\dfrac{1}{2G}\,\partial_r^2 G + \dfrac{1}{4G^2}\,(\partial_r G)^2 = -\dfrac{H_r}{rH} - \dfrac{H_{rr}}{2H} + \dfrac{H_r^2}{4H^2},$


so at the center


(3.56) $K(p_0) = -H_{rr} - \tfrac12\,H_{rr} = -\tfrac32\,H_{rr}.$


On the other hand, in normal coordinates $(x_1, x_2)$, along the $x_1$-axis, we have
$g_{22}(s, 0) = G(s, 0)/s^2 = H(s, 0)$, so the rest of the identity (3.53) is established.
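A quick consistency check of (3.53) and (3.56), offered as an illustrative example rather than part of the argument: on the unit sphere, geodesic polar coordinates about $p_0$ give $G(r,\theta) = \sin^2 r$, so that

```latex
\begin{aligned}
H(r,\theta) &= \frac{\sin^2 r}{r^2} = 1 - \frac{r^2}{3} + O(r^4),
  \qquad H_{rr}(0) = -\frac{2}{3}, \\
K(p_0) &= -\frac{3}{2}\,H_{rr}(0) = 1,
  \qquad
\partial_1^2 g_{22}(p_0) = H_{rr}(0) = -\frac{2}{3}\,K(p_0).
\end{aligned}
```

Both values agree with the expected curvature 1 of the unit sphere.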


Exercises 


Exercises 1-3 concern the problem of producing two-dimensional surfaces with
constant Gauss curvature.
1. For a two-dimensional Riemannian manifold $M$, take geodesic polar coordinates, so
the metric is
$ds^2 = dr^2 + G(r, \theta)\,d\theta^2.$


Use the formula (3.55) for the Gauss curvature, to deduce that


$K = -\dfrac{\partial_r^2\sqrt{G}}{\sqrt{G}}.$
Hence, if $K = -1$, then
$\partial_r^2\sqrt{G} = \sqrt{G}.$


Show that
$\sqrt{G}(0, \theta) = 0, \quad \partial_r\sqrt{G}(0, \theta) = 1,$
and deduce that $\sqrt{G}(r, \theta) = \varphi(r)$ is the unique solution to

$\varphi''(r) - \varphi(r) = 0, \quad \varphi(0) = 0, \quad \varphi'(0) = 1.$

Deduce that
$G(r, \theta) = \sinh^2 r.$


Use this computation to deduce that any two surfaces with Gauss curvature $-1$ are
locally isometric.
2. Suppose $M$ is a surface of revolution in $\mathbb{R}^3$, of the form


$x^2 + y^2 = g(z)^2.$
If it is parameterized by $x = g(u)\cos v$, $y = g(u)\sin v$, $z = u$, then
$ds^2 = \bigl(1 + g'(u)^2\bigr)\,du^2 + g(u)^2\,dv^2.$


Deduce from (3.37) that


$K = -\dfrac{g''(u)}{g(u)\bigl(1 + g'(u)^2\bigr)^2}.$


Hence, if $K = -1$,
$g''(u) = g(u)\bigl(1 + g'(u)^2\bigr)^2.$


Note that a sphere of radius $R$ is given by such a formula with $g(u) = \sqrt{R^2 - u^2}$.
Compute $K$ in this case.
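2A. Suppose instead that $M$ is a surface of revolution, described in the form


$z = f\bigl(\sqrt{x^2 + y^2}\bigr).$
If it is parameterized by $x = u\cos v$, $y = u\sin v$, $z = f(u)$, then
$ds^2 = \bigl(1 + f'(u)^2\bigr)\,du^2 + u^2\,dv^2.$
Show that
$K = -\dfrac{1}{u\sqrt{1 + f'(u)^2}}\,\dfrac{d}{du}\!\left(\dfrac{1}{\sqrt{1 + f'(u)^2}}\right) = \dfrac{\varphi'(u)}{2u}, \quad \varphi(u) = -\dfrac{1}{1 + f'(u)^2}.$


Thus deduce that


$K = -1 \Rightarrow \varphi(u) = -u^2 + c \Rightarrow f(u) = \displaystyle\int \sqrt{\frac{1}{u^2 - c} - 1}\;du.$


We note that this is an elliptic integral, for most values of $c$. Show that, for $c = 0$, you
get


$f(u) = \sqrt{1 - u^2} - \tfrac12\log\bigl(1 + \sqrt{1 - u^2}\bigr) + \tfrac12\log\bigl(1 - \sqrt{1 - u^2}\bigr).$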


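3. Suppose $M$ is a region in $\mathbb{R}^2$ whose metric tensor is a conformal multiple of the
standard flat metric
$g_{jk} = E(x)\,\delta_{jk} = e^{2v}\,\delta_{jk}.$


Suppose $E = E(r)$, $v = v(r)$. Deduce from (3.37) and (3.41) that


$K = -\dfrac{1}{2E^2}\left(E''(r) + \dfrac1r\,E'(r)\right) + \dfrac{1}{2E^3}\,E'(r)^2 = -\left(v''(r) + \dfrac1r\,v'(r)\right)e^{-2v}.$


Hence, if $K = -1$,

$v''(r) + \dfrac1r\,v'(r) = e^{2v}.$

Compute $K$ when
$g_{jk} = \dfrac{4}{(1 - r^2)^2}\,\delta_{jk}.$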


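4. Show that whenever $g_{jk}(x)$ satisfies $g_{jk}(p_0) = \delta_{jk}$, $\partial_\ell g_{jk}(p_0) = 0$, at some point
$p_0$, then (3.50) holds at $p_0$. If $\dim M = 2$, deduce that


(3.57) $K(p_0) = -\tfrac12\bigl(\partial_1^2 g_{22} + \partial_2^2 g_{11} - 2\,\partial_1\partial_2 g_{12}\bigr).$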


5. Suppose $M \subset \mathbb{R}^3$ is the graph of

$x_3 = f(x_1, x_2),$

so, using the natural $(x_1, x_2)$-coordinates on $M$,
$ds^2 = (1 + f_1^2)\,dx_1^2 + 2 f_1 f_2\,dx_1\,dx_2 + (1 + f_2^2)\,dx_2^2,$


where $f_j = \partial_j f$. Show that if $\nabla f(0) = 0$, then Exercise 4 applies, so


(3.58) $\nabla f(0) = 0 \Longrightarrow K(0) = f_{11}f_{22} - f_{12}^2.$


Compare the derivation of (4.22) in the next section.

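6. If $M \subset \mathbb{R}^3$ is the surface of Exercise 5, then the Gauss map $N : M \to S^2$ is given
by

$N(x, f(x)) = \dfrac{(-f_1, -f_2, 1)}{\sqrt{1 + f_1^2 + f_2^2}}.$


Show that if $\nabla f(0) = 0$, then, at $p_0 = (0, f(0))$, $DN(p_0) : \mathbb{R}^2 \to \mathbb{R}^2$ is given by


(3.59) $DN(p_0) = -\begin{pmatrix} \partial_1^2 f(0) & \partial_1\partial_2 f(0) \\ \partial_2\partial_1 f(0) & \partial_2^2 f(0) \end{pmatrix}.$


Here, $T_{p_0}M$ and $T_{(0,0,1)}S^2$ are both identified with the $(x_1, x_2)$-plane. Deduce from
Exercise 5 that
$K(p_0) = \det DN(p_0).$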


7. Deduce from Exercise 6 that whenever $M$ is a smooth surface in $\mathbb{R}^3$, with Gauss map
$N : M \to S^2$, then, with $DN(x) : T_xM \to T_{N(x)}S^2$,


(3.60) $K(x) = \det DN(x), \quad \forall\, x \in M.$


(Hint: Given $x \in M$, rotate coordinates so that $T_xM$ is parallel to the $(x_1, x_2)$-plane.)
This result is Gauss' Theorema Egregium for surfaces in $\mathbb{R}^3$. See Theorem 4.4 for a
more general formulation; see also (4.35), and Exercises 5, 8, 9, and 14 of §4.

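8. Recall from §11 of Chap. 1 that if $\gamma_s(t)$ is a family of curves $\gamma_s : [a,b] \to M$
satisfying $\gamma_s(a) = p$, $\gamma_s(b) = q$, and if $E(s) = \int_a^b \langle T, T\rangle\,dt$, $T = \gamma_s'(t)$,
then, with $V = (\partial/\partial s)\gamma_s(t)|_{s=0}$, $E'(s) = -2\int_a^b \langle V, \nabla_T T\rangle\,dt$, leading to the
stationary condition for $E$ that $\nabla_T T = 0$, which is the geodesic equation. Now
suppose $\gamma_{r,s}(t)$ is a two-parameter family of curves, $\gamma_{r,s}(a) = p$, $\gamma_{r,s}(b) = q$. Let
$V = (\partial/\partial s)\gamma_{r,s}(t)|_{0,0}$, $W = (\partial/\partial r)\gamma_{r,s}(t)|_{0,0}$. Show that


(3.61) $\dfrac{\partial^2}{\partial s\,\partial r}E(0,0) = 2\displaystyle\int_a^b \bigl[\langle R(W,T)V, T\rangle + \langle\nabla_T V, \nabla_T W\rangle - \langle\nabla_W V, \nabla_T T\rangle\bigr]\,dt.$


Note that the last term in the integral vanishes if $\gamma_{0,0}$ is a geodesic.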
9. If $Z$ is a Killing field, generating an isometry on $M$ (as in Chap. 2, §3), show that


$Z_{j;k;\ell} = R^m{}_{\ell kj}\,Z_m.$


(Hint: From Killing's equation $Z_{j;k} + Z_{k;j} = 0$, derive $Z_{j;k;\ell} = -Z_{k;\ell;j} - R^m{}_{k\ell j}\,Z_m$.
Iterate this process two more times, going through the cyclic permutations
of $(j, k, \ell)$. Use Bianchi's first identity.) Note that the identity desired is equivalent to


$\nabla^2_{X,Y}Z = R(Y,Z)X$ if $Z$ is a Killing field.




10. Derive the following equation of Jacobi for a variation of geodesics. If $\gamma_s(t)$ is a
one-parameter family of geodesics, $X = \gamma_s'(t)$, and $W = (\partial/\partial s)\gamma_s$, then


$\nabla_X\nabla_X W = R(X, W)X.$


(Hint: Start with $0 = \nabla_W\nabla_X X$, and use $[X, W] = 0$.)

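11. Raising the second index of $R^j{}_{k\ell m}$, you obtain $R^{jk}{}_{\ell m}$, the coordinate expression
for $\mathcal{R}$, which can be regarded as a section of $\operatorname{End}(\Lambda^2 T)$. Suppose $M = X \times Y$
with a product Riemannian metric and associated curvatures $\mathcal{R}$, $\mathcal{R}_X$, $\mathcal{R}_Y$. Using the
splitting

$\Lambda^2(V \oplus W) = \Lambda^2 V \oplus (\Lambda^1 V \otimes \Lambda^1 W) \oplus \Lambda^2 W,$


write $\mathcal{R}$ as a $3 \times 3$ block matrix. Show that


$\mathcal{R} = \begin{pmatrix} \mathcal{R}_X & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & \mathcal{R}_Y \end{pmatrix}.$
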
In Exercises 12-14, let $X$, $Y$, $Z$, and so forth, belong to the space $\mathfrak{g}$ of left-invariant
vector fields on a Lie group $G$, assumed to have a bi-invariant Riemannian metric.
(Compact Lie groups have these.)

12. Show that any (constant-speed) geodesic $\gamma$ on $G$ with $\gamma(0) = e$, the identity element,
is a subgroup of $G$, that is, $\gamma(s + t) = \gamma(s)\gamma(t)$. Deduce that $\nabla_X X = 0$ for $X \in \mathfrak{g}$.
(Hint: Given $p = \gamma(t_0)$, consider the "reflection" $R_p(g) = p g^{-1} p$, an isometry on $G$
that fixes $p$ and leaves $\gamma$ invariant, though reversing its direction. From this, one can
deduce that $p^2 = \gamma(2t_0)$.)

13. Show that $\nabla_X Y = (1/2)[X, Y]$ for $X, Y \in \mathfrak{g}$. (Hint: $0 = \nabla_X X = \nabla_Y Y = \nabla_{(X+Y)}(X + Y)$.)

This identity is called the Maurer-Cartan structure equation.

14. Show that


$R(X,Y)Z = -\tfrac14\,[[X,Y],Z], \quad \langle R(X,Y)Z, W\rangle = -\tfrac14\,\langle[X,Y],[Z,W]\rangle.$


15. If $E \to M$ is a vector bundle with connection $\widetilde{\nabla}$, and $\nabla = \widetilde{\nabla} + C$, as in
Exercises 1 and 2 of §1, and $M$ has Levi-Civita connection $D$, so that $\operatorname{Hom}(T \otimes E, E)$
acquires a connection from $D$ and $\widetilde{\nabla}$, which we'll also denote as $\widetilde{\nabla}$, show that (1.38)
is equivalent to


(3.62) $(R - \widetilde{R})(X,Y)u = (\widetilde{\nabla}_X C)(Y, u) - (\widetilde{\nabla}_Y C)(X, u) + [C_X, C_Y]u.$


This is a general form of the "Palatini identity."

16. If $g$ is a metric tensor and $h$ a symmetric, second-order tensor field, consider the
family of metric tensors $g_\tau = g + \tau h$, for $\tau$ close to zero, yielding the Levi-Civita
connections


$\nabla^\tau = \nabla + C(\tau),$
where $\nabla = \nabla^0$. If $C' = C'(0)$, show that


(3.63) $\langle C'(X,Y), Z\rangle = \tfrac12(\nabla_X h)(Y,Z) + \tfrac12(\nabla_Y h)(X,Z) - \tfrac12(\nabla_Z h)(X,Y).$


(Hint: Use (3.2).)
17. Let $R(\tau)$ be the Riemann curvature tensor of $g_\tau$, and set $R' = R'(0)$. Show that
(3.62) yields
(3.64) $R'(X,Y)Z = (\nabla_X C')(Y,Z) - (\nabla_Y C')(X,Z).$
Using (3.63), show that
(3.65) $$\begin{aligned} 2\langle R'(X,Y)Z, W\rangle = {}& (\nabla^2_{Y,W}h)(X,Z) + (\nabla^2_{X,Z}h)(Y,W) - (\nabla^2_{X,W}h)(Y,Z) \\ &- (\nabla^2_{Y,Z}h)(X,W) - h(R(X,Y)Z, W) - h(R(X,Y)W, Z). \end{aligned}$$


(Hint: Use the derivation property of the covariant derivative to obtain a formula for
$\nabla_X C'$ from (3.63).)
18. Show that


(3.66) $$\begin{aligned} 6\langle R(X,Y)Z, W\rangle = {}& K(X+W, Y+Z) - K(Y+W, X+Z) \\ &- K(X, Y+Z) - K(Y, X+W) - K(Z, X+W) \\ &- K(W, Y+Z) + K(X, Y+W) + K(Y, X+Z) \\ &+ K(Z, Y+W) + K(W, X+Z) + K(X, Z) \\ &+ K(Y, W) - K(X, W) - K(Y, Z), \end{aligned}$$
where
(3.67) $K(X,Y) = \langle R(X,Y)Y, X\rangle.$


See (4.34) for an interpretation of the right side of (3.67).
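19. Using (3.51), show that, in exponential coordinates centered at $p$, the function
$g = \det(g_{jk})$ satisfies, for $|x|$ small,


(3.68) $g(x) = 1 - \dfrac13\displaystyle\sum_{\ell,m}\operatorname{Ric}_{\ell m}(p)\,x_\ell x_m + O(|x|^3).$


Deduce that if $A_{n-1}$ = area of $S^{n-1} \subset \mathbb{R}^n$ and $V_n$ = volume of unit ball in $\mathbb{R}^n$,
then, for $r$ small,


(3.69) $V(B_r(p)) = \left(V_n - \dfrac{A_{n-1}}{6n(n+2)}\,S(p)\,r^2 + O(r^3)\right)r^n.$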


4. Geometry of submanifolds and subbundles 


Let $M$ be a Riemannian manifold, of dimension $n$, and let $S$ be a submanifold,
of dimension $k$, with the induced metric tensor. $M$ has a Levi-Civita connection
$\nabla$ and Riemann tensor $R$. Denote by $\nabla^0$ and $R_S$ the connection and curvature of
$S$, respectively. We aim to relate these objects. The second fundamental form is
defined by


(4.1) $II(X,Y) = \nabla_X Y - \nabla^0_X Y,$


for $X$ and $Y$ tangent to $S$. Note that $II$ is linear in $X$ and in $Y$ over $C^\infty(S)$. Also,
by the zero-torsion condition,


(4.2) $II(X,Y) = II(Y,X).$
Proposition 4.1. $II(X,Y)$ is normal to $S$ at each point.
Proof. If $X$, $Y$ and $Z$ are tangent to $S$, we have

$\langle\nabla_X Y, Z\rangle - \langle\nabla^0_X Y, Z\rangle = -\langle Y, \nabla_X Z\rangle + X\langle Y, Z\rangle + \langle Y, \nabla^0_X Z\rangle - X\langle Y, Z\rangle,$
and making the obvious cancellation, we obtain
(4.3) $\langle II(X,Y), Z\rangle = -\langle Y, II(X,Z)\rangle.$
Using (4.2), we have
(4.4) $\langle II(X,Y), Z\rangle = -\langle Y, II(Z,X)\rangle;$
that is, the trilinear form given by the left side changes sign under a cyclic
permutation of its arguments. Since three such permutations produce the original form,
the left side of (4.4) must equal its own negative, hence be 0. This proves the
proposition.


Denote by $\nu(S)$ the bundle of normal vectors to $S$, called the normal bundle
of $S$. It follows that $II$ is a section of $\operatorname{Hom}(TS \otimes TS, \nu(S))$.


Corollary 4.2. For $X$ and $Y$ tangent to $S$, $\nabla^0_X Y$ is the tangential projection on
$TS$ of $\nabla_X Y$.


Let $\xi$ be normal to $S$. We have a linear map, called the Weingarten map,


(4.5) $A_\xi : T_pS \longrightarrow T_pS$
uniquely defined by
(4.6) $\langle A_\xi X, Y\rangle = \langle\xi, II(X,Y)\rangle.$


We also define the section $A$ of $\operatorname{Hom}(\nu(S) \otimes TS, TS)$ by
(4.7) $A(\xi, X) = A_\xi X.$
We define a connection on $\nu(S)$ as follows; if $\xi$ is a section of $\nu(S)$, set


$\nabla^1_X\xi = P^\perp\nabla_X\xi,$


where $P^\perp(x)$ is the orthogonal projection of $T_xM$ onto $\nu_x(S)$. The following
identity is called the Weingarten formula.


Proposition 4.3. If $\xi$ is a section of $\nu(S)$,
(4.8) $\nabla^1_X\xi = \nabla_X\xi + A_\xi X.$


Proof. It suffices to show that $\nabla_X\xi + A_\xi X$ is normal to $S$. In fact, if $Y$ is tangent
to $S$,


$$\begin{aligned} \langle\nabla_X\xi, Y\rangle + \langle A_\xi X, Y\rangle &= X\langle\xi, Y\rangle - \langle\xi, \nabla_X Y\rangle + \langle\xi, II(X,Y)\rangle \\ &= 0 - \langle\xi, \nabla^0_X Y\rangle - \langle\xi, II(X,Y)\rangle + \langle\xi, II(X,Y)\rangle \\ &= 0, \end{aligned}$$
which proves the proposition.
An equivalent statement is that, for $X$ tangent to $S$, $\xi$ normal to $S$,
(4.9) $\nabla_X\xi = \nabla^1_X\xi - A_\xi X$
is an orthogonal decomposition, into components normal and tangent to $S$,
respectively. Sometimes this is taken as the definition of $A_\xi$ or, equivalently, by
(4.6), of the second fundamental form.


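In the special case that $S$ is a hypersurface of $M$ (i.e., $\dim M = \dim S + 1$), if
$\xi = N$ is a smooth unit normal field to $S$, we see that, for $X$ tangent to $S$,


$\langle\nabla_X N, N\rangle = \tfrac12\,X\langle N, N\rangle = 0,$
so $\nabla^1_X N = 0$ in this case, and (4.9) takes the form
(4.10) $\nabla_X N = -A_N X,$


the classical form of the Weingarten formula.
We now compare the tensors $R$ and $R_S$. Let $X$, $Y$ and $Z$ be tangent to $S$. Then


(4.11) $$\begin{aligned} \nabla_X\nabla_Y Z &= \nabla_X\bigl(\nabla^0_Y Z + II(Y,Z)\bigr) \\ &= \nabla^0_X\nabla^0_Y Z + II(X, \nabla^0_Y Z) - A_{II(Y,Z)}X + \nabla^1_X II(Y,Z). \end{aligned}$$
Reversing $X$ and $Y$, we have
$\nabla_Y\nabla_X Z = \nabla^0_Y\nabla^0_X Z + II(Y, \nabla^0_X Z) - A_{II(X,Z)}Y + \nabla^1_Y II(X,Z).$
Also,


(4.12) $\nabla_{[X,Y]}Z = \nabla^0_{[X,Y]}Z + II([X,Y], Z).$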


From (4.11) and (4.12) we obtain the important identity


(4.13) $$\begin{aligned} (R - R_S)(X,Y)Z = {}& \bigl\{II(X, \nabla^0_Y Z) - II(Y, \nabla^0_X Z) - II([X,Y], Z) \\ &\quad + \nabla^1_X II(Y,Z) - \nabla^1_Y II(X,Z)\bigr\} \\ &- \bigl\{A_{II(Y,Z)}X - A_{II(X,Z)}Y\bigr\}. \end{aligned}$$


Here, the quantity in the first set of braces $\{\ \}$ is normal to $S$, and the quantity in
the second pair of braces is tangent to $S$. The identity (4.13) is called the
Gauss-Codazzi equation. A restatement of the identity for the tangential components is
the following, known as Gauss' Theorema Egregium.


Theorem 4.4. For $X$, $Y$, $Z$ and $W$ tangent to $S$,


(4.14) $\langle(R - R_S)(X,Y)Z, W\rangle = \langle II(Y,W), II(X,Z)\rangle - \langle II(X,W), II(Y,Z)\rangle.$
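As a quick illustration of (4.14) (a worked example, not taken from the text), let $S$ be the sphere of radius $\rho$ centered at the origin in $M = \mathbb{R}^n$, with outward unit normal $N(x) = x/\rho$, so that $R = 0$ and $\nabla_X N = X/\rho$ for $X$ tangent to $S$. Then (4.10) and (4.6) give:

```latex
\begin{aligned}
A_N X &= -\rho^{-1} X, \qquad II(X,Y) = -\rho^{-1}\langle X,Y\rangle\, N, \\
\langle R_S(X,Y)Z, W\rangle
  &= \langle II(X,W), II(Y,Z)\rangle - \langle II(Y,W), II(X,Z)\rangle \\
  &= \rho^{-2}\bigl(\langle X,W\rangle\langle Y,Z\rangle
       - \langle Y,W\rangle\langle X,Z\rangle\bigr).
\end{aligned}
```

In particular, for orthonormal $U$, $V$ tangent to $S$, one finds $\langle R_S(U,V)V, U\rangle = \rho^{-2}$.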


The normal component of the identity (4.13) is specifically Codazzi's equation.
It takes a shorter form in case $S$ has codimension 1 in $M$. In that case, choose a
unit normal field $N$, and let


(4.15) $II(X,Y) = \widetilde{II}(X,Y)\,N;$
$\widetilde{II}$ is a tensor field of type (0,2) on $S$. Then Codazzi's equation is equivalent to


(4.16) $\langle R(X,Y)Z, N\rangle = (\nabla^0_X\widetilde{II})(Y,Z) - (\nabla^0_Y\widetilde{II})(X,Z),$


for $X$, $Y$, $Z$ tangent to $S$, since of course $R_S(X,Y)Z$ is tangent to $S$.
In the classical case, where $S$ is a hypersurface in flat Euclidean space, $R = 0$,
and Codazzi's equation becomes


(4.17) $(\nabla^0_Y\widetilde{II})(X,Z) - (\nabla^0_X\widetilde{II})(Y,Z) = 0,$


that is, $\nabla^0\widetilde{II}$ is a symmetric tensor field of type (0,3). In this case, from the
identity $\widetilde{II}_{jk;\ell} = \widetilde{II}_{\ell k;j}$, we deduce $A_j{}^k{}_{;k} = A_k{}^k{}_{;j} = (\operatorname{Tr} A)_{;j}$, where $A = A_N$
is the Weingarten map. Equivalently,


(4.18) $\operatorname{div} A = d(\operatorname{Tr} A).$


An application of the Codazzi equation to minimal surfaces can be found in the 
exercises after §6 of Chap. 14. 

It is useful to note the following characterization of the second fundamental
form for a hypersurface $M$ in $\mathbb{R}^n$. Translating and rotating coordinates, we can
move a specific point $p \in M$ to the origin in $\mathbb{R}^n$ and suppose $M$ is given locally by


$x_n = f(x'), \quad \nabla f(0) = 0,$


where $x' = (x_1,\dots,x_{n-1})$. We can then identify the tangent space of $M$ at $p$
with $\mathbb{R}^{n-1}$.


Proposition 4.5. The second fundamental form of $M$ at $p$ is given by the Hessian
of $f$:


(4.19) $\widetilde{II}(X,Y) = \displaystyle\sum_{j,k=1}^{n-1}\frac{\partial^2 f}{\partial x_j\,\partial x_k}(0)\,X_j Y_k.$


Proof. From (4.9) we have, for any $\xi$ normal to $M$,
(4.20) $\langle II(X,Y), \xi\rangle = -\langle\nabla_X\xi, Y\rangle,$
where $\nabla$ is the flat connection on $\mathbb{R}^n$. Taking

(4.21) $\xi = (-\partial_1 f,\dots,-\partial_{n-1}f, 1)$
gives the desired formula.


If $S$ is a surface in $\mathbb{R}^3$, given locally by $x_3 = f(x_1, x_2)$ with $\nabla f(0) = 0$, then
the Gauss curvature of $S$ at the origin is seen by (4.14) and (4.19) to equal


(4.22) $\det\left(\dfrac{\partial^2 f(0)}{\partial x_j\,\partial x_k}\right).$


Consider the example of the unit sphere in $\mathbb{R}^3$, centered at $(0,0,1)$. Then the
"south pole" lies at the origin, near which $S^2$ is given by


(4.23) $x_3 = 1 - (1 - x_1^2 - x_2^2)^{1/2}.$

In this case (4.22) implies that the Gauss curvature $K$ is equal to 1 at the south
pole. Of course, by symmetry it follows that $K = 1$ everywhere on the unit
sphere $S^2$.


Besides providing a good conception of the second fundamental form of a
hypersurface in $\mathbb{R}^n$, Proposition 4.5 leads to useful formulas for computation, one
of which we will give in (4.29). First, we give a more invariant reformulation of
Proposition 4.5. Suppose the hypersurface $M$ in $\mathbb{R}^n$ is given by


(4.24) $u(x) = c,$


with $\nabla u \neq 0$ on $M$. Then we can use the computation (4.20) with $\xi = \operatorname{grad} u$ to
obtain


(4.25) $\langle II(X,Y), \operatorname{grad} u\rangle = -(D^2u)(X,Y),$


where $D^2u$ is the Hessian of $u$; we can think of $(D^2u)(X,Y)$ as $Y \cdot (D^2u)X$,
where $D^2u$ is the $n \times n$ matrix of second-order partial derivatives of $u$. In other
words,


(4.26) $\widetilde{II}(X,Y) = -|\operatorname{grad} u|^{-1}(D^2u)(X,Y),$


for $X$, $Y$ tangent to $M$.
In particular, if $M$ is a two-dimensional surface in $\mathbb{R}^3$ given by (4.24), then the
Gauss curvature at $p \in M$ is given by


(4.27) $K(p) = |\operatorname{grad} u|^{-2}\det\bigl(D^2u\big|_{T_pM}\bigr),$


where $D^2u|_{T_pM}$ denotes the restriction of the quadratic form $D^2u$ to the tangent
space $T_pM$, producing a linear transformation on $T_pM$ via the metric on $T_pM$.
With this calculation we can derive the following formula, extending (4.22).


Proposition 4.6. If $M \subset \mathbb{R}^3$ is given by
(4.28) $x_3 = f(x_1, x_2),$


then, at $p = (x', f(x')) \in M$, the Gauss curvature is given by


(4.29) $K(p) = \bigl(1 + |\nabla f(x')|^2\bigr)^{-2}\det\left(\dfrac{\partial^2 f}{\partial x_j\,\partial x_k}\right).$


Proof. We can apply (4.27) with $u(x) = f(x_1, x_2) - x_3$. Note that $|\nabla u|^2 =
1 + |\nabla f(x')|^2$ and


(4.30) $D^2u = \begin{pmatrix} D^2 f & 0 \\ 0 & 0 \end{pmatrix}.$


Noting that a basis of $T_pM$ is given by $(1, 0, \partial_1 f) = v_1$, $(0, 1, \partial_2 f) = v_2$, we
readily obtain


(4.31) $\det D^2u\big|_{T_pM} = \dfrac{\det\bigl(v_j \cdot (D^2u)\,v_k\bigr)}{\det(v_j \cdot v_k)} = \bigl(1 + |\nabla f(x')|^2\bigr)^{-1}\det D^2 f,$


which yields (4.29).


If you apply Proposition 4.6 to the case (4.23) of a hemisphere of unit radius,
the calculation that $K = 1$ everywhere is easily verified. The formula (4.29)
gives rise to interesting problems in nonlinear PDE, some of which are studied in
Chap. 14.
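The verification mentioned above can also be done numerically. The following Python sketch evaluates (4.29) with central finite differences for the hemisphere (4.23); the function names, the step size, and the sample points are illustrative assumptions, and the computed values should be close to 1 both at the south pole and away from it.

```python
import math

def graph_gauss_curvature(f, x1, x2, h=1e-4):
    """Approximate (4.29): K = (1 + |grad f|^2)^(-2) det(D^2 f) at (x1, x2)."""
    f0 = f(x1, x2)
    # First derivatives by central differences.
    f1 = (f(x1 + h, x2) - f(x1 - h, x2)) / (2 * h)
    f2 = (f(x1, x2 + h) - f(x1, x2 - h)) / (2 * h)
    # Second derivatives by the standard three-point and cross stencils.
    f11 = (f(x1 + h, x2) - 2 * f0 + f(x1 - h, x2)) / h ** 2
    f22 = (f(x1, x2 + h) - 2 * f0 + f(x1, x2 - h)) / h ** 2
    f12 = (f(x1 + h, x2 + h) - f(x1 + h, x2 - h)
           - f(x1 - h, x2 + h) + f(x1 - h, x2 - h)) / (4 * h ** 2)
    return (f11 * f22 - f12 ** 2) / (1 + f1 ** 2 + f2 ** 2) ** 2

def hemisphere(x1, x2):
    """The graph (4.23): lower unit hemisphere centered at (0, 0, 1)."""
    return 1.0 - math.sqrt(1.0 - x1 ** 2 - x2 ** 2)

k_pole = graph_gauss_curvature(hemisphere, 0.0, 0.0)
k_off = graph_gauss_curvature(hemisphere, 0.3, 0.4)
print(k_pole, k_off)
```

At the south pole the gradient vanishes and the formula reduces to (4.22); the off-center point exercises the factor $(1 + |\nabla f|^2)^{-2}$.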

We now define the sectional curvature of a Riemannian manifold $M$. Given
$p \in M$, let $\Pi$ be a 2-plane in $T_pM$, $\Sigma = \operatorname{Exp}_p(\Pi)$. The sectional curvature of $M$
at $p$ is


(4.32) $K_p(\Pi)$ = Gauss curvature of $\Sigma$ at $p$.


If $U$ and $V$ form an orthonormal basis of $T_p\Sigma = \Pi$, then by the definition of
Gauss curvature,


(4.33) $K_p(\Pi) = \langle R_\Sigma(U,V)V, U\rangle.$


We have the following more direct formula for the sectional curvature.


Proposition 4.7. With $U$ and $V$ as above, $R$ the Riemann tensor of $M$,
(4.34) $K_p(\Pi) = \langle R(U,V)V, U\rangle.$


Proof. It suffices to show that the second fundamental form of $\Sigma$ vanishes at $p$.
Since $II(X,Y)$ is symmetric, it suffices to show that $II(X,X) = 0$ for each
$X \in T_p\Sigma = \Pi$. So pick a geodesic $\gamma$ in $M$ such that $\gamma(0) = p$, $\gamma'(0) = X$. Then
$\gamma \subset \Sigma$, and $\gamma$ must also be a geodesic in $\Sigma$, so


$\nabla_T T = \nabla^0_T T, \quad T = \gamma'(t),$


which implies $II(X,X) = 0$. This proves (4.34).
Note that if $S \subset M$ has codimension 1, $p \in S$, and $\Pi \subset T_pS$, then, by (4.14),


(4.35) $K^S_p(\Pi) - K^M_p(\Pi) = \det\begin{pmatrix} \widetilde{II}(U,U) & \widetilde{II}(U,V) \\ \widetilde{II}(V,U) & \widetilde{II}(V,V) \end{pmatrix}.$


Note how this is a direct generalization of (3.60).

The results above comparing connections and curvatures of a Riemannian
manifold and a submanifold are special cases of more general results on subbundles,
which arise in a number of interesting situations. Let $E$ be a vector bundle over a
manifold $M$, with an inner product and a metric connection $\nabla$. Let $E_0 \to M$ be a
subbundle. For each $x \in M$, let $P_x$ be the orthogonal projection of $E_x$ onto $E_{0x}$.
Set


(4.36) $\nabla^0_X u(x) = P_x\nabla_X u(x),$
when $u$ is a section of $E_0$. Note that, for scalar $f$,


$$\begin{aligned} \nabla^0_X fu(x) &= P_x\bigl(f\,\nabla_X u(x) + (Xf)u\bigr) \\ &= f\,P_x\nabla_X u(x) + (Xf)\,u(x), \end{aligned}$$
provided $u$ is a section of $E_0$, so $P_x u(x) = u(x)$. This shows that (4.36) defines a
connection on $E_0$. Since $\langle\nabla^0_X u, v\rangle = \langle\nabla_X u, v\rangle$ for sections $u, v$ of $E_0$, it is clear
that $\nabla^0$ is also a metric connection. Similarly, if $E_1$ is the orthogonal bundle, a
subbundle of $E$, a metric connection on $E_1$ is given by


(4.37) $\nabla^1_X v(x) = (I - P_x)\nabla_X v(x),$


for a section $v$ of $E_1$.
It is useful to treat $\nabla^0$ and $\nabla^1$ on an equal footing, so we define a new
connection $\widetilde{\nabla}$ on $E$, also a metric connection, by


(4.38) $\widetilde{\nabla} = \nabla^0 \oplus \nabla^1.$


Then there is the relation


(4.39) $\nabla_X = \widetilde{\nabla}_X + C_X,$
where

(4.40) $C_X = \begin{pmatrix} 0 & II^1_X \\ II^0_X & 0 \end{pmatrix}.$


Here, $II^0_X : E_0 \to E_1$ is the second fundamental form of $E_0 \subset E$, and $II^1_X :
E_1 \to E_0$ is the second fundamental form of $E_1 \subset E$. We also set $II^j(X,u) =
II^j_X u$. In this context, the Weingarten formula has the form


(4.41) $C_X^t = -C_X$, i.e., $(II^1_X)^t = -II^0_X.$


Indeed, for any two connections related by (4.39), with $C \in \operatorname{Hom}(TM \otimes E, E)$,
if $\nabla$ and $\widetilde{\nabla}$ are both metric connections, the first part of (4.41) holds.

We remark that when $\gamma$ is a curve in a Riemannian manifold $M$, and for
$p \in \gamma$, $E_p = T_pM$, $E_{0p} = T_p\gamma$, $E_{1p} = \nu_p(\gamma)$, the normal space, and if $\nabla$ is the
Levi-Civita connection on $M$, then $\widetilde{\nabla}$ is sometimes called the Fermi-Walker
connection on $\gamma$. One also (especially) considers a timelike curve in a Lorentz
manifold.

Let us also remark that if we start with metric connections $\nabla^j$ on $E_j$, then form
$\widetilde{\nabla}$ on $E$ by (4.38), and then define $\nabla$ on $E$ by (4.39), provided that (4.40) holds,
it follows that $\nabla$ is also a metric connection on $E$, and the connections $\nabla^j$ are
recovered by (4.36) and (4.37).

In general, for any two connections $\nabla$ and $\widetilde{\nabla}$, related by (4.39) for some
$\operatorname{End}(E)$-valued 1-form $C$, we have the following relation between their curvature
tensors $R$ and $\widetilde{R}$, already anticipated in Exercise 2 of §1:


(4.42) $(R - \widetilde{R})(X,Y)u = \bigl\{[C_X, \widetilde{\nabla}_Y] - [C_Y, \widetilde{\nabla}_X] - C_{[X,Y]}\bigr\}u + [C_X, C_Y]u.$
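For the reader who wants the intermediate step, (4.42) follows by expanding both curvatures directly from (4.39). The computation below is a sketch using only the definition $R(X,Y) = [\nabla_X, \nabla_Y] - \nabla_{[X,Y]}$ and its analogue for $\widetilde{\nabla}$:

```latex
\begin{aligned}
[\nabla_X, \nabla_Y]
  &= [\widetilde{\nabla}_X + C_X,\ \widetilde{\nabla}_Y + C_Y] \\
  &= [\widetilde{\nabla}_X, \widetilde{\nabla}_Y] + [C_X, \widetilde{\nabla}_Y]
     - [C_Y, \widetilde{\nabla}_X] + [C_X, C_Y], \\
\nabla_{[X,Y]} &= \widetilde{\nabla}_{[X,Y]} + C_{[X,Y]}.
\end{aligned}
```

Subtracting the last line from the expansion of the commutator, and then subtracting $\widetilde{R}(X,Y) = [\widetilde{\nabla}_X, \widetilde{\nabla}_Y] - \widetilde{\nabla}_{[X,Y]}$, leaves exactly the right side of (4.42).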


In case $\widetilde{\nabla} = \nabla^0 \oplus \nabla^1$ on $E = E_0 \oplus E_1$, and $\nabla$ has the form (4.39), where $C_X$
exchanges $E_0$ and $E_1$, it follows that the operator in brackets $\{\ \}$ on the right
side of (4.42) exchanges sections of $E_0$ and $E_1$, while the last operator $[C_X, C_Y]$
leaves invariant the sections of $E_0$ and $E_1$. In such a case these two components
express respectively the Codazzi identity and Gauss' Theorema Egregium.


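We will expand these formulas, writing $R(X,Y) \in \operatorname{End}(E_0 \oplus E_1)$ in the block
matrix form


(4.43) $R = \begin{pmatrix} R_{00} & R_{01} \\ R_{10} & R_{11} \end{pmatrix}.$


Then Gauss' equations become


(4.44) $$\begin{aligned} (R_{00} - R_0)(X,Y)u &= II^1_X II^0_Y u - II^1_Y II^0_X u, \\ (R_{11} - R_1)(X,Y)u &= II^0_X II^1_Y u - II^0_Y II^1_X u, \end{aligned}$$


for a section $u$ of $E_0$ or $E_1$, respectively. Equivalently, if $v$ is also a section of $E_0$
or $E_1$, respectively,


(4.45) $$\begin{aligned} \langle(R_{00} - R_0)(X,Y)u, v\rangle &= \langle II^0_X u, II^0_Y v\rangle - \langle II^0_Y u, II^0_X v\rangle, \\ \langle(R_{11} - R_1)(X,Y)u, v\rangle &= \langle II^1_X u, II^1_Y v\rangle - \langle II^1_Y u, II^1_X v\rangle. \end{aligned}$$


The second part of (4.45) is also called the Ricci equation.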
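Codazzi's equations become

(4.46)  $R_{10}(X,Y)u = II^0_X \nabla^0_Y u - II^0_Y \nabla^0_X u - II^0_{[X,Y]} u + \nabla^1_X II^0_Y u - \nabla^1_Y II^0_X u,$
        $R_{01}(X,Y)u = II^1_X \nabla^1_Y u - II^1_Y \nabla^1_X u - II^1_{[X,Y]} u + \nabla^0_X II^1_Y u - \nabla^0_Y II^1_X u,$

for sections $u$ of $E_0$ and $E_1$, respectively. If we take the inner product of the first equation in (4.46) with a section $v$ of $E_1$, we get

(4.47)  $\langle R_{10}(X,Y)u, v\rangle = -\langle \nabla^0_Y u, II^1_X v\rangle + \langle \nabla^0_X u, II^1_Y v\rangle - \langle II^0_{[X,Y]} u, v\rangle$
        $\qquad + \langle II^0_X u, \nabla^1_Y v\rangle - \langle II^0_Y u, \nabla^1_X v\rangle + X\langle II^0_Y u, v\rangle - Y\langle II^0_X u, v\rangle,$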
using the metric property of $\nabla^0$ and $\nabla^1$, and the antisymmetry of (4.40). If we perform a similar calculation for the second part of (4.46), in light of the fact that $R_{10}(X,Y)^t = -R_{01}(X,Y)$, we see that these two parts are equivalent, so we need retain only one of them. Furthermore, we can rewrite the first equation in (4.46) as follows. Form a connection on $\mathrm{Hom}(TM \otimes E_0, E_1)$ via the connections $\nabla^j$ on $E_j$ and a Levi-Civita connection $\nabla^M$ on $TM$, via the natural derivation property, that is,

(4.48)  $(\tilde\nabla_X II^0)(Y,u) = \nabla^1_X II^0_Y u - II^0_Y \nabla^0_X u - II^0(\nabla^M_X Y, u).$

Then (4.46) is equivalent to

(4.49)  $R_{10}(X,Y)u = (\tilde\nabla_X II^0)(Y,u) - (\tilde\nabla_Y II^0)(X,u).$




One case of interest is when $E_1$ is the trivial bundle $E_1 = M \times \mathbb{R}$, with one-dimensional fiber. For example, $E_1$ could be the normal bundle of a codimension-one surface in $\mathbb{R}^n$. In this case, it is clear that both sides of the last half of (4.45) are tautologically zero, so Ricci's equation has no content in this case.

As a parenthetical comment, suppose $E$ is a trivial bundle $E = M \times \mathbb{R}^n$, with complementary subbundles $E_j$, having metric connections constructed as in (4.36) and (4.37), from the trivial connection $D$ on $E$, defined by componentwise differentiation, so

(4.50)  $\nabla^0_X u = P D_X u, \quad \nabla^1_X u = (I - P) D_X u,$

for sections of $E_0$ and $E_1$, respectively. There is the following alternative approach to curvature formulas. For $\tilde\nabla = \nabla^0 \oplus \nabla^1$, we have

(4.51)  $\tilde\nabla_X u = D_X u + (D_X P)(I - 2P)u.$

Note that with respect to a choice of basis of $\mathbb{R}^n$ as a global frame field on $M \times \mathbb{R}^n$, we have the connection 1-form (1.13) given by

(4.52)  $\Gamma = dP\,(I - 2P).$

Since $dP = dP\,P + P\,dP$, we have $dP\,P = (I - P)\,dP$. Thus, writing the connection 1-form as $\Gamma = P\,dP\,(I - P) - (I - P)\,dP\,P$ casts $\Gamma = -C$ in the form (4.40). We obtain directly from the formula $\Omega = d\Gamma + \Gamma \wedge \Gamma$, derived in (4.15), that the curvature of $\tilde\nabla$ is given by

(4.53)  $\Omega = dP \wedge dP = P\,dP \wedge dP\,P + (I - P)\,dP \wedge dP\,(I - P),$

the last identity showing the respective curvatures of $E_0$ and $E_1$. Compare with Exercise 5 of §1.
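The formula (4.51) can be checked by a short computation. The following worked derivation is a sketch that uses only $P^2 = P$, the product rule for $D_X$, and the definitions (4.50); the intermediate steps are filled in here and are not taken from the text.

```latex
\begin{align*}
\tilde\nabla_X u
  &= P\,D_X(Pu) + (I-P)\,D_X\big((I-P)u\big) \\
  &= D_X u + \big[P\,(D_XP) - (I-P)\,(D_XP)\big]u \\
  &= D_X u + (2P - I)\,(D_XP)\,u .
\end{align*}
% Differentiating P^2 = P gives (D_XP)P + P(D_XP) = D_XP, so that
% P(D_XP) = (D_XP)(I-P).  Consequently
\[
(2P - I)\,(D_XP) = 2\,(D_XP)(I-P) - D_XP = (D_XP)(I - 2P),
\]
% which is the coefficient appearing in (4.51).
```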

Our next goal is to invert the process above. That is, rather than starting with a flat bundle $E = M \times \mathbb{R}^n$ and obtaining connections on subbundles and second fundamental forms, we want to start with bundles $E_j \to M$, $j = 0, 1$, with metric connections $\nabla^j$, and proposed second fundamental forms $II^j$, sections of $\mathrm{Hom}(TM \otimes E_j, E_{j'})$, and then obtain a flat connection $\nabla$ on $E$ via (4.38)-(4.40). Of course, we assume $II^0$ and $II^1$ are related by (4.41), so (4.39) makes $\nabla$ a metric connection. Thus, according to equations (4.45) and (4.49), the connection $\nabla$ is flat if and only if, for all sections $u, v$ of $E_0$,

(4.54)  $(\tilde\nabla_X II^0)(Y,u) - (\tilde\nabla_Y II^0)(X,u) = 0,$
        $\langle R_0(X,Y)u, v\rangle = \langle II^0_Y u, II^0_X v\rangle - \langle II^0_X u, II^0_Y v\rangle,$

and, for all sections $u, v$ of $E_1$,

(4.55)  $\langle R_1(X,Y)u, v\rangle = \langle II^1_Y u, II^1_X v\rangle - \langle II^1_X u, II^1_Y v\rangle.$




If these conditions are satisfied, then $E$ will have a global frame field of sections $e_1, \dots, e_n$, such that $\nabla e_j = 0$, at least provided $M$ is simply connected. Then, for each $p \in M$, we have an isometric isomorphism

(4.56)  $J(p) : E_p \to \mathbb{R}^n$

by expanding elements of $E_p$ in terms of the basis $\{e_j(p)\}$. Thus $E_0 \subset E$ is carried by $J(p)$ to a family of linear subspaces $J(p)E_{0p} = V_p \subset \mathbb{R}^n$, with orthogonal complements $J(p)E_{1p} = N_p \subset \mathbb{R}^n$.

We now specialize to the case $E_0 = TM$, where $M$ is an $m$-dimensional Riemannian manifold, with its Levi-Civita connection; $E_1$ is an auxiliary bundle over $M$, with metric connection $\nabla^1$. We will assume $M$ is simply connected. The following result is sometimes called the fundamental theorem of surface theory.


Theorem 4.8. Let $II^0$ be a section of $\mathrm{Hom}(TM \otimes TM, E_1)$, and set $II^1_X = -(II^0_X)^t$. Make the symmetry hypothesis

(4.57)  $II^0_X Y = II^0_Y X.$

Assume the equations (4.54) and (4.55) are satisfied, producing a trivialization of $E = E_0 \oplus E_1$, described by (4.56). Then there is an isometric immersion

(4.58)  $X : M \to \mathbb{R}^n,$

and a natural identification of $E_1$ with the normal bundle of $S = X(M) \subset \mathbb{R}^n$, such that the second fundamental form of $S$ is given by $II^0$.

To get this, we will construct the map (4.58) so that

(4.59)  $DX(p) = J(p)\big|_{T_pM},$

for all $p \in M$. To see how to get this, consider one of the $n$ components of $J$, $J_\ell(p) : E_p \to \mathbb{R}$. In fact, $J_\ell u = \langle e_\ell, u\rangle$. Let $\beta_\ell(p) = J_\ell(p)\big|_{T_pM}$; thus $\beta_\ell$ is a 1-form on $M$.


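Lemma 4.9. Each $\beta_\ell$ is closed, that is, $d\beta_\ell = 0$.

Proof. For vector fields $X$ and $Y$ on $M$, we have

(4.60)  $d\beta_\ell(X,Y) = X \cdot \beta_\ell(Y) - Y \cdot \beta_\ell(X) - \beta_\ell([X,Y])$
        $\qquad = X \cdot \beta_\ell(Y) - Y \cdot \beta_\ell(X) - \beta_\ell(\nabla^0_X Y - \nabla^0_Y X).$

Using $\nabla_X = \nabla^0_X + II^0_X$ on sections of $E_0 = TM$, we see that this is equal to

$X \cdot J_\ell(Y) - Y \cdot J_\ell(X) - J_\ell(\nabla_X Y - \nabla_Y X) + J_\ell(II^0_X Y - II^0_Y X)$
$\qquad = (\nabla_X J_\ell)Y - (\nabla_Y J_\ell)X + J_\ell(II^0_X Y - II^0_Y X).$

By construction, $\nabla_X J_\ell = 0$, while (4.57) says $II^0_X Y - II^0_Y X = 0$. Thus $d\beta_\ell = 0$.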


Consequently, as long as $M$ is simply connected, we can write $\beta_\ell = dx_\ell$ for some functions $x_\ell \in C^\infty(M)$. Let us therefore define the map (4.58) by $X(p) = (x_1(p), \dots, x_n(p))$. Thus (4.59) holds, so $X$ is an isometric mapping. Furthermore, it is clear that $J(p)$ maps $E_{1p}$ precisely isometrically onto the normal space $N_p \subset \mathbb{R}^n$ to $S = X(M)$ at $X(p)$, displaying $II^0$ as the second fundamental form of $S$. Thus Theorem 4.8 is established.

Let us specialize Theorem 4.8 to the case where $\dim M = n - 1$, so the fibers of $E_1$ are one-dimensional. As mentioned above, the Ricci identity (4.55) has no content in that case. We have the following special case of the fundamental theorem of surface theory.


Proposition 4.10. Let $M$ be an $(n-1)$-dimensional Riemannian manifold; assume $M$ is simply connected. Let there be given a symmetric tensor field $II$, of type (0,2). Assume the following Gauss-Codazzi equations hold:

(4.61)  $\langle R^M(X,Y)Z, W\rangle = II(Y,Z)\,II(X,W) - II(X,Z)\,II(Y,W),$
        $(\nabla^M_X II)(Y,Z) - (\nabla^M_Y II)(X,Z) = 0,$

where $\nabla^M$ is the Levi-Civita connection of $M$ and $R^M$ is its Riemann curvature tensor. Then there is an isometric immersion $X : M \to \mathbb{R}^n$ such that the second fundamental form of $S = X(M) \subset \mathbb{R}^n$ is given by $II$.
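As an illustration of Proposition 4.10, the simplest nontrivial data satisfying (4.61) are those of the round unit sphere. The following worked example is a sketch under the assumption that $M = S^{n-1}$ carries its standard metric $g$ of constant sectional curvature 1, with $n \ge 3$ so that $M$ is simply connected; it is not taken from the text.

```latex
% Take II = g on the unit sphere M = S^{n-1}, n >= 3.
% Gauss equation: constant sectional curvature 1 means
\[
\langle R^M(X,Y)Z, W\rangle
  = g(Y,Z)\,g(X,W) - g(X,Z)\,g(Y,W)
  = II(Y,Z)\,II(X,W) - II(X,Z)\,II(Y,W).
\]
% Codazzi equation: the Levi-Civita connection is metric, so
\[
\nabla^M g = 0
\quad\Longrightarrow\quad
(\nabla^M_X II)(Y,Z) - (\nabla^M_Y II)(X,Z) = 0 .
\]
% Both parts of (4.61) hold, and the immersion produced by the
% proposition is, up to a rigid motion, the inclusion of the unit
% sphere in R^n, whose second fundamental form equals the metric for
% a suitable choice of unit normal.
```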


Exercises 


1. Let $S \subset M$, with respective Levi-Civita connections $\nabla^0$, $\nabla$, respective Riemann tensors $R_S$, $R$, and so on, as in the text. Let $\gamma_{s,t} : [a,b] \to S$ be a two-parameter family of curves. One can also regard $\gamma_{s,t} : [a,b] \to M$. Apply the formula (3.52) for the second variation of energy in these two contexts, and compare the results, to produce another proof of Gauss' formula (4.14) for $\langle (R - R_S)(X,Y)Z, W\rangle$ when $X, Y, Z, W$ are all tangent to $S$.

2. With the Ricci tensor Ric given by (3.23) and the sectional curvature $K_p(\Pi)$ by (4.32), show that, for $X \in T_pM$, of norm 1, if $\Xi$ denotes the orthogonal complement of $X$ in $T_pM$, then

$\mathrm{Ric}(X,X) = \dfrac{n-1}{\mathrm{vol}\, S_p(\Xi)} \int_{S_p(\Xi)} K_p(U,X)\, dV(U),$

where $S_p(\Xi)$ is the unit sphere in $\Xi$, $n = \dim M$, and $K_p(U,X) = K_p(\Pi)$, where $\Pi$ is the linear span of $U$ and $X$. Show that the scalar curvature at $p$ is given by

$S = \dfrac{n(n-1)}{\mathrm{vol}\, G_2} \int_{G_2} K_p(\Pi)\, dV(\Pi),$

where $G_2$ is the space of 2-planes in $T_pM$.

3. Let $\gamma$ be a curve in $\mathbb{R}^3$, parameterized by arc length. Recall the Frenet apparatus. At $p = \gamma(t)$, $T = \gamma'(t)$ spans $T_p\gamma$, and if the curvature $\kappa$ of $\gamma$ is nonzero, unit vectors $N$ and $B$ span the normal bundle $\nu_p(\gamma)$, satisfying the system of ODE

(4.62)  $T' = \kappa N, \quad N' = -\kappa T + \tau B, \quad B' = -\tau N,$

and furthermore, $B = T \times N$, $T = N \times B$, $N = B \times T$. Compare with Exercises 4-6 in Chap. 1, §5. Let $\nabla$ denote the standard flat connection on $\mathbb{R}^3$, and $\nabla^0$, $\nabla^1$ the connections induced on $T(\gamma)$ and $\nu(\gamma)$, as in (4.1), (4.8). Show that

(4.63)  $II(T,T) = \kappa N$

and that

(4.64)  $\nabla^1_T N = \tau B, \quad \nabla^1_T B = -\tau N.$

Compute the right side of the Weingarten formula

(4.65)  $\nabla_T - \nabla^1_T = -(II_T)^t,$

and show that (4.63)-(4.65) are equivalent to (4.62).

4. Let $S \subset \mathbb{R}^3$ be a surface, with connection $\nabla^S$, second fundamental form $II^S$, and unit normal $\nu$. Let $\gamma$ be the curve of Exercise 3, and suppose $\gamma$ is a curve in $S$. Show that

$II^S(T,T) = \kappa \langle N, \nu\rangle \nu = \kappa N - \nabla^S_T T.$

If $A_\nu$ denotes the Weingarten map of $S$, as in (4.5), show that

$A_\nu T = \kappa T - \tau B$ and $N = \nu$,

provided $\gamma$ is a geodesic on $S$.

5. Use Theorem 4.4 to show that the Gauss curvature $K$ of a surface $S \subset \mathbb{R}^3$ is equal to $\det A_\nu$. Use the symmetry of $A_\nu$ to show that each $T_pS$ has an orthonormal basis $T_1, T_2$ such that $A_\nu T_j = \kappa_j T_j$; hence $K = \kappa_1 \kappa_2$. An eigenvector of $A_\nu$ is called a direction of principal curvature. Show that $T \in T_pS$ is a direction of principal curvature if and only if the geodesic through $p$ in direction $T$ has vanishing torsion $\tau$ at $p$.

6. Suppose $M$ has the property that each sectional curvature $K_p(\Pi)$ is equal to $K_p$, independent of $\Pi$. Show that

$\mathcal{R} = K_p I$ in $\mathrm{End}(\Lambda^2 T_p)$,

where $\mathcal{R}$ is as in Exercise 4 of §3. Show that $K_p$ is constant, on each connected component of $M$, if $\dim M \ge 3$. (Hint: To do the last part, use Proposition 3.3.)

7. Show that the formula (4.42) for $R - \tilde R$ is equivalent to the formula (2.17). (This reiterates Exercise 5 of §2.) Also, relate (4.44) and (4.49) to (3.54).

Let $M$ be a compact, oriented hypersurface in $\mathbb{R}^n$. Let

$N : M \to S^{n-1}$

be given by the outward-pointing normal. This is called the Gauss map.

8. If $n = 3$, show that $N^*\omega_0 = K\omega$, where $\omega_0$ and $\omega$ are the area forms of $S^2$ and $M$, respectively, and $K$ is the Gauss curvature of $M$. Note that the degree of the Gauss map is

$\mathrm{Deg}(N) = \dfrac{1}{4\pi} \int_M N^*\omega_0.$

See §19 of Chap. 1 for basic material on degrees of maps.
9. For general $n$, show that $N^*\omega_0 = J\omega$, with

$J = (-1)^{n-1} \det A_N,$

where $\omega$ and $\omega_0$ are the volume forms and $A_N : T_pM \to T_pM$ is the Weingarten map (4.5). Consequently,

(4.66)  $\mathrm{Deg}(N) = \dfrac{(-1)^{n-1}}{A_{n-1}} \int_M (\det A_N)\, dV,$

where $A_{n-1}$ is the area of $S^{n-1}$. (Hint: There is a natural identification of $T_pM$ and $T_q(S^{n-1})$ as linear subspaces of $\mathbb{R}^n$, if $q = N(p)$. Show that the Weingarten formula gives

(4.67)  $DN(p) = -A_N \in \mathrm{End}(T_pM) \approx \mathcal{L}(T_pM, T_qS^{n-1}).$)


10. Let $S$ be a hypersurface in $\mathbb{R}^n$, with second fundamental form $II$, as in (4.15). Suppose $II$ is proportional to the metric tensor, $II = \lambda(x) g$. Show that $\lambda$ is constant, provided $S$ is connected. (Hint: Use the Codazzi equation (4.17), plus the fact that $\nabla^0 g = 0$.)

11. When $S$ is a hypersurface in $\mathbb{R}^n$, a point $p$, where $II = \lambda g$, is called an umbilic point. If every point on $S$ is umbilic, show that $S$ has constant sectional curvature $\lambda^2$. (Hint: Apply Gauss' Theorema Egregium, in the form (4.14).)

12. Let $S \subset \mathbb{R}^n$ be a $k$-dimensional submanifold ($k < n$), with induced metric $g$ and second fundamental form $II$. Let $\xi$ be a section of the normal bundle $\nu(S)$. Consider the one-parameter family of maps $S \to \mathbb{R}^n$,

(4.68)  $\varphi_\tau(x) = x + \tau\xi(x), \quad x \in S, \ \tau \in (-\varepsilon, \varepsilon).$

Let $g_\tau$ be the family of Riemannian metrics induced on $S$. Show that

(4.69)  $\dfrac{d}{d\tau} g_\tau(X,Y)\Big|_{\tau=0} = -2\langle \xi, II(X,Y)\rangle.$

More generally, if $S \subset M$ is a submanifold, consider the one-parameter family of submanifolds given by

(4.70)  $\varphi_\tau(x) = \mathrm{Exp}_x(\tau\xi(x)), \quad x \in S, \ \tau \in (-\varepsilon, \varepsilon),$

where $\mathrm{Exp}_x : T_xM \to M$ is the exponential map, determined by the Riemannian metric on $M$. Show that (4.69) holds in this more general case.

13. Let $M_1 \subset M_2 \subset M_3$ be Riemannian manifolds of dimension $n_1 < n_2 < n_3$, with induced metrics. For $j < k$, denote by $II^{jk}$ the second fundamental form of $M_j \subset M_k$ and by $A^{jk}$ the associated Weingarten map. For $x \in M_j$, denote by $N_x^{jk}$ the orthogonal complement of $T_xM_j$ in $T_xM_k$, and by ${}^{jk}\nabla^1$ the natural connection on $N^{jk}(M_j)$. Let $X$ and $Y$ be tangent to $M_1$, and let $\xi$ be a section of $N^{12}(M_1)$. Show that

$A^{12}_\xi X = A^{13}_\xi X.$

Also show that

${}^{13}\nabla^1_X \xi = {}^{12}\nabla^1_X \xi + II^{23}(X, \xi)$, orthogonal decomposition,

and that

$II^{13}(X,Y) = II^{12}(X,Y) + II^{23}(X,Y)$, orthogonal decomposition.

Relate this to Exercises 3-5 when $n_j = j$.
14. If $S \subset M$ has codimension 1 and Weingarten map $A : T_pS \to T_pS$, show that the Gauss equation (4.14) gives

(4.71)  $\langle (R - R_S)(X,Y)Z, W\rangle = \langle (\Lambda^2 A)(X \wedge Y), Z \wedge W\rangle, \quad X, Y, Z, W \in T_pS.$

Show that (with $N$ a unit normal to $S$) the scalar curvatures of $M$ and $S$ are related by

(4.72)  $S_M - S_S = -2\,\mathrm{Tr}\,\Lambda^2 A + 2\,\mathrm{Ric}_M(N, N).$


5. The Gauss—Bonnet theorem for surfaces 


If $M$ is a compact, oriented Riemannian manifold of dimension 2, the Gauss-Bonnet theorem says that

(5.1)  $\int_M K\, dV = 2\pi\, \chi(M),$

if $K$ is the Gauss curvature of $M$ and $\chi(M)$ its Euler characteristic. There is an associated formula if $M$ has a boundary. There are a number of significant variants of this, involving, for example, the index of a vector field. We present several proofs of the Gauss-Bonnet theorem and some of its variants here.

We begin with an estimate on the effect of parallel translation about a small closed, piecewise smooth curve. This first result holds for a general vector bundle $E \to M$ with connection $\nabla$ and curvature

$\Omega = \frac{1}{2} R^\alpha{}_{\beta jk}\, dx_j \wedge dx_k,$

with no restriction on $\dim M$.
Proposition 5.1. Let $\gamma$ be a closed, piecewise smooth loop on $M$. Assume it is parameterized by arc length for $0 \le t \le b$, $\gamma(b) = \gamma(0)$. If $u(t)$ is a section of $E$ over $\gamma$ defined by parallel transport (i.e., $\nabla_T u = 0$, $T = \dot\gamma$), then

(5.2)  $u^\alpha(b) - u^\alpha(0) = -\dfrac{1}{2} \sum_{j,k,\beta} R^\alpha{}_{\beta jk} \Big( \int_A dx_j \wedge dx_k \Big) u^\beta(0) + O(b^3),$

where $A$ is an oriented 2-surface in $M$ with $\partial A = \gamma$, and the $u^\alpha(t)$ are the components of $u$ with respect to a local frame.


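Proof. If we put a coordinate system on a neighborhood of $p = \gamma(0) \in M$ and choose a frame field for $E$, then parallel transport is defined by

(5.3)  $\dfrac{du^\alpha}{dt} = -\Gamma^\alpha{}_{\beta k}\, u^\beta\, \dfrac{dx_k}{dt}.$

As usual, we use the summation convention. Thus

(5.4)  $u^\alpha(t) = u^\alpha(0) - \int_0^t \Gamma^\alpha{}_{\beta k}(\gamma(s))\, u^\beta(s)\, \dfrac{dx_k}{ds}\, ds.$

We hence have

(5.5)  $u^\alpha(t) = u^\alpha(0) - \Gamma^\alpha{}_{\beta k}(p)\, u^\beta(0)\,(x_k - p_k) + O(t^2).$

We can solve (5.3) up to $O(t^3)$ if we use

(5.6)  $\Gamma^\alpha{}_{\beta j}(x) = \Gamma^\alpha{}_{\beta j}(p) + (x_k - p_k)\,\partial_k \Gamma^\alpha{}_{\beta j} + O(|x - p|^2).$

Hence, substituting (5.5) and (5.6) into (5.4),

(5.7)  $u^\alpha(t) = u^\alpha(0) - \int_0^t \big[\Gamma^\alpha{}_{\beta k}(p) + (x_j - p_j)\,\partial_j \Gamma^\alpha{}_{\beta k}\big]\,\big[u^\beta(0) - \Gamma^\beta{}_{\gamma \ell}(p)\, u^\gamma(0)\,(x_\ell - p_\ell)\big]\, \dfrac{dx_k}{ds}\, ds + O(t^3).$

If $\gamma(b) = \gamma(0)$, we get

(5.8)  $u^\alpha(b) = u^\alpha(0) - \int_0^b x_j\, dx_k\, (\partial_j \Gamma^\alpha{}_{\beta k})\, u^\beta(0) + \int_0^b x_j\, dx_k\, \Gamma^\alpha{}_{\beta k} \Gamma^\beta{}_{\gamma j}\, u^\gamma(0) + O(b^3),$

the components of $\Gamma$ and their first derivatives being evaluated at $p$. Now Stokes' theorem gives

$\int_\gamma x_j\, dx_k = \int_A dx_j \wedge dx_k,$

so

(5.9)  $u^\alpha(b) - u^\alpha(0) = \big[-\partial_j \Gamma^\alpha{}_{\beta k} + \Gamma^\alpha{}_{\gamma k} \Gamma^\gamma{}_{\beta j}\big] \int_A dx_j \wedge dx_k\; u^\beta(0) + O(b^3).$

Recall that the curvature is given by

$\Omega = d\Gamma + \Gamma \wedge \Gamma,$

that is,

(5.10)  $R^\alpha{}_{\beta jk} = \partial_j \Gamma^\alpha{}_{\beta k} - \partial_k \Gamma^\alpha{}_{\beta j} + \Gamma^\alpha{}_{\gamma j} \Gamma^\gamma{}_{\beta k} - \Gamma^\alpha{}_{\gamma k} \Gamma^\gamma{}_{\beta j}.$

Now the right side of (5.10) is, up to sign, the antisymmetrization, with respect to $j$ and $k$, of the quantity in brackets in (5.9). Since $\int_A dx_j \wedge dx_k$ is antisymmetric in $j$ and $k$, we get the desired formula (5.2).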


In particular, if $\dim M = 2$, then we can write the $\mathrm{End}(E)$-valued 2-form $\Omega$ as

(5.11)  $\Omega = \mathcal{R}\mu,$

where $\mu$ is the volume form on $M$ and $\mathcal{R}$ is a smooth section of $\mathrm{End}(E)$ over $M$. If $E$ has an inner product and $\nabla$ is a metric connection, then $\mathcal{R}$ is skew-adjoint. If $\gamma$ is a geodesic triangle that is "fat" in the sense that none of its angles is small, (5.2) implies

(5.12)  $u(b) - u(0) = -\mathcal{R}u(0)\,(\mathrm{Area}\ A) + O\big((\mathrm{Area}\ A)^{3/2}\big).$

If we specialize further, to oriented two-dimensional $M$ with $E = TM$, possessing the Levi-Civita connection of a Riemannian metric, then we take $J : T_pM \to T_pM$ to be the counterclockwise rotation by 90°, which defines an almost complex structure on $M$. Up to a scalar this is the unique skew-adjoint operator on $T_pM$, and, by (3.34),

(5.13)  $\mathcal{R}u = -K\, Ju, \quad u \in T_pM,$

where $K$ is the Gauss curvature of $M$ at $p$. Thus, in this case, (5.12) becomes




FIGURE 5.1 Parallel Transport Around a Triangle


(5.14)  $u(b) - u(0) = K\, Ju(0)\,(\mathrm{Area}\ A) + O\big((\mathrm{Area}\ A)^{3/2}\big).$

On the other hand, if a tangent vector $X_0 \in T_pM$ undergoes parallel transport around a geodesic triangle, the action produced on $T_pM$ is easily seen to be a rotation in $T_pM$ through an angle that depends on the angle defect of the triangle. The argument can be seen by looking at Fig. 5.1. We see that the angle from $X_0$ to $X_3$ is

(5.15)  $(\pi + \alpha) - (2\pi - \beta - \gamma - \xi) - \xi = \alpha + \beta + \gamma - \pi.$

In this case, formula (5.14) implies

(5.16)  $\alpha + \beta + \gamma - \pi = \int_A K\, dV + O\big((\mathrm{Area}\ A)^{3/2}\big).$

We can now use a simple analytical argument to sharpen this up to the following celebrated formula of Gauss.

Theorem 5.2. If $A$ is a geodesic triangle in $M^2$, with angles $\alpha$, $\beta$, and $\gamma$, then

(5.17)  $\alpha + \beta + \gamma - \pi = \int_A K\, dV.$




Proof. Break up the geodesic triangle $A$ into $N^2$ little geodesic triangles, each of diameter $O(N^{-1})$, area $O(N^{-2})$. Since the angle defects are additive, the estimate (5.16) implies

(5.18)  $\alpha + \beta + \gamma - \pi = \int_A K\, dV + N^2\, O(N^{-3}) = \int_A K\, dV + O(N^{-1}),$

and passing to the limit as $N \to \infty$ gives (5.17).
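Theorem 5.2 can be tested numerically on the unit sphere, where $K = 1$ and the right side of (5.17) is just the area of the triangle. The sketch below is an illustration, not part of the text: it computes the angle excess from the vertex angles and compares it with the area obtained independently from the Van Oosterom-Strackee solid-angle formula (a standard formula, assumed here). All function names are hypothetical.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def normalize(a):
    n = math.sqrt(dot(a, a))
    return tuple(x / n for x in a)

def vertex_angle(a, b, c):
    """Interior angle at vertex a of the geodesic triangle abc on the unit sphere."""
    # Directions of the two geodesic sides at a: project b and c
    # onto the tangent plane at a and normalize.
    tb = normalize(tuple(bi - dot(a, b) * ai for ai, bi in zip(a, b)))
    tc = normalize(tuple(ci - dot(a, c) * ai for ai, ci in zip(a, c)))
    return math.acos(max(-1.0, min(1.0, dot(tb, tc))))

def angle_excess(a, b, c):
    """Left side of (5.17): alpha + beta + gamma - pi."""
    return (vertex_angle(a, b, c) + vertex_angle(b, c, a)
            + vertex_angle(c, a, b) - math.pi)

def triangle_area(a, b, c):
    """Area of the spherical triangle via the solid-angle formula
    tan(area/2) = |a.(b x c)| / (1 + a.b + b.c + c.a)."""
    num = abs(dot(a, cross(b, c)))
    den = 1.0 + dot(a, b) + dot(b, c) + dot(c, a)
    return 2.0 * math.atan2(num, den)
```

For the octant triangle with vertices on the three coordinate axes, all three angles are right angles, so the excess is $\pi/2$, which is one eighth of the total area $4\pi$.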


Note that any region that is a contractible geodesic polygon can be divided into geodesic triangles. If a contractible region $\Omega \subset M$ with smooth boundary is approximated by geodesic polygons, a straightforward limit process yields the Gauss-Bonnet formula

(5.19)  $\int_\Omega K\, dV + \int_{\partial\Omega} \kappa\, ds = 2\pi,$

where $\kappa$ is the geodesic curvature of $\partial\Omega$. We leave the details to the reader. Another proof will be given at the end of this section.
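A quick consistency check of (5.19) is available on the unit sphere. The following worked example is a sketch that assumes two standard facts not derived in the text: a geodesic disc of radius $r$ on the unit sphere has area $2\pi(1 - \cos r)$, and its boundary circle has length $2\pi \sin r$ and geodesic curvature $\cot r$.

```latex
% Geodesic disc \Omega of radius r, 0 < r < \pi, on the unit sphere (K = 1):
\[
\int_\Omega K\,dV = \operatorname{Area}(\Omega) = 2\pi(1-\cos r),
\qquad
\int_{\partial\Omega}\kappa\,ds = (\cot r)\,(2\pi\sin r) = 2\pi\cos r .
\]
% Adding the two contributions gives
\[
\int_\Omega K\,dV + \int_{\partial\Omega}\kappa\,ds
  = 2\pi(1-\cos r) + 2\pi\cos r = 2\pi ,
\]
% in agreement with (5.19), for every admissible r.
```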

If $M$ is a compact, oriented two-dimensional manifold without boundary, we can partition $M$ into geodesic triangles. Suppose the triangulation of $M$ so produced has

(5.20)  $F$ faces (triangles), $E$ edges, $V$ vertices.

If the angles of the $j$th triangle are $\alpha_j$, $\beta_j$, and $\gamma_j$, then summing all the angles clearly produces $2\pi V$. On the other hand, (5.17) applied to the $j$th triangle, and summed over $j$, yields

(5.21)  $\sum_j (\alpha_j + \beta_j + \gamma_j) = \pi F + \int_M K\, dV.$

Hence $\int_M K\, dV = (2V - F)\pi$. Since in this case all the faces are triangles, counting each triangle three times will count each edge twice, so $3F = 2E$. Thus we obtain

(5.22)  $\int_M K\, dV = 2\pi(V - E + F).$

This is equivalent to (5.1), in view of Euler's formula

(5.23)  $\chi(M) = V - E + F.$
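The counting argument leading to (5.22) is easy to exercise on explicit triangulations. The sketch below is illustrative only: the two triangulations (the boundary of a tetrahedron for the sphere, and a 3 x 3 periodic grid for the torus) and all function names are assumptions made for the example.

```python
from itertools import combinations

def euler_data(triangles):
    """Return (V, E, F) for a closed surface given as a list of vertex triples."""
    faces = {frozenset(t) for t in triangles}
    edges = {frozenset(e) for t in triangles for e in combinations(t, 2)}
    vertices = {v for t in triangles for v in t}
    return len(vertices), len(edges), len(faces)

def euler_characteristic(triangles):
    v, e, f = euler_data(triangles)
    return v - e + f

# Sphere: boundary of a tetrahedron.
tetrahedron = [(0, 1, 2), (0, 1, 3), (0, 2, 3), (1, 2, 3)]

def torus_triangulation(n=3):
    """n x n grid with periodic identifications; each square is cut into two triangles.
    n = 3 is the smallest grid for which this is a genuine triangulation."""
    tris = []
    for i in range(n):
        for j in range(n):
            a = (i, j)
            b = ((i + 1) % n, j)
            c = (i, (j + 1) % n)
            d = ((i + 1) % n, (j + 1) % n)
            tris.append((a, b, d))
            tris.append((a, d, c))
    return tris
```

In both cases one can confirm the relation $3F = 2E$ used in the text, and then $2\pi(V - E + F)$ gives the value of $\int_M K\, dV$ predicted by (5.22): $4\pi$ for the sphere and $0$ for the torus.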




FIGURE 5.2 Surface with Handles 


We now derive a variant of (5.1) when $M$ is described in another fashion. Namely, suppose $M$ is diffeomorphic to a sphere with $g$ handles attached. The number $g$ is called the genus of the surface. The case $g = 2$ is illustrated in Fig. 5.2. We claim that

(5.24)  $\int_M K\, dV = 4\pi(1 - g)$

in this case. By virtue of (5.22), this is equivalent to the identity

(5.25)  $2 - 2g = V - E + F = \chi(M).$

Direct proofs of this are possible, but we will provide a proof of (5.24), based on the fact that

(5.26)  $\int_M K\, dV = C(M)$

depends only on $M$, not on the metric imposed. This follows from (5.22), by forgetting the interpretation of the right side. The point we want to make is, given (5.26), that is, the independence of the choice of metric, we can work out what $C(M)$ is, as follows.

First, choosing the standard metric on $S^2$, for which $K = 1$ and the area is $4\pi$, we have

(5.27)  $\int_{S^2} K\, dV = 4\pi.$


Now suppose $M$ is obtained by adding $g$ handles to $S^2$. Since we can alter the metric on $M$ at will, we can make sure it coincides with the metric of a sphere near a great circle, in a neighborhood of each circle where a handle is attached to the main body $A$, as illustrated in Fig. 5.2. If we imagine adding two hemispherical caps to each handle $H_j$, rather than attaching it to $A$, we turn each $H_j$ into a new sphere, so by (5.27) we have

(5.28)  $4\pi = \int_{H_j \cup \text{caps}} K\, dV = \int_{H_j} K\, dV + \int_{\text{caps}} K\, dV.$

Since the caps fit together to form a sphere, we have $\int_{\text{caps}} K\, dV = 4\pi$, so for each $j$,

(5.29)  $\int_{H_j} K\, dV = 0,$

provided $M$ has a metric such as described above. Similarly, if we add $2g$ caps to the main body $A$, we get a new sphere, so

(5.30)  $4\pi = \int_{A \cup \text{caps}} K\, dV = \int_A K\, dV + 2g(2\pi),$

or

(5.31)  $\int_A K\, dV = 2\pi(2 - 2g).$

Together (5.29) and (5.31) yield (5.24), and we get the identity (5.25) as a bonus.
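The metric independence (5.26) and the value (5.24) can be checked by direct quadrature in the two lowest genera. The sketch below assumes standard formulas that are not derived in the text: for the spheroid $x^2/a^2 + y^2/a^2 + z^2/c^2 = 1$, parameterized by colatitude $\varphi$, one has $K = c^2/W^2$ and area density $a \sin\varphi\, \sqrt{W}$ with $W = a^2\cos^2\varphi + c^2\sin^2\varphi$; for the torus of revolution with radii $R > r$, $K = \cos v / (r(R + r\cos v))$ and area density $r(R + r\cos v)$. Function names are hypothetical.

```python
import math

def total_curvature_spheroid(a, c, n=20000):
    """Midpoint-rule approximation of the integral of K dV over a spheroid (genus 0)."""
    h = math.pi / n
    total = 0.0
    for i in range(n):
        phi = (i + 0.5) * h
        w = (a * math.cos(phi)) ** 2 + (c * math.sin(phi)) ** 2
        gauss_curvature = c * c / (w * w)
        area_density = a * math.sin(phi) * math.sqrt(w)
        total += gauss_curvature * area_density * h
    return 2.0 * math.pi * total   # the integrand does not depend on longitude

def total_curvature_torus(big_r, small_r, n=20000):
    """Midpoint-rule approximation of the integral of K dV over a torus (genus 1)."""
    h = 2.0 * math.pi / n
    total = 0.0
    for i in range(n):
        v = (i + 0.5) * h
        gauss_curvature = math.cos(v) / (small_r * (big_r + small_r * math.cos(v)))
        area_density = small_r * (big_r + small_r * math.cos(v))
        total += gauss_curvature * area_density * h
    return 2.0 * math.pi * total
```

Changing the semi-axes of the spheroid changes the metric but not the integral, which stays at $4\pi = 4\pi(1 - 0)$, while the torus gives $0 = 4\pi(1 - 1)$, as (5.24) predicts.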

We now give another perspective on Gauss' formula, directly dealing with the fact that $TM$ can be treated as a complex line bundle, when $M$ is an oriented Riemannian manifold of dimension 2. We will produce a variant of Proposition 5.1 which has no remainder term and which hence produces (5.16) with no remainder, directly, so Theorem 5.2 follows without the additional argument given above. The result is the following; again $\dim M$ is unrestricted.

Proposition 5.3. Let $E \to M$ be a complex line bundle. Let $\gamma$ be a piecewise smooth, closed loop in $M$, with $\gamma(0) = \gamma(b) = p$, bounding a surface $A$. Let $\nabla$ be a connection on $E$, with curvature $\Omega$. If $u(t)$ is a section of $E$ over $\gamma$ defined by parallel translation, then

(5.32)  $u(b) = \Big[\exp\Big(-\int_A \Omega\Big)\Big] u(0).$

Proof. Pick a nonvanishing section (hence a frame field) $\xi$ of $E$ over $S$, assuming $S$ is homeomorphic to a disc. Any section $u$ of $E$ over $S$ is of the form $u = v\xi$ for a complex-valued function $v$ on $S$. Then parallel transport along $\gamma(t) = (x_1(t), \dots, x_n(t))$ is defined by


(5.33)  $\dfrac{dv}{dt} = -\Big(\Gamma_k \dfrac{dx_k}{dt}\Big) v.$

The solution to this single, first-order ODE is

(5.34)  $v(t) = \Big[\exp\Big(-\int_0^t \Gamma_k(\gamma(s))\, \dfrac{dx_k}{ds}\, ds\Big)\Big] v(0).$

Hence

(5.35)  $v(b) = \Big[\exp\Big(-\int_\gamma \Gamma\Big)\Big] v(0),$

where $\Gamma = \sum_k \Gamma_k\, dx_k$. The curvature 2-form $\Omega$ is given, as a special case of (5.10), by

(5.36)  $\Omega = d\Gamma,$

and Stokes' theorem gives (5.32), from (5.35), provided $A$ is contractible. The general case follows from cutting $A$ into contractible pieces.

As we have mentioned, Proposition 5.3 can be used in place of Proposition 5.1, in conjunction with the argument involving Fig. 5.1, to prove Theorem 5.2.
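The holonomy statement (5.32) can be observed numerically on the unit sphere, where $\Omega = -iK\,dV$ with $K = 1$, so parallel translation around a loop should rotate tangent vectors counterclockwise by the enclosed area. The sketch below makes several assumptions for illustration: the loop is the circle of points at geodesic distance $r$ from the north pole, which encloses area $2\pi(1 - \cos r)$; parallel transport on $S^2 \subset \mathbb{R}^3$ is written as the ODE $u' = -\langle u, \gamma'\rangle \gamma$; and the ODE is integrated with a classical Runge-Kutta scheme. All names are hypothetical.

```python
import math

def curve(t, r):
    """Circle of points at geodesic distance r from the north pole of the unit sphere."""
    return (math.sin(r) * math.cos(t), math.sin(r) * math.sin(t), math.cos(r))

def velocity(t, r):
    return (-math.sin(r) * math.sin(t), math.sin(r) * math.cos(t), 0.0)

def rhs(t, u, r):
    # For a tangent field u along the curve, vanishing of the tangential part
    # of u' is equivalent to u' = -<u, curve'> curve.
    s = -sum(ui * vi for ui, vi in zip(u, velocity(t, r)))
    return tuple(s * xi for xi in curve(t, r))

def step(u, k, h):
    return tuple(ui + h * ki for ui, ki in zip(u, k))

def holonomy_angle(r, steps=2000):
    """Rotation angle in (-pi, pi], counterclockwise with respect to the outward normal."""
    h = 2.0 * math.pi / steps
    u0 = (0.0, 1.0, 0.0)          # tangent to the sphere at curve(0, r)
    u, t = u0, 0.0
    for _ in range(steps):        # classical fourth-order Runge-Kutta
        k1 = rhs(t, u, r)
        k2 = rhs(t + h / 2, step(u, k1, h / 2), r)
        k3 = rhs(t + h / 2, step(u, k2, h / 2), r)
        k4 = rhs(t + h, step(u, k3, h), r)
        u = tuple(ui + h / 6 * (a + 2 * b + 2 * c + d)
                  for ui, a, b, c, d in zip(u, k1, k2, k3, k4))
        t += h
    normal = curve(0.0, r)
    cr = (u0[1] * u[2] - u0[2] * u[1],
          u0[2] * u[0] - u0[0] * u[2],
          u0[0] * u[1] - u0[1] * u[0])
    sin_part = sum(ni * ci for ni, ci in zip(normal, cr))
    cos_part = sum(a * b for a, b in zip(u0, u))
    return math.atan2(sin_part, cos_part)
```

For $r = 1$ the computed angle agrees with $2\pi(1 - \cos 1) \approx 2.888$, the area of the enclosed cap; for other values of $r$ the comparison has to be made modulo $2\pi$, since `atan2` only returns the angle in $(-\pi, \pi]$.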

Next, we relate $\int_M \Omega$ to the "index" of a section of a complex line bundle $E \to M$, when $M$ is a compact, oriented manifold of dimension 2. Suppose $X$ is a section of $E$ over $M \setminus S$, where $S$ consists of a finite number of points; suppose that $X$ is nowhere vanishing on $M \setminus S$ and that, near each $p_j \in S$, $X$ has the following form. There are a coordinate neighborhood $\mathcal{O}_j$ centered at $p_j$, with $p_j$ the origin, and a nonvanishing section $\xi_j$ of $E$ near $p_j$, such that

(5.37)  $X = v_j \xi_j$ on $\mathcal{O}_j$, $\quad v_j : \mathcal{O}_j \setminus p_j \to \mathbb{C} \setminus 0.$

Taking a small counterclockwise circle $\gamma_j$ about $p_j$, $v_j/|v_j| = \omega_j$ maps $\gamma_j$ to $S^1$; consider the degree $\ell_j$ of this map, that is, the winding number of $\gamma_j$ about $S^1$. This is the index of $X$ at $p_j$, and the sum over $p_j$ is the total index of $X$:

(5.38)  $\mathrm{Index}(X) = \sum_j \ell_j.$
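The local index $\ell_j$ in (5.38) is a winding number, which can be computed by accumulating phase increments of $v_j$ around a small circle. The following sketch is illustrative: it works in a single coordinate chart identified with $\mathbb{C}$, takes $p_j$ to be the origin, and uses hypothetical sample functions for $v_j$.

```python
import cmath
import math

def winding_number(v, radius=0.1, samples=2000):
    """Winding number about 0 of v restricted to the counterclockwise circle |z| = radius.

    v must be nonvanishing on that circle; samples must be large enough that
    the phase of v changes by less than pi between consecutive sample points.
    """
    total = 0.0
    prev = v(complex(radius, 0.0))
    for i in range(1, samples + 1):
        z = cmath.rect(radius, 2.0 * math.pi * i / samples)
        cur = v(z)
        total += cmath.phase(cur / prev)   # phase increment in (-pi, pi]
        prev = cur
    return round(total / (2.0 * math.pi))
```

For example, $v(z) = z$ has index 1 at the origin, $v(z) = z^2$ has index 2, and $v(z) = \bar z$ has index $-1$.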


We will establish the following. 


Proposition 5.4. For any connection on $E \to M$, with curvature form $\Omega$ and $X$ as above, we have

(5.39)  $\int_M \Omega = -(2\pi i) \cdot \mathrm{Index}(X).$

Proof. We can replace $X$ by a section of $E \setminus 0$ over $M \setminus \{p_j\}$, homotopic to the original, having the form (5.37) with


(5.40)  $v_j = e^{i\ell_j\theta} + w_j$


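in polar coordinates $(r, \theta)$ about $p_j$, with $w_j \in C^1(\mathcal{O}_j)$, $w_j(0) = 0$. Excise small disks $D_j$ containing $p_j$; let $D = \cup D_j$. Then, by Stokes' theorem,

(5.41)  $\int_{M \setminus D} \Omega = -\sum_j \int_{\gamma_j} \Gamma,$

where $\gamma_j = \partial D_j$ and $\Gamma$ is the connection 1-form with respect to the section $X$, so that, with $\nabla_k = \nabla_{D_k}$, $D_k = \partial/\partial x_k$, in local coordinates,

(5.42)  $\nabla_k X = \Gamma_k X.$

Now (5.37) gives (with no summation)

(5.43)  $\nabla_k(v_j \xi_j) = (\partial_k v_j + v_j \tilde\Gamma_{jk})\, \xi_j$

on $D_j$, where $\tilde\Gamma_{jk}\, dx_k$ is the connection 1-form with respect to the section $\xi_j$. Hence

(5.44)  $\Gamma_k = v_j^{-1} \partial_k v_j + \tilde\Gamma_{jk},$

with remainder term $\tilde\Gamma_{jk} \in C^1(\mathcal{O}_j)$. By (5.40), we have

(5.45)  $\int_{\gamma_j} \Gamma = 2\pi i\, \ell_j + O(r)$

if each $D_j$ has radius $\le Cr$. Passing to the limit as the disks $D_j$ shrink to $p_j$ gives (5.39).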


Since the left side of (5.39) is independent of the choice of $X$, it follows that the index of $X$ depends only on $E$, not on the choice of such $X$. In Chap. 10, this formula is applied to a meromorphic section of a complex line bundle, and in conjunction with the Riemann-Roch formula it yields important information on Riemann surface theory.

In case $M$ is a compact, oriented Riemannian 2-manifold, whose tangent bundle can be given the structure of a complex line bundle as noted above, (5.39) is equivalent to

(5.46)  $\int_M K\, dV = 2\pi\, \mathrm{Index}(X),$

for any smooth vector field $X$, nonvanishing, on $M$ minus a finite set of points. This verifies the identity

(5.47)  $\mathrm{Index}(X) = \chi(M)$

in this case.

As a further comment on the Gauss-Bonnet formula for compact surfaces, let us recall from Exercise 8 of §4 that if $M$ is a compact, oriented surface in $\mathbb{R}^3$, with Gauss map $N : M \to S^2$, then

(5.48)  $\mathrm{Deg}(N) = \dfrac{1}{4\pi} \int_M N^*\omega_0 = \dfrac{1}{4\pi} \int_M K\,\omega.$

Furthermore, in §20 of Chap. 1, Corollary 20.5 yields an independent proof that, in this case,

(5.49)  $\mathrm{Deg}(N) = \dfrac{1}{2}\, \mathrm{Index}(X),$

for any vector field $X$ on $M$ with a finite number of critical points. Hence (5.48)-(5.49) provide another proof of (5.1), at least for a surface in $\mathbb{R}^3$. This line of reasoning will be extended to the higher-dimensional case of hypersurfaces of $\mathbb{R}^{n+1}$, in the early part of §8, as preparation for establishing the general Chern-Gauss-Bonnet theorem.

To end this section, we provide a direct proof of the formula (5.19), using an argument parallel to the proof of Proposition 5.3. Thus, assuming that $M$ is an oriented surface, we give $TM$ the structure of a complex line bundle, and we pick a nonvanishing section $\xi$ of $TM$ over a neighborhood of $\overline{\mathcal{O}}$, where $\mathcal{O}$ denotes the contractible region called $\Omega$ in (5.19). Let $\gamma = \partial\mathcal{O}$ be parameterized by arc length, $T = \gamma'(s)$, $0 \le s \le b$, with $\gamma(b) = \gamma(0)$. The geodesic curvature $\kappa$ of $\gamma$, appearing in (5.19), is given by

(5.50)  $\nabla_T T = \kappa N, \quad N = JT.$

If we set $T = u\xi$, where $u : \gamma \to \mathbb{C}$, then, parallel to (5.33), we have (5.50) equivalent to

(5.51)  $\dfrac{du}{dt} = -\Big(\Gamma_k \dfrac{dx_k}{dt}\Big) u + i\kappa u.$

The solution to this single, first-order ODE is (parallel to (5.34))

(5.52)  $u(t) = \Big[\exp\Big(i \int_0^t \kappa(s)\, ds - \int_0^t \Gamma_k(\gamma(s))\, \dfrac{dx_k}{ds}\, ds\Big)\Big] u(0).$

Hence

(5.53)  $u(b) = \Big[\exp\Big(i \int_\gamma \kappa(s)\, ds - \int_{\mathcal{O}} \Omega\Big)\Big] u(0).$

By (5.13), we have

(5.54)  $\Omega = -iK\, dV,$

and since $u(b) = u(0)$, we have

(5.55)  $\exp\Big(i \int_\gamma \kappa(s)\, ds + i \int_{\mathcal{O}} K\, dV\Big) = 1,$

or

(5.56)  $\int_{\mathcal{O}} K\, dV + \int_\gamma \kappa(s)\, ds = 2\pi\nu,$

for some $\nu \in \mathbb{Z}$. Now if $\mathcal{O}$ were a tiny disc in $M$, it would be clear that $\nu = 1$. Using the contractibility of $\mathcal{O}$ and the fact that the left side of (5.56) cannot jump, we have $\nu = 1$, which proves (5.19).


Exercises 


1. Given a triangulation of a compact surface $M$, within each triangle construct a vector field, vanishing at seven points as illustrated in Fig. 5.3, with the vertices as attractors, the center as a repeller, and the midpoints of each side as saddle points. Fit these together to produce a smooth vector field $X$ on $M$. Show directly that

$\mathrm{Index}(X) = V - E + F.$


2. Let $L \to M$ be a complex line bundle, and let $u$ and $v$ be sections of $L$ with a finite number of zeros. Show directly that $u$ and $v$ have the same index. (Hint: Start with $u = fv$ on $M \setminus Z$, where $Z$ is the union of the zero sets and $f : M \setminus Z \to \mathbb{C} \setminus 0$.)

3. Let $M_1$ and $M_2$ be $n$-dimensional submanifolds of $\mathbb{R}^k$. Suppose a curve $\gamma$ is contained in the intersection $M_1 \cap M_2$, and assume

$p = \gamma(s) \Longrightarrow T_pM_1 = T_pM_2.$

Show that parallel translations along $\gamma$ in $M_1$ and in $M_2$ coincide. (Hint: If $T = \gamma'(s)$ and $X$ is a vector field along $\gamma$, tangent to $M_1$ (hence to $M_2$), show that $\nabla^{M_1}_T X = \nabla^{M_2}_T X$, using Corollary 4.2.)

4. Let $\mathcal{O}$ be the region in $S^2 \subset \mathbb{R}^3$ consisting of points in $S^2$ of geodesic distance $< r$ from $p = (0,0,1)$, where $r \in (0, \pi)$ is given. Let $\gamma = \partial\mathcal{O}$. Construct a cone, with vertex at $(0, 0, \sec r)$, tangent to $S^2$ along $\gamma$. Using this and Exercise 3, show that parallel translation over one circuit of $\gamma$ is given by

counterclockwise rotation by $\theta = 2\pi(1 - \cos r)$.

(Hint: Flatten out the cone, as in Fig. 5.4. Notice that $\gamma$ has length $\ell = 2\pi \sin r$.) Compare this calculation with the result of (5.32), which in this context implies

$u(\ell) = \Big[\exp\Big(i \int_{\mathcal{O}} K\, dV\Big)\Big] u(0).$


5. Let $\gamma : [a,b] \to \mathbb{R}^3$ be a smooth, closed curve, so $\gamma(a) = \gamma(b)$ and $\gamma'(a) = \gamma'(b)$. Assume $\gamma$ is parameterized by arc length, so $\gamma'(t) = T(t)$ and $T : [a,b] \to S^2$; hence $T$ is a smooth, closed curve in $S^2$. Note that the normal space to $\gamma$ at $p = \gamma(t)$ is naturally identified with the tangent space to $S^2$ at $T(t) = q$:

$\nu_p(\gamma) = T_qS^2.$


FIGURE 5.3 Vector Field on a Triangulation 


FIGURE 5.4 Cone Tangent to a Sphere (labels in the figure: slant length $\tan r$; vectors $u(0)$ and $u(\ell)$; $(2\pi - \theta)\tan r = \ell = 2\pi \sin r$)


a) Show that parallel translation along $\gamma$ of a section of the normal bundle $\nu(\gamma)$, with respect to the connection described in Exercise 3 of §4, coincides with parallel translation along the curve $T$ of vectors tangent to $S^2$. (Hint: Recall Exercise 3 of §1.)

b) Suppose the curvature $\kappa$ of $\gamma$ never vanishes, so the torsion $\tau$ is well defined, as in (4.62). Show that parallel translation once around $\gamma$ acts on $\nu_p(\gamma)$ by multiplication by

$\exp\Big(-i \int_\gamma \tau(s)\, ds\Big).$

Here we use the complex structure on $\nu_p(\gamma)$ given by $JN = B$, $JB = -N$. (Hint: Use (4.64).) Compare the results of parts a and b.


6. The principal bundle picture 


An important tool for understanding vector bundles is the notion of an underlying structure, namely that of a principal bundle. If $M$ is a manifold and $G$ a Lie group, then a principal $G$-bundle $P \xrightarrow{\,p\,} M$ is a locally trivial fibration with a $G$-action on $P$ such that $G$ acts on each fiber $P_x = p^{-1}(x)$ in a simply transitive fashion. An example is the frame bundle of an oriented Riemannian manifold $M$, $F(M) \to M$, where $F_x(M)$ consists of the set of ordered oriented orthonormal bases of the tangent space $T_x$ to $M$ at $x$. If $n = \dim M$, this is a principal $SO(n)$-bundle.


If $P \to M$ is a principal $G$-bundle, then associated to each representation $\pi$ of $G$ on a vector space $V$ is a vector bundle $E \to M$. The set $E$ is a quotient space of the Cartesian product $P \times V$, under the equivalence relation

(6.1)  $(y, v) \sim (y \cdot g, \pi(g)^{-1} v), \quad g \in G.$

We have written the $G$-action on $P$ as a right action. One writes $E = P \times_\pi V$. The space of sections of $E$ is naturally isomorphic to a certain subspace of the space of $V$-valued functions on $P$:

(6.2)  $C^\infty(M, E) \approx \{u \in C^\infty(P, V) : u(y \cdot g) = \pi(g)^{-1} u(y),\ g \in G\}.$


We describe how this construction works for the frame bundle $F(M)$ of an oriented Riemannian manifold, which, as mentioned above, is a principal $SO(n)$-bundle. Thus, a point $y \in F_x(M)$ consists of an $n$-tuple $(e_1, \dots, e_n)$, forming an ordered, oriented orthonormal basis of $T_xM$. If $g = (g_{jk}) \in SO(n)$, the $G$-action is given by

(6.3)  $(e_1, \dots, e_n) \cdot g = (f_1, \dots, f_n), \quad f_j = \sum_\ell g_{\ell j}\, e_\ell.$




One can check that $(f_1, \dots, f_n)$ is also an oriented orthonormal basis of $T_xM$ and that $(y \cdot g) \cdot g' = y \cdot (gg')$, for $g, g' \in SO(n)$. If $\pi$ is the "standard" representation of $SO(n)$ on $\mathbb{R}^n$, given by matrix multiplication, we claim that there is a natural identification

(6.4)  $F(M) \times_\pi \mathbb{R}^n \approx TM.$

In fact, if $y = (e_1, \dots, e_n) \in F_x(M)$ and $v = (v_1, \dots, v_n) \in \mathbb{R}^n$, the map (6.4) is defined by

(6.5)  $(y, v) \mapsto \sum_j v_j e_j \in T_xM.$

We need to show that this is constant on equivalence classes, as defined by (6.1), that is, for any $g \in SO(n)$,

(6.6)  $z = y \cdot g = (f_1, \dots, f_n),\ w = \pi(g)^{-1} v \ \Longrightarrow\ \sum_k w_k f_k = \sum_j v_j e_j.$

In fact, setting $g^{-1} = h = (h_{jk})$, we see that

(6.7)  $\sum_k w_k f_k = \sum_{j,k,\ell} h_{kj}\, v_j\, g_{\ell k}\, e_\ell = \sum_{j,\ell} \delta_{\ell j}\, v_j\, e_\ell,$

since

(6.8)  $\sum_k g_{\ell k} h_{kj} = \delta_{\ell j},$

and this implies (6.6).
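The invariance (6.6) and the right-action property of (6.3) can be checked numerically. The sketch below is an illustration under simplifying assumptions: the tangent space $T_xM$ is modeled by $\mathbb{R}^3$ with its standard inner product, a frame is a list of three orthonormal vectors, and elements of $SO(3)$ are built from elementary rotations. All names are hypothetical.

```python
import math

def rot_z(t):
    c, s = math.cos(t), math.sin(t)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def rot_x(t):
    c, s = math.cos(t), math.sin(t)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(a):
    return [list(col) for col in zip(*a)]

def matvec(a, v):
    return [sum(a[i][j] * v[j] for j in range(len(v))) for i in range(len(a))]

def act(frame, g):
    """Right action (6.3): the j-th new frame vector is f_j = sum_l g[l][j] * e_l."""
    n = len(frame)
    return [[sum(g[l][j] * frame[l][i] for l in range(n)) for i in range(n)]
            for j in range(n)]

def combine(frame, v):
    """The map (6.5): (y, v) -> sum_j v_j e_j."""
    n = len(frame)
    return [sum(v[j] * frame[j][i] for j in range(n)) for i in range(n)]

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for x, y in zip(a, b))

# An orthonormal frame (rows of a rotation matrix) and two group elements.
frame = matmul(rot_x(0.3), rot_z(0.4))
g = matmul(rot_z(0.7), rot_x(-1.2))
g2 = rot_z(2.1)
v = [1.0, -2.0, 0.5]
w = matvec(transpose(g), v)      # pi(g)^{-1} v, since g^{-1} = g^T in SO(n)
```

With these definitions, `combine(act(frame, g), w)` and `combine(frame, v)` give the same vector, which is the content of (6.6), and `act(act(frame, g), g2)` agrees with `act(frame, matmul(g, g2))`, which is the statement $(y \cdot g) \cdot g' = y \cdot (gg')$.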

Connections are naturally described in terms of a geometrical structure on a 
principal bundle. This should be expected since, as we saw in §1, a connection 
on a vector bundle can be described in terms of a “connection 1-form” (1.14), 
depending on a choice of local frame for the vector bundle. 

The geometrical structure giving a connection on a principal bundle $P \to M$ is the following. For each $y \in P$, the tangent space $T_yP$ contains the subspace $V_yP$ of vectors tangent to the fiber $p^{-1}(x)$, $x = p(y)$. The space $V_yP$, called the "vertical space," is naturally isomorphic to the Lie algebra $\mathfrak{g}$ of $G$. A connection on $P$ is determined by a choice of complementary subspace, called a "horizontal space":

(6.9)  $T_yP = V_yP \oplus H_yP,$

with the $G$-invariance

(6.10)  $g_*(H_yP) = H_{y \cdot g}P,$




where $g_* : T_yP \to T_{y\cdot g}P$ is the natural derivative map.

Given this structure, a vector field $X$ on $M$ has a uniquely defined "lift" $\tilde X$ to
a vector field on $P$, such that $p_*\tilde X_y = X_x$ ($x = p(y)$) and $\tilde X_y \in H_yP$ for each
$y \in P$. Furthermore, if $E$ is a vector bundle determined by a representation of $G$
and $u \in C^\infty(P,V)$ corresponds to a section $v$ of $E$, the $V$-valued function $\tilde X\cdot u$
corresponds to a section of $E$, which we denote $\nabla_X v$; $\nabla$ is the covariant
derivative on $E$ defined by the connection on $P$ just described. If $V$ has an inner product
and $\pi$ is unitary, $E$ gets a natural metric and $\nabla$ is a metric connection on $E$.

If the $\pi_j$ are representations of $G$ on $V_j$, giving vector bundles $E_j \to M$
associated to a principal bundle $P \to M$ with connection, then $\pi_1 \otimes \pi_2$ is a
representation of $G$ on $V_1 \otimes V_2$, and we have a vector bundle $E \to M$,
$E = E_1 \otimes E_2$. The prescription above associating a connection to $E$ as well as to $E_1$
and $E_2$ agrees with the definition in (1.29) of a connection on a tensor product of
two vector bundles. This follows simply from the derivation property of the vector
field $\tilde X$, acting as a first-order differential operator on functions on $P$.

The characterization (6.9)–(6.10) of a connection on a principal bundle $P \to M$
is equivalent to the following, in view of the natural isomorphism $V_yP \approx \mathfrak{g}$. The
splitting (6.9) corresponds to a projection of $T_yP$ onto $V_yP$, hence to a linear
map $T_yP \to \mathfrak{g}$, which gives the identification $V_yP \approx \mathfrak{g}$ on the linear subspace
$V_yP$ of $T_yP$. This map can be regarded as a $\mathfrak{g}$-valued 1-form $\xi$ on $P$, called the
connection form, and the invariance property (6.10) is equivalent to


(6.11)  $g^*\xi = \mathrm{Ad}_{g^{-1}}\,\xi, \quad g \in G,$


where $g^*\xi$ denotes the pull-back of the form $\xi$, induced from the $G$-action on $P$.

The Levi–Civita connection on an oriented Riemannian manifold gives rise
to a connection on the frame bundle $F(M) \to M$ in the following way. Fix
$y \in F(M)$, $x = p(y)$. Recall that the point $y$ is an ordered (oriented) orthonormal
basis $(e_1,\dots,e_n)$ of the tangent space $T_xM$. The parallel transport of each $e_j$
along a curve $\gamma$ through $x$ thus gives a family of orthonormal bases for the tangent
space to $M$ at $\gamma(t)$, hence a curve $\gamma^\#$ in $F(M)$ lying over $\gamma$. The tangent to $\gamma^\#$
at $y$ belongs to the horizontal space $H_yF(M)$, which in fact consists of all such
tangent vectors as the curve $\gamma$ through $x$ is varied. This construction generalizes
to other vector bundles $E \to M$ with connection $\nabla$. One can use the bundle of
orthonormal frames for $E$ if $\nabla$ is a metric connection, or the bundle of general
frames for a general connection.

Let us restate how a connection on a principal bundle gives rise to connections
on associated vector bundles. Given a principal $G$-bundle $P \to M$, consider a
local section $\sigma$ of $P$, over $U \subset M$. If we have a representation $\pi$ of $G$ on $V$, the
associated vector bundle $E \to M$, and a section $u$ of $E$, then we have
$u\circ\sigma : U \to V$, using the identification (6.2). Given a connection on $P$, with connection
1-form $\xi$, we can characterize the covariant derivative induced on sections of $E$ by


(6.12)  $(\nabla_X u)\circ\sigma = \mathcal{L}_X(u\circ\sigma) + \Gamma(X)\, u\circ\sigma,$




where $\mathcal{L}_X$ acts componentwise on $u\circ\sigma$, and

(6.13)  $\Gamma(X) = (d\pi)\bigl(\xi_y(\hat X)\bigr), \quad y = \sigma(x), \quad \hat X = D\sigma(x)X,$


$d\pi$ denoting the derived representation of $\mathfrak{g}$ on $V$. That (6.12) agrees with
$(\mathcal{L}_{\tilde X}u)\circ\sigma$ follows from the chain rule, the fact that $\hat X - \tilde X$ is vertical, and the
fact that if $v \in T_yP$ is vertical, then, by (6.2), $\mathcal{L}_v u = -d\pi(\xi(v))u$. Note the
similarity of (6.12) to (1.7).

Recall the curvature $R(X,Y)$ of a connection $\nabla$ on a vector bundle $E \to M$,
defined by the formula (1.10). In case $E = P \times_\pi V$, and $\nabla u$ is defined as above,
we have (using the identification in (6.2))


(6.14)  $R(X,Y)u = \mathcal{L}_{[\tilde X,\tilde Y]}u - \mathcal{L}_{\widetilde{[X,Y]}}u.$


Alternatively, using (6.12) and (6.13), we see that the curvature of $\nabla$ is given by


(6.15)  $R(X,Y)u\circ\sigma = \bigl\{\mathcal{L}_X\Gamma(Y) - \mathcal{L}_Y\Gamma(X) + [\Gamma(X),\Gamma(Y)] - \Gamma([X,Y])\bigr\}\, u\circ\sigma.$


This is similar to (1.13). Next we want to obtain a formula similar to (but more 
fundamental than) (1.15). 

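Fix $y \in P$, $x = p(y)$. It is convenient to calculate (6.15) at $x$ by picking the
local section $\sigma$ to have the property that


(6.16)  $D\sigma(x) : T_xM \longrightarrow H_yP,$

which is easily arranged. Then $\hat X = \tilde X$ at $y$, so $\Gamma(X) = 0$ at $y$. Hence, at $x$,


(6.17)  $\begin{aligned} R(X,Y)u\circ\sigma &= \bigl\{\mathcal{L}_X\Gamma(Y) - \mathcal{L}_Y\Gamma(X)\bigr\}\, u\circ\sigma \\ &= (d\pi)\bigl\{X\cdot\xi(\hat Y) - Y\cdot\xi(\hat X)\bigr\}\, u\circ\sigma \\ &= (d\pi)\bigl\{(d\sigma^*\xi)(X,Y) + (\sigma^*\xi)([X,Y])\bigr\}\, u\circ\sigma. \end{aligned}$


Of course, $\sigma^*\xi = 0$ at $x$. Thus we see that

(6.18)  $R(X,Y)u = (d\pi)\bigl\{(d\xi)(\tilde X,\tilde Y)\bigr\}u,$

at $y$, and hence everywhere on $P$. In other words,

(6.19)  $R(X,Y) = (d\pi)\bigl(\Omega(\tilde X,\tilde Y)\bigr),$

where $\Omega$ is the $\mathfrak{g}$-valued 2-form on $P$ defined by


(6.20)  $\Omega(X^\#,Y^\#) = (d\xi)(\kappa X^\#, \kappa Y^\#),$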




for $X^\#, Y^\# \in T_yP$. Here, $\kappa$ is the projection of $T_yP$ onto $H_yP$, with respect to
the splitting (6.9). One calls $\Omega$ the curvature 2-form of the connection $\xi$ on $P$.
If $V$ and $W$ are smooth vector fields on $P$, then


(6.21)  $(d\xi)(V,W) = V\cdot\xi(W) - W\cdot\xi(V) - \xi([V,W]).$


In particular, if $V = \tilde X$, $W = \tilde Y$ are horizontal vector fields on $P$, then since
$\xi(\tilde X) = \xi(\tilde Y) = 0$, we have


(6.22)  $(d\xi)(\tilde X,\tilde Y) = -\xi([\tilde X,\tilde Y]).$

Hence, given $X^\#, Y^\# \in T_yP$, we have

(6.23)  $\Omega(X^\#,Y^\#) = -\xi([\tilde X,\tilde Y]),$


where $\tilde X$ and $\tilde Y$ are any horizontal vector fields on $P$ such that $\tilde X = \kappa X^\#$ and
$\tilde Y = \kappa Y^\#$ at $y \in P$. Since $\xi$ annihilates $[\tilde X,\tilde Y]$ if and only if it is horizontal, we
see that $\Omega$ measures the failure of the bundle of horizontal spaces to be involutive.

It follows from Frobenius's theorem that, if $\Omega = 0$ on $P$, there is an integral
manifold $S \subset P$ such that, for each $y \in S$, $T_yS = H_yP$. Each translate $S\cdot g$ is
also an integral manifold. We can use this family of integral manifolds to construct
local sections $v_1,\dots,v_K$ of $E$ ($K = \dim V$), linearly independent at each point,
such that $\nabla v_j = 0$ for all $j$, given that $\Omega = 0$. Thus we recover Proposition 1.2,
in this setting.

The following important result is Cartan’s formula for the curvature 2-form. 


Theorem 6.1. We have

(6.24)  $\Omega = d\xi + \tfrac{1}{2}[\xi,\xi].$


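The bracket $[\xi,\eta]$ of $\mathfrak{g}$-valued 1-forms is defined as follows. Suppose, in local
coordinates,


(6.25)  $\xi = \sum_j \xi_j\, dx_j, \quad \eta = \sum_k \eta_k\, dx_k, \quad \xi_j, \eta_k \in \mathfrak{g}.$

Then we set

(6.26)  $[\xi,\eta] = \sum_{j,k} [\xi_j,\eta_k]\, dx_j \wedge dx_k = \sum_{j<k} \bigl([\xi_j,\eta_k] + [\eta_j,\xi_k]\bigr)\, dx_j \wedge dx_k,$


which is a $\mathfrak{g}$-valued 2-form. Equivalently, if $U$ and $V$ are vector fields on $P$,

(6.27)  $[\xi,\eta](U,V) = [\xi(U),\eta(V)] + [\eta(U),\xi(V)].$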


In particular, 




(6.28)  $\tfrac{1}{2}[\xi,\xi](U,V) = [\xi(U),\xi(V)].$


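Note that if $\pi$ is a representation of $G$ on a vector space $V$ and $d\pi$ the derived
representation of $\mathfrak{g}$ on $V$, if we set $A_j = d\pi(\xi_j)$, then, for


(6.29)  $d\pi(\xi) = \alpha = \sum_j A_j\, dx_j,$

we have


(6.30)  $\alpha \wedge \alpha = \sum_{j,k} A_jA_k\, dx_j \wedge dx_k = \tfrac{1}{2}\sum_{j,k} (A_jA_k - A_kA_j)\, dx_j \wedge dx_k.$


Hence


(6.31)  $\alpha \wedge \alpha = \tfrac{1}{2}\, d\pi[\xi,\xi].$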


Thus we see the parallel between (6.24) and (1.15). 

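To prove (6.24), one evaluates each side on $(X^\#,Y^\#)$, for $X^\#, Y^\# \in T_yP$.
We write $X^\# = \tilde X + X_v$, with $\tilde X \in H_yP$, $X_v \in V_yP$, and similarly write
$Y^\# = \tilde Y + Y_v$. It suffices to check the following four cases:


(6.32)  $(\tilde X,\tilde Y), \quad (\tilde X,Y_v), \quad (X_v,\tilde Y), \quad (X_v,Y_v).$

Without loss of generality, one can assume that $\tilde X$ and $\tilde Y$ are horizontal lifts of
vector fields on $M$ and that $\xi(X_v)$ and $\xi(Y_v)$ are constant $\mathfrak{g}$-valued functions on
$P$. By (6.20) and (6.28), we have


(6.33)  $\Omega(\tilde X,\tilde Y) = d\xi(\tilde X,\tilde Y), \quad \tfrac{1}{2}[\xi,\xi](\tilde X,\tilde Y) = [\xi(\tilde X),\xi(\tilde Y)] = 0,$


so (6.24) holds in this case. Next, clearly


(6.34)  $\Omega(\tilde X,Y_v) = 0, \quad [\xi(\tilde X),\xi(Y_v)] = 0,$

while

(6.35)  $d\xi(\tilde X,Y_v) = \tilde X\cdot\xi(Y_v) - Y_v\cdot\xi(\tilde X) - \xi([\tilde X,Y_v]).$


Now, having arranged that $\xi(Y_v)$ be a constant $\mathfrak{g}$-valued function on $P$, we have
that $\tilde X\cdot\xi(Y_v) = 0$. Of course, $Y_v\cdot\xi(\tilde X) = 0$. Also, $[\tilde X,Y_v] = -\mathcal{L}_{Y_v}\tilde X$ is
horizontal, by (6.10), so $\xi([\tilde X,Y_v]) = 0$. This verifies (6.24) when both sides act
on $(\tilde X,Y_v)$, and similarly we have (6.24) when both sides act on $(X_v,\tilde Y)$. We
consider the final case. Clearly,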



(6.36)  $\Omega(X_v,Y_v) = 0,$

while

(6.37)  $d\xi(X_v,Y_v) = X_v\cdot\xi(Y_v) - Y_v\cdot\xi(X_v) - \xi([X_v,Y_v]) = -\xi([X_v,Y_v])$


and


(6.38)  $\tfrac{1}{2}[\xi,\xi](X_v,Y_v) = [\xi(X_v),\xi(Y_v)] = \xi([X_v,Y_v]),$


so (6.24) is verified in this last case, and Theorem 6.1 is proved. 
We next obtain a form of the Bianchi identity that will play an important role 
in the next section. Compare with (1.40) and (2.13). 


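Proposition 6.2. We have

(6.39)  $d\Omega = [\Omega,\xi].$


Here, if $\Omega = \sum \Omega_{jk}\, dx_j \wedge dx_k$ in local coordinates, we set


(6.40)  $\begin{aligned} [\Omega,\xi] &= \sum_{j,k,\ell} [\Omega_{jk},\xi_\ell]\, dx_j \wedge dx_k \wedge dx_\ell \\ &= -\sum_{j,k,\ell} [\xi_\ell,\Omega_{jk}]\, dx_\ell \wedge dx_j \wedge dx_k = -[\xi,\Omega]. \end{aligned}$


To get (6.39), apply $d$ to (6.24), obtaining (since $dd\xi = 0$)


(6.41)  $d\Omega = \tfrac{1}{2}[d\xi,\xi] - \tfrac{1}{2}[\xi,d\xi] = [d\xi,\xi],$

which differs from $[\Omega,\xi]$ by $(1/2)[[\xi,\xi],\xi]$. We have


(6.42)  $[[\xi,\xi],\xi] = \sum_{j,k,\ell} [[\xi_j,\xi_k],\xi_\ell]\, dx_j \wedge dx_k \wedge dx_\ell.$


Now cyclic permutations of $(j,k,\ell)$ leave $dx_j \wedge dx_k \wedge dx_\ell$ invariant, so we can
replace $[[\xi_j,\xi_k],\xi_\ell]$ in (6.42) by the average over cyclic permutations of $(j,k,\ell)$.
However, Jacobi's identity for a Lie algebra is


$[[\xi_j,\xi_k],\xi_\ell] + [[\xi_k,\xi_\ell],\xi_j] + [[\xi_\ell,\xi_j],\xi_k] = 0,$


so $[[\xi,\xi],\xi] = 0$, and we have (6.39).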
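
As a small sanity check of the Jacobi identity used in this last step, here is a sketch (not from the text) that verifies the cyclic sum on a basis of $so(3)$, realized as $3\times 3$ skew-symmetric integer matrices. The basis names `E1`, `E2`, `E3` and the helper functions are illustrative assumptions.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def bracket(a, b):
    """Matrix commutator [a, b] = ab - ba."""
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]

def add(*ms):
    n = len(ms[0])
    return [[sum(m[i][j] for m in ms) for j in range(n)] for i in range(n)]

# Infinitesimal rotations about the three coordinate axes: a basis of so(3).
E1 = [[0, 0, 0], [0, 0, -1], [0, 1, 0]]
E2 = [[0, 0, 1], [0, 0, 0], [-1, 0, 0]]
E3 = [[0, -1, 0], [1, 0, 0], [0, 0, 0]]
basis = [E1, E2, E3]
ZERO = [[0] * 3 for _ in range(3)]

def jacobi_sum(a, b, c):
    """[[a,b],c] + [[b,c],a] + [[c,a],b], the cyclic sum in Jacobi's identity."""
    return add(bracket(bracket(a, b), c),
               bracket(bracket(b, c), a),
               bracket(bracket(c, a), b))

all_vanish = all(jacobi_sum(a, b, c) == ZERO
                 for a in basis for b in basis for c in basis)
```

Since the cyclic sum vanishes for every triple of coefficients, averaging the coefficient in (6.42) over cyclic permutations gives zero, which is how the identity enters the proof.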




Exercises 


1. Let $P \to M$ be a principal $G$-bundle with connection, where $M$ is a Riemannian
manifold. Pick an inner product on $\mathfrak{g}$. For $y \in P$, define an inner product on
$T_yP = V_yP \oplus H_yP$ so that if $Z \in T_yP$ has decomposition $Z = Z_v + Z_h$, then


$\|Z\|^2 = \|\xi(Z_v)\|^2 + \|Dp(y)Z_h\|^2.$


Show that this is a $G$-invariant Riemannian metric on $P$.

2. Conversely, if $P \to M$ is a principal $G$-bundle, and if $P$ has a $G$-invariant Riemannian
metric, show that this determines a connection on $P$, by declaring that, for each $y \in P$,
$H_yP$ is the orthogonal complement of $V_yP$.

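3. A choice of section $\sigma$ of $P$ over an open set $U \subset M$ produces an isomorphism


(6.43)  $j_\sigma : C^\infty(U,E) \xrightarrow{\ \approx\ } C^\infty(U,V).$


If $\tilde\sigma$ is another section, there is a smooth function $g : U \to G$ such that


(6.44)  $\tilde\sigma(x) = \sigma(x)\cdot g(x), \quad \forall\, x \in U.$

Show that

(6.45)  $j_{\tilde\sigma}\, j_\sigma^{-1} v(x) = \pi(g(x))^{-1} v(x).$


4. According to (6.12), if $u \in C^\infty(U,E)$ and $v = j_\sigma u$, $\tilde v = j_{\tilde\sigma} u$, we have

(6.46)  $(\nabla_X u)\circ\sigma = X\cdot v + \Gamma(X)v, \quad (\nabla_X u)\circ\tilde\sigma = X\cdot\tilde v + \tilde\Gamma(X)\tilde v.$


Show that


(6.47)  $\tilde\Gamma(X) = \pi(g(x))^{-1}\,\Gamma(X)\,\pi(g(x)) + d\pi\bigl(D\lambda_{g(x)}(g(x)) \circ Dg(x)X\bigr),$


where $Dg(x)X \in T_{g(x)}G$, $\lambda_g(h) = g^{-1}h$, $D\lambda_g(g) : T_gG \to T_eG \approx \mathfrak{g}$.
Compare with (1.41). (Hint: Make use of (6.11), plus the identity
$(d\pi)(\mathrm{Ad}_{g^{-1}}A) = \pi(g)^{-1}\, d\pi(A)\, \pi(g)$, $A \in \mathfrak{g}$.)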

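5. Show that, for $X, Y$ vector fields on $M$, $\Omega(\tilde X,\tilde Y)$ satisfies


(6.48)  $\Omega(\tilde X,\tilde Y)(y\cdot g) = \mathrm{Ad}(g)^{-1}\, \Omega(\tilde X,\tilde Y)(y).$

Deduce that setting

(6.49)  $\Omega^b(X,Y) = \Omega(\tilde X,\tilde Y)$

defines $\Omega^b$ as a section of $\Lambda^2T^* \otimes (\mathrm{Ad}\,P)$, where $\mathrm{Ad}\,P$ is the vector bundle

(6.50)  $\mathrm{Ad}\,P = P \times_{\mathrm{Ad}} \mathfrak{g}.$


6. If $\xi_0$ and $\xi_1$ are connection 1-forms on $P \to M$, show that $t\xi_1 + (1-t)\xi_0$ is also, for
any $t \in \mathbb{R}$. (Hint: If $P_0$ and $P_1$ are projections, show that $tP_1 + (1-t)P_0$ is also a
projection, provided that $P_0$ and $P_1$ have the same range.)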

7. Let $\xi_0$ and $\xi_1$ be two connection 1-forms for $P \to M$, and let $\nabla$ be an arbitrary third
connection on $P$. Consider


(6.51)  $\alpha = \xi_1 - \xi_0.$


If $X$ is a vector field on $M$ and $\tilde X$ the horizontal lift determined by $\nabla$, show that


(6.52)  $\alpha^b(X) = \alpha(\tilde X)$


defines $\alpha^b$ as an element of $C^\infty(M, \Lambda^1T^* \otimes \mathrm{Ad}\,P)$. Show that $\alpha^b$ is independent of
the choice of $\nabla$.

8. In the setting of Exercise 7, if $\Omega_j$ are the curvatures of the connection 1-forms $\xi_j$, show
that


(6.53)  $\Omega_1 - \Omega_0 = d\alpha + [\alpha,\xi_0] + \tfrac{1}{2}[\alpha,\alpha].$


Compare with (2.17) and (3.62). If $d^\nabla\alpha^b$ is the $(\mathrm{Ad}\,P)$-valued 2-form defined as in §2,
via the connection $\xi_0$, relate $d^\nabla\alpha^b$ to $d\alpha + [\alpha,\xi_0]$.


7. The Chern—Weil construction 


Let $P \to M$ be a principal $G$-bundle, endowed with a connection, as in §6. Let
$\Omega$ be its curvature form, a $\mathfrak{g}$-valued 2-form on $P$; equivalently, there is the
$\mathrm{Ad}\,P$-valued 2-form $\Omega^b$ on $M$. The Chern–Weil construction gives closed differential
forms on $M$, whose cohomology classes are independent of the choice of
connection on $P$. These "characteristic classes" are described as follows.

A function $f : \mathfrak{g} \to \mathbb{C}$ is called "invariant" if


(7.1)  $f(\mathrm{Ad}(g)X) = f(X), \quad X \in \mathfrak{g},\ g \in G.$


Denote by $\mathcal{I}_k$ the set of polynomials $p : \mathfrak{g} \to \mathbb{C}$ which are invariant and
homogeneous of degree $k$. If $p \in \mathcal{I}_k$, there is associated an Ad-invariant $k$-linear function
$P$ on $\mathfrak{g}$, called the polarization of $p$, given by


(7.2)  $P(Y_1,\dots,Y_k) = \frac{1}{k!}\, \frac{\partial^k}{\partial t_1 \cdots \partial t_k}\, p(t_1Y_1 + \cdots + t_kY_k),$


such that $p(X) = P(X,\dots,X)$. Into the entries of $P$ we can plug copies of $\Omega$,
or of $\Omega^b$, to get $2k$-forms


(7.3)  $p(\Omega) = P(\Omega,\dots,\Omega) \in \Lambda^{2k}P$

and

(7.4)  $p(\Omega^b) = P(\Omega^b,\dots,\Omega^b) \in \Lambda^{2k}M.$
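
To make the polarization (7.2) concrete, here is a minimal sketch (not from the text) for $k = 2$, taking as an assumed example $p = \det$ on $2\times 2$ matrices, which is invariant under conjugation and homogeneous of degree 2. For $k = 2$, formula (7.2) reduces to $P(X,Y) = \tfrac12\,[p(X+Y) - p(X) - p(Y)]$; the names `polarize`, `conj` and the sample matrices are illustrative.

```python
from fractions import Fraction

def det2(x):
    """p(X) = det X, an invariant polynomial of degree 2 on 2x2 matrices."""
    return x[0][0] * x[1][1] - x[0][1] * x[1][0]

def add(x, y):
    return [[x[i][j] + y[i][j] for j in range(2)] for i in range(2)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def polarize(p, x, y):
    """Degree-2 case of (7.2): P(X, Y) = (p(X+Y) - p(X) - p(Y)) / 2."""
    return Fraction(p(add(x, y)) - p(x) - p(y), 2)

X = [[1, 2], [3, 4]]
Y = [[0, 1], [5, -2]]

# A sample invertible g and its inverse, giving Ad(g)X = g X g^{-1}.
g = [[1, 1], [0, 1]]
g_inv = [[1, -1], [0, 1]]

def conj(x):
    return matmul(matmul(g, x), g_inv)

diagonal_value = polarize(det2, X, X)              # should reproduce p(X)
before = polarize(det2, X, Y)
after = polarize(det2, conj(X), conj(Y))           # Ad-invariance of P
```

The comparison of `before` and `after` illustrates that the polarization inherits Ad-invariance from $p$, and `diagonal_value` illustrates $p(X) = P(X,X)$.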


Note that if $\pi : P \to M$ is the projection, then


(7.5)  $p(\Omega) = \pi^* p(\Omega^b);$


we say $p(\Omega)$, a form on $P$, is "basic," namely, the pull-back of a form on $M$. The
following two propositions summarize the major basic results about these forms.

Proposition 7.1. For any connection $\nabla$ on $P \to M$, $p \in \mathcal{I}_k$, the forms $p(\Omega)$ and
$p(\Omega^b)$ are closed. Hence $p(\Omega^b)$ represents a deRham cohomology class


(7.6)  $[p(\Omega^b)] \in H^{2k}(M,\mathbb{C}).$


If $q \in \mathcal{I}_j$, then $pq \in \mathcal{I}_{j+k}$ and $(pq)(\Omega) = p(\Omega) \wedge q(\Omega)$. Furthermore, if
$f : N \to M$ is smooth and $\tilde\nabla$ the connection on $f^*P$ pulled back from $\nabla$ on $P$, which has
curvature $\tilde\Omega = f^*\Omega$, then


(7.7)  $p(\tilde\Omega^b) = f^* p(\Omega^b).$

Proposition 7.2. The cohomology class (7.6) is independent of the connection on
$P$, so it depends only on the bundle.


The map $\mathcal{I}_* \to H^{2*}(M,\mathbb{C})$ is called the Chern–Weil homomorphism. We
first prove that $d\,p(\Omega) = 0$ on $P$, the rest of Proposition 7.1 being fairly
straightforward. If we differentiate with respect to $t$ at $t = 0$ the identity


(7.8)  $P(\mathrm{Ad}(\mathrm{Exp}\, tY)X,\dots,\mathrm{Ad}(\mathrm{Exp}\, tY)X) = p(X),$

we get

(7.9)  $\sum P(X,\dots,[Y,X],\dots,X) = 0.$


Into this we can substitute the curvature form $\Omega$ for $X$ and the connection form
$\xi$ for $Y$, to get


(7.10)  $\sum P(\Omega,\dots,[\xi,\Omega],\dots,\Omega) = 0.$


Now the Bianchi identity $d\Omega = -[\xi,\Omega]$ obtained in (6.39)–(6.40) shows that (7.10) is
equivalent to $d\,p(\Omega) = 0$ on $P$. Since $\pi^* : \Lambda^jM \to \Lambda^jP$ is injective and (7.5)
holds, we also have $d\,p(\Omega^b) = 0$ on $M$, and Proposition 7.1 is proved.
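
The infinitesimal invariance identity obtained by differentiating (7.8) can be tested on matrices. The sketch below (not from the text) assumes the example $p(X) = \mathrm{Tr}\,X^2$ on $3\times 3$ matrices, with polarization $P(X_1,X_2) = \mathrm{Tr}(X_1X_2)$, and checks that the sum over slots with $[Y,X]$ inserted vanishes; the sample matrices and function names are illustrative.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def bracket(a, b):
    """Matrix commutator [a, b]."""
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

def P2(a, b):
    """Polarization of p(X) = Tr X^2, namely P(X1, X2) = Tr(X1 X2)."""
    return trace(matmul(a, b))

X = [[1, 2, 0], [-1, 3, 4], [5, 0, -2]]
Y = [[0, 1, 7], [2, -3, 1], [4, 0, 6]]

YX = bracket(Y, X)

# Sum over the two slots of P with [Y, X] inserted, as in the k = 2 case.
slot_sum_quadratic = P2(YX, X) + P2(X, YX)

# For p(X) = Tr X^3 every slot contributes Tr([Y,X] X^2) by cyclicity of the trace,
# so the slot sum is three times this quantity.
slot_sum_cubic = 3 * trace(matmul(YX, matmul(X, X)))
```

Both slot sums are exactly zero in integer arithmetic, reflecting the invariance of traces of powers under conjugation.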

The proof of Proposition 7.2 is conveniently established via the following 
result, which also has further uses. 


Lemma 7.3. Let $\xi_0$ and $\xi_1$ be any $\mathfrak{g}$-valued 1-forms on $P$ (or any manifold). Set
$\alpha = \xi_1 - \xi_0$, $\xi_t = \xi_0 + t\alpha$, and $\Omega_t = d\xi_t + (1/2)[\xi_t,\xi_t]$. Given $p \in \mathcal{I}_k$, we have


(7.11)  $p(\Omega_1) - p(\Omega_0) = k\, d\Bigl[\int_0^1 P(\alpha,\Omega_t,\dots,\Omega_t)\, dt\Bigr].$


Proof. Since $(d/dt)\Omega_t = d\alpha + [\xi_t,\alpha]$, we have


(7.12)  $\frac{d}{dt}\, p(\Omega_t) = k\, P(d\alpha + [\xi_t,\alpha], \Omega_t,\dots,\Omega_t).$

It suffices to prove that the right side of (7.12) equals $k\, dP(\alpha,\Omega_t,\dots,\Omega_t)$. This
follows by the "Bianchi" identity $d\Omega_t = -[\xi_t,\Omega_t]$ and the same sort of arguments
used in the proof of Proposition 7.1. Instead of (7.8), one starts with


$P(\mathrm{Ad}(\mathrm{Exp}\, tY)Z, \mathrm{Ad}(\mathrm{Exp}\, tY)X,\dots,\mathrm{Ad}(\mathrm{Exp}\, tY)X) = P(Z,X,\dots,X).$


To apply this to Proposition 7.2, let $\xi_0$ and $\xi_1$ be the connection forms
associated to two connections on $P \to M$, so $\Omega_0$ and $\Omega_1$ are their curvature forms.
Note that each $\xi_t$ defines a connection form on $P$, with curvature form $\Omega_t$.
Furthermore, $\alpha = \xi_1 - \xi_0$, acting on $X^\# \in T_yP$, depends only on $\pi_*X^\# \in T_xM$
and gives rise to an $\mathrm{Ad}\,P$-valued 1-form $\alpha^b$ on $M$. Thus the right side of (7.11) is
the pull-back via $\pi^*$ of the exact $2k$-form


(7.13)  $k\, d\Bigl[\int_0^1 P(\alpha^b,\Omega_t^b,\dots,\Omega_t^b)\, dt\Bigr]$


on $M$, which yields Proposition 7.2.

We can also apply Lemma 7.3 to $\xi_1 = \xi$, a connection 1-form, and $\xi_0 = 0$.
Then $\xi_t = t\xi$; denote $d\xi_t + (1/2)[\xi_t,\xi_t]$ by $\Phi_t$. We have the $(2k-1)$-form on $P$
called the transgressed form:


(7.14)  $Tp(\Omega) = k \int_0^1 P(\xi,\Phi_t,\dots,\Phi_t)\, dt,$

with

(7.15)  $\Phi_t = t\, d\xi + \tfrac{1}{2}\, t^2 [\xi,\xi].$


Then Lemma 7.3 gives

(7.16)  $p(\Omega) = d\, Tp(\Omega);$


that is, $p(\Omega)$ is an exact form on $P$, not merely a closed form. On the other hand, as
opposed to $p(\Omega)$ itself, $Tp(\Omega)$ is not necessarily a basic form, that is, the pull-back
of a form on $M$. In fact, $p(\Omega^b)$ is not necessarily an exact form on $M$; typically
it determines a nontrivial cohomology class on $M$. Transgressed forms play an
important role in Chern–Weil theory.
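
As an illustration not taken from the text, the $t$-integral in (7.14) can be carried out explicitly when $k = 2$. Using only the bilinearity of $P$ and the expression (7.15) for $\Phi_t$, one obtains the following sketch of the computation:

```latex
\begin{aligned}
Tp(\Omega)
  &= 2\int_0^1 P\bigl(\xi,\ t\,d\xi + \tfrac12 t^2[\xi,\xi]\bigr)\,dt \\
  &= 2\Bigl(\int_0^1 t\,dt\Bigr) P(\xi,d\xi)
     + \Bigl(\int_0^1 t^2\,dt\Bigr) P(\xi,[\xi,\xi]) \\
  &= P(\xi,d\xi) + \tfrac13\,P(\xi,[\xi,\xi]).
\end{aligned}
```

For a matrix group with $P(A,B) = \mathrm{Tr}(AB)$, and with $\tfrac12[\xi,\xi]$ corresponding to $\alpha\wedge\alpha$ as in (6.31), this is the familiar expression $\mathrm{Tr}(\alpha\wedge d\alpha + \tfrac23\,\alpha\wedge\alpha\wedge\alpha)$, usually called the Chern–Simons form.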

The Levi–Civita connection on an oriented Riemannian manifold of dimension
2 can be equated with a connection on the associated principal $S^1$-bundle.
If we identify $S^1$ with the unit circle in $\mathbb{C}$, its Lie algebra is naturally identified
with $i\mathbb{R}$, and this identification provides an element of $\mathcal{I}_1$, unique up to a constant
multiple. This is of course a constant times the product of the Gauss curvature
and the volume form, and the invariance of Proposition 7.2 recovers the
independence (5.26) of the integrated curvature from the metric used on a Riemannian
manifold of dimension 2. More generally, for any complex line bundle $L$ over
$M$, a manifold of any dimension, $L$ can be associated to a principal $S^1$-bundle,
and the Chern–Weil construction produces the class $[\Omega] \in H^2(M,\mathbb{C})$. The class
$c_1(L) = -(1/2\pi i)[\Omega] \in H^2(M,\mathbb{C})$ is called the first Chern class of the line
bundle $L$. In this case, the connection 1-form on $P$ can be identified with an ordinary
(complex-valued) 1-form, and it is precisely the transgressed form (7.14).
Note that if $\dim M = 2$, then (5.39) says that


$c_1(L)[M] = \mathrm{Index}\, X,$


for any nonvanishing section $X$ of $L$ over $M \setminus \{p_1,\dots,p_K\}$.

For general $G$, there may be no nontrivial elements of $\mathcal{I}_1$. In fact, if $p : \mathfrak{g} \to \mathbb{R}$
is a nonzero linear form, $V = \ker p$ is a linear subspace of $\mathfrak{g}$ of codimension 1,
which is $\mathrm{Ad}\,G$-invariant if $p \in \mathcal{I}_1$. This means $V$ is an ideal: $[V,\mathfrak{g}] \subset V$. Thus
there are no nontrivial elements of $\mathcal{I}_1$ unless $\mathfrak{g}$ has an ideal of codimension 1. In
particular, if $\mathfrak{g}$ is semisimple, $\mathcal{I}_1 = 0$.

When $G$ is compact, there are always nontrivial elements of $\mathcal{I}_2$, namely,
Ad-invariant quadratic forms on $\mathfrak{g}$. In fact, any bi-invariant metric tensor on $G$ gives
a positive-definite element of $\mathcal{I}_2$. Applying the Chern–Weil construction in this
case then gives cohomology classes in $H^4(M,\mathbb{C})$.

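One way of obtaining elements of $\mathcal{I}_k$ is the following. Let $\pi$ be a representation
of $G$ on a vector space $V_\pi$, and set


(7.17)  $p_{\pi k}(X) = \mathrm{Tr}\, \Lambda^k d\pi(X), \quad X \in \mathfrak{g},$


where $d\pi(X)$ denotes the representation of $\mathfrak{g}$ on $V_\pi$. In connection with this, note
that


(7.18)  $\det\bigl(\lambda I + d\pi(X)\bigr) = \sum_{j=0}^{M} \lambda^{M-j}\, \mathrm{Tr}\, \Lambda^j d\pi(X), \quad M = \dim V_\pi.$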
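
The expansion of $\det(\lambda I + A)$ in powers of $\lambda$, with coefficients $\mathrm{Tr}\,\Lambda^j A$, can be checked by machine. The sketch below (not from the text) uses the standard fact that $\mathrm{Tr}\,\Lambda^j A$ equals the sum of the principal $j\times j$ minors of $A$; the sample matrix and the function names are illustrative assumptions.

```python
from itertools import combinations

def det(m):
    """Determinant by Laplace expansion along the first row (empty matrix -> 1)."""
    if not m:
        return 1
    return sum((-1) ** c * m[0][c] * det([row[:c] + row[c + 1:] for row in m[1:]])
               for c in range(len(m)))

def trace_wedge(a, j):
    """Tr Lambda^j a, computed as the sum of the principal j x j minors of a."""
    n = len(a)
    return sum(det([[a[r][c] for c in idx] for r in idx])
               for idx in combinations(range(n), j))

def left_side(a, lam):
    """det(lam * I + a)."""
    n = len(a)
    return det([[a[i][j] + (lam if i == j else 0) for j in range(n)]
                for i in range(n)])

def right_side(a, lam):
    """sum_{j=0}^{M} lam^(M-j) * Tr Lambda^j a."""
    n = len(a)
    return sum(lam ** (n - j) * trace_wedge(a, j) for j in range(n + 1))

A = [[2, -1, 0], [3, 1, 4], [0, 5, -2]]
agree = all(left_side(A, lam) == right_side(A, lam) for lam in range(-3, 4))
```

Here `trace_wedge(A, 1)` is the trace and `trace_wedge(A, 3)` is the determinant, the two extreme coefficients of the expansion.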
If $P \to M$ is a principal $U(n)$-bundle or $Gl(n,\mathbb{C})$-bundle, and $\pi$ the standard
representation on $\mathbb{C}^n$, then consider


(7.19)  $\det\Bigl(\lambda - \frac{1}{2\pi i}\,\Omega\Bigr) = \sum_{k=0}^{n} c_k(\Omega)\, \lambda^{n-k}.$


The classes $[c_k(\Omega)] \in H^{2k}(M,\mathbb{C})$ are the Chern classes of $P$. If $E \to M$ is the
associated vector bundle, arising via the standard representation $\pi$, we also call
this the $k$th Chern class of $E$:


(7.20)  $c_k(E) = [c_k(\Omega)] \in H^{2k}(M,\mathbb{C}).$


The object


(7.21)  $c(E) = \sum c_k(E) \in \bigoplus_{k=0}^{n} H^{2k}(M,\mathbb{C})$


is called the total Chern class of such a vector bundle.

If $P \to M$ is a principal $O(n)$-bundle, and $\pi$ the standard representation on
$\mathbb{R}^n$, then consider


(7.22)  $\det\Bigl(\lambda - \frac{1}{2\pi}\,\Omega\Bigr) = \sum_{k=0}^{n} d_k(\Omega)\, \lambda^{n-k}.$

The polynomials $d_k(\Omega)$ vanish for $k$ odd, since $\Omega^t = -\Omega$, and one obtains
Pontrjagin classes:

(7.23)  $p_k(\Omega) = d_{2k}(\Omega) \in H^{4k}(M,\mathbb{R}).$

If $F \to M$ is the associated vector bundle, arising from the standard
representation $\pi$, then $p_k(F)$ is defined to be (7.23).


Exercises 
1. If $E$ and $F$ are complex vector bundles over $M$, we can form $E \oplus F \to M$. Show that

$c(E \oplus F) = c(E) \wedge c(F),$


where $c(E)$ is the total Chern class given by (7.21), that is,


$c(E) = \det\Bigl(I - \frac{1}{2\pi i}\,\Omega\Bigr) \in H^{\mathrm{even}}(M,\mathbb{C}),$


for a curvature 2-form arising from a connection on $M$.

2. Define the Chern character of a complex vector bundle $E \to M$ as the cohomology
class $\mathrm{Ch}(E) \in H^{\mathrm{even}}(M,\mathbb{C})$ of


$\mathrm{Ch}(\Omega) = \mathrm{Tr}\, e^{-\Omega/2\pi i},$


writing $\mathrm{Tr}\, e^{-\Omega/2\pi i} \in \bigoplus_{k \ge 0} \Lambda^{2k}P$ via the power-series expansion of the exponential
function. Show that


$\mathrm{Ch}(E \oplus F) = \mathrm{Ch}(E) + \mathrm{Ch}(F),$
$\mathrm{Ch}(E \otimes F) = \mathrm{Ch}(E) \wedge \mathrm{Ch}(F)$

in $H^{\mathrm{even}}(M,\mathbb{C})$.

3. If $F \to M$ is a real vector bundle and $E = F \otimes \mathbb{C}$ is its complexification, show that

$p_j(F) = (-1)^j c_{2j}(E).$


4. Using $so(4) \approx so(3) \oplus so(3)$, construct two different characteristic classes in
$H^4(M,\mathbb{C})$, when $M$ is a compact, oriented, four-dimensional manifold.


8. The Chern—Gauss—Bonnet theorem 


Our goal in this section is to generalize the Gauss–Bonnet formula (5.1),
producing a characteristic class derived from the curvature tensor $\Omega$ of a Riemannian
metric on a compact, oriented manifold $M$, say $e(\Omega) \in \Lambda^n(M)$, such that


(8.1)  $\int_M e(\Omega) = \chi(M),$


the right side being the Euler characteristic of $M$.
A clue to obtaining $e(\Omega)$ comes from the higher-dimensional generalization of
the index formula (5.47), namely,


(8.2)  $\mathrm{Index}(X) = \chi(M),$


valid for any vector field X on M with isolated critical points. The relation 
between these two when dim M = 2 is noted at the end of §5. It arises from 
the relation between Index(X) and the degree of the Gauss map. 

Indeed, let $M$ be a compact, $n$-dimensional submanifold of $\mathbb{R}^{n+k}$, $X$ a (tangent)
vector field on $M$ with a finite number of critical points, and $T$ a small
tubular neighborhood of $M$. By Corollary 20.5 of Chap. 1, we know that if
$N : \partial T \to S^{n+k-1}$ denotes the Gauss map on $\partial T$, formed by the
outward-pointing normals, then


(8.3)  $\mathrm{Index}(X) = \mathrm{Deg}(N).$


As noted at the end of §5 of this chapter, if $M$ is a surface in $\mathbb{R}^3$, with Gauss
map $N_M$, then $\mathrm{Deg}(N_M) = (1/4\pi)\int_M K\, dV$, where $K$ is the Gauss curvature
of $M$, with its induced metric. If $T$ is a small tubular neighborhood of $M$ in
this case, then $\partial T$ is diffeomorphic to two oppositely oriented copies of $M$, with
approximately the same metric tensor. The outer component of $\partial T$ has Gauss map
approximately equal to $N_M$, and the inner component has Gauss map
approximately equal to $-N_M$. From this we see that (8.2) and (8.3) imply (8.1) with
$e(\Omega) = (1/2\pi)K\, dV$ in this case.

We make a further comment on the relation between (8.2) and (8.3). Note that
the right side of (8.3) is independent of the choice of $X$. Thus, any two vector
fields on $M$ with only isolated critical points have the same index. Suppose $M$
has a triangulation $\tau$ into $n$-simplices. There is a construction of a vector field $X_\tau$,
illustrated in Fig. 5.3 for $n = 2$, with the property that $X_\tau$ has a critical point at
each vertex, of index $+1$, and a critical point in the middle of each $j$-simplex in
$\tau$, of index $(-1)^j$, so that


(8.4)  $\mathrm{Index}(X_\tau) = \sum_{j=0}^{n} (-1)^j \nu_j(M),$


where $\nu_j(M)$ is the number of $j$-simplices in the triangulation $\tau$ of $M$. We leave
the construction of $X_\tau$ in higher dimensions as an exercise.

A proof that any smooth, paracompact manifold $M$ is triangulable is given in
[Wh]. There it is shown that if you imbed $M$ smoothly in $\mathbb{R}^N$, produce a fine
triangulation of $\mathbb{R}^N$, and then perhaps jiggle the imbedding a bit, the intersection
provides a triangulation of $M$.

Now, in view of the invariance of $\mathrm{Index}(X)$, it follows that the right side of
(8.4) is independent of the triangulation of $M$. Also, if $M$ has a more general cell
decomposition, we can form the sum on the right side of (8.4), where $\nu_j$ stands for
the number of $j$-dimensional cells in $M$. Each cell can be divided into simplices
in such a way that a triangulation is obtained, and the sum on the right side of
(8.4) is unchanged under such a refinement. This alternating sum is one definition
of the Euler characteristic, but as we have used another definition in §§8 and 9 of
Chap. 5, namely


(8.5)  $\chi(M) = \sum_{j=0}^{n} (-1)^j \dim \mathcal{H}^j(M),$


we will temporarily denote the right side of (8.4) by $\chi_c(M)$.

Now we tackle the question of representing (8.3) as an integrated curvature,
to produce (8.1). We begin with the case when $M$ is a compact hypersurface in
$\mathbb{R}^{n+1}$. In that case we have, by (4.66),


(8.6)  $\mathrm{Deg}(N) = \frac{2}{A_n} \int_M (\det A_N)\, dV, \quad \text{for } n \text{ even},$


where $A_n$ is the area of $S^n$ and $A_N : T_xM \to T_xM$ is the Weingarten map.
The factor 2 arises because $\partial T$ consists of two copies of $M$. We can express
$\det A_N$ directly in terms of the Riemann curvature tensor $R_{jk\ell m}$ of $M$, using Gauss'
Theorema Egregium.
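
A quick consistency check of (8.6), offered as an illustration rather than as part of the original argument: take $M$ to be the unit sphere $S^n \subset \mathbb{R}^{n+1}$ with $n$ even. Its Weingarten map is $\pm I$ (the sign depending on the convention for the normal), so $\det A_N = 1$, and

```latex
\mathrm{Deg}(N) \;=\; \frac{2}{A_n}\int_{S^n} (\det A_N)\,dV
              \;=\; \frac{2}{A_n}\cdot A_n \;=\; 2 ,
```

which agrees with the value $\chi(S^n) = 2$ for even $n$, as required by (8.2) and (8.3).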

In fact, with respect to an oriented orthonormal basis $\{e_j\}$ of $T_xM$, the matrix
of $A_N$ has entries $A_{jk} = II(e_j,e_k)$, and by (4.14),

(8.7)  $R_{jk\ell m} = \langle R(e_\ell,e_m)e_k, e_j\rangle = \det \begin{pmatrix} A_{mk} & A_{mj} \\ A_{\ell k} & A_{\ell j} \end{pmatrix}.$


In other words, the curvature tensor captures the action of $\Lambda^2A_N$ on $\Lambda^2T_xM$.
If $n = 2k$ is even, we can then express $\det A_N$ as a polynomial in the components
$R_{jk\ell m}$, using


(8.8)  $\begin{aligned} (\det A_N)\, e_1 \wedge \cdots \wedge e_n &= (\Lambda^nA_N)(e_1 \wedge \cdots \wedge e_n) \\ &= (Ae_1 \wedge Ae_2) \wedge \cdots \wedge (Ae_{n-1} \wedge Ae_n). \end{aligned}$


Now, by (8.7),


(8.9)  $Ae_1 \wedge Ae_2 = \frac{1}{2} \sum R_{\ell m 12}\, e_\ell \wedge e_m.$


Replacing $(1,\dots,n)$ in (8.8) with all its permutations and summing, we obtain

(8.10)  $\det A_N = \frac{1}{2^{n/2}\, n!} \sum_{j,k} (\mathrm{sgn}\, j)(\mathrm{sgn}\, k)\, R_{j_1j_2k_1k_2} \cdots R_{j_{n-1}j_nk_{n-1}k_n},$


where $j = (j_1,\dots,j_n)$ stands for a permutation of $(1,\dots,n)$. The fact that the
quantity (8.10), integrated over $M$, is equal to $(A_n/2)\chi(M)$ when $M$ is a
hypersurface in $\mathbb{R}^{n+1}$ was first established by H. Hopf, as a consequence of his result
(8.2). The content of the generalized Gauss–Bonnet formula is that for any
compact Riemannian manifold $M$ of dimension $n = 2k$, integrating the right side of
(8.10) over $M$ gives $(A_n/2)\chi(M)$.

One key point in establishing the general case is to perceive the right side of
(8.10) as arising via the Chern–Weil construction from an invariant polynomial
on the Lie algebra $\mathfrak{g} = so(n)$, to produce a characteristic class. Now the curvature
2-form can in this case be considered a section of $\Lambda^2T^* \otimes \Lambda^2T^*$, reflecting the
natural linear isomorphism $\mathfrak{g} \approx \Lambda^2T^*$. Furthermore, $\Lambda^*T^* \otimes \Lambda^*T^*$ has a product,
satisfying


(8.11)  $(\alpha_1 \otimes \beta_1) \wedge (\alpha_2 \otimes \beta_2) = (\alpha_1 \wedge \alpha_2) \otimes (\beta_1 \wedge \beta_2).$

If we set

(8.12)  $\Omega = \frac{1}{4} \sum R_{jk\ell m}\, (e_j \wedge e_k) \otimes (e_\ell \wedge e_m),$


then we form the $k$-fold product, $k = n/2$, obtaining


(8.13)  $\Omega \wedge \cdots \wedge \Omega = 2^{-n} \sum_{j,k} (\mathrm{sgn}\, j)(\mathrm{sgn}\, k)\, R_{j_1j_2k_1k_2} \cdots R_{j_{n-1}j_nk_{n-1}k_n}\, (\omega \otimes \omega),$


with $\omega = e_1 \wedge \cdots \wedge e_n$. Thus, the right side of (8.10), multiplied by $\omega \otimes \omega$, is
equal to $2^{n/2}/n!$ times the right side of (8.13). (Observe the distinction between
the product (8.11) and the product on $\mathrm{End}(E) \otimes \Lambda^*T$, used in (7.19) and (7.22),
which assigns a different meaning to $\Omega \wedge \cdots \wedge \Omega$.)

Now the Chern–Weil construction produces (8.13), with $\omega \otimes \omega$ replaced by $\omega$,
if we use the Pfaffian


(8.14)  $\mathrm{Pf} : so(n) \longrightarrow \mathbb{R}, \quad n = 2k,$


defined as follows. Let $\xi : so(n) \to \Lambda^2\mathbb{R}^n$ be the isomorphism


(8.15)  $\xi(X) = \frac{1}{2} \sum X_{jk}\, e_j \wedge e_k, \quad X = (X_{jk}) \in so(n).$




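Then, if $n = 2k$, take a product of $k$ factors of $\xi(X)$ to obtain a multiple of
$\omega = e_1 \wedge \cdots \wedge e_n$. Then $\mathrm{Pf}(X)$ is uniquely defined by


(8.16)  $\xi(X) \wedge \cdots \wedge \xi(X) = k!\, \mathrm{Pf}(X)\, \omega.$

Note that if $T : \mathbb{R}^n \to \mathbb{R}^n$ is linear, then $T^*\xi(X) = \xi(T^tXT)$, so

(8.17)  $\mathrm{Pf}(T^tXT) = (\det T)\, \mathrm{Pf}(X).$


Now any $X \in so(n)$ can be written as $X = T^tYT$, where $T \in SO(n)$, that is, $T$
is an orthogonal matrix of determinant 1, and $Y$ is a sum of $2 \times 2$ skew-symmetric
blocks, of the form


(8.18)  $Y_\nu = \begin{pmatrix} 0 & \lambda_\nu \\ -\lambda_\nu & 0 \end{pmatrix}, \quad \lambda_\nu \in \mathbb{R}.$


Thus $\xi(Y) = \lambda_1 e_1 \wedge e_2 + \cdots + \lambda_k e_{n-1} \wedge e_n$, so

(8.19)  $\mathrm{Pf}(Y) = \lambda_1 \cdots \lambda_k.$

Note that $\det Y = (\lambda_1 \cdots \lambda_k)^2$. Hence, by (8.17), we have

(8.20)  $\mathrm{Pf}(X)^2 = \det X,$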


when $X$ is a real, skew-symmetric, $n \times n$ matrix, $n = 2k$. When (8.17) is
specialized to $T \in SO(n)$, it implies that $\mathrm{Pf}$ is an invariant polynomial, homogeneous of
degree $k$ (i.e., $\mathrm{Pf} \in \mathcal{I}_k$, $k = n/2$).
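
For $n = 4$, expanding the wedge product in (8.16) gives $\mathrm{Pf}(X) = X_{12}X_{34} - X_{13}X_{24} + X_{14}X_{23}$. The following sketch (not from the text) uses this formula to test (8.17) and (8.20) in exact integer arithmetic; the sample matrices `X` and `T` and the helper names are illustrative assumptions.

```python
def det(m):
    """Determinant by Laplace expansion along the first row (empty matrix -> 1)."""
    if not m:
        return 1
    return sum((-1) ** c * m[0][c] * det([row[:c] + row[c + 1:] for row in m[1:]])
               for c in range(len(m)))

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(a):
    return [list(row) for row in zip(*a)]

def pf4(x):
    """Pfaffian of a 4x4 skew-symmetric matrix (indices shifted to start at 0)."""
    return x[0][1] * x[2][3] - x[0][2] * x[1][3] + x[0][3] * x[1][2]

def skew4(a12, a13, a14, a23, a24, a34):
    """Build the skew-symmetric matrix with the given upper-triangular entries."""
    return [[0, a12, a13, a14],
            [-a12, 0, a23, a24],
            [-a13, -a23, 0, a34],
            [-a14, -a24, -a34, 0]]

X = skew4(1, 2, -1, 3, 4, 5)
T = [[1, 2, 0, 0], [0, 1, 0, 1], [1, 0, 1, 0], [0, 0, 3, 1]]  # any linear map

TtXT = matmul(transpose(T), matmul(X, T))

square_identity = pf4(X) ** 2 == det(X)                 # (8.20)
transformation_rule = pf4(TtXT) == det(T) * pf4(X)      # (8.17)
```

Note that `T` here is a general linear map, not a rotation; restricting to $T \in SO(4)$ makes $\det T = 1$, which is the invariance statement used above.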

Now, with $\Omega$ in (8.12) regarded as a $\mathfrak{g}$-valued 2-form, we have the left side
of (8.13) equal to $(1/k!)\mathrm{Pf}(\Omega)$. Thus we are on the way toward establishing the
generalized Gauss–Bonnet theorem, in the following formulation.


Theorem 8.1. If $M$ is a compact, oriented Riemannian manifold of dimension
$n = 2k$, then


(8.21)  $\chi(M) = (2\pi)^{-k} \int_M \mathrm{Pf}(\Omega).$


The factor $(2\pi)^{-k}$ arises as follows. From (8.10) and (8.13), it follows that
when $M$ is a compact hypersurface in $\mathbb{R}^{n+1}$, the right side of (8.6) is equal to
$C_k \int_M \mathrm{Pf}(\Omega)$, with


(8.22)  $C_k = \frac{2^{k+1}\, k!}{A_{2k}\, (2k)!}.$


Now the area of the unit sphere is given by


$A_{2k} = \frac{2\pi^{k+1/2}}{\Gamma(k + \frac{1}{2})} = \frac{2\pi^k}{(k - \frac{1}{2}) \cdots (\frac{1}{2})},$


as is shown in Appendix A of Chap. 3; substituting this into (8.22) gives
$C_k = (2\pi)^{-k}$.
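
The arithmetic behind the constant can be checked numerically. The sketch below is not from the text and rests on two assumptions stated here explicitly: that the constant has the form $C_k = 2^{k+1}k!/(A_{2k}\,(2k)!)$, and that the area of the unit sphere $S^n$ is $A_n = 2\pi^{(n+1)/2}/\Gamma((n+1)/2)$. The function names are illustrative.

```python
import math

def sphere_area(n):
    """Area of the unit sphere S^n in R^(n+1): 2 pi^((n+1)/2) / Gamma((n+1)/2)."""
    return 2 * math.pi ** ((n + 1) / 2) / math.gamma((n + 1) / 2)

def c_k(k):
    """Assumed form of the constant: 2^(k+1) k! / (A_{2k} (2k)!)."""
    return 2 ** (k + 1) * math.factorial(k) / (sphere_area(2 * k) * math.factorial(2 * k))

# Compare with (2 pi)^(-k) for the first few values of k.
comparison = [(k, c_k(k), (2 * math.pi) ** (-k)) for k in range(1, 7)]
```

For $k = 1$ this reduces to $4/(2 \cdot 4\pi) = 1/2\pi$, matching the classical factor $e(\Omega) = (1/2\pi)K\,dV$ for surfaces.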

We give a proof of this which extends the proof of (5.24), in which handles are 
added to a surface. To effect this parallel, we consider how the two sides of (8.21) 
change when M is altered by a certain type of surgery, which we will define in 
the next paragraph. 

First, we mention another ingredient in the proof of Theorem 8.1. Namely, the 
right side of (8.21) is independent of the choice of metric on M. Since different 
metrics produce different SO(2k) frame bundles, this assertion is not a simple 
consequence of Proposition 7.2. We will postpone the proof of this invariance 
until near the end of this section. 

We now describe the "surgeries" alluded to above. To perform surgery on $M_0$,
a manifold of dimension $n$, excise a set $H_0$ diffeomorphic to $S^{\ell-1} \times B^m$, with
$m + \ell - 1 = n$, where $B^m = \{x \in \mathbb{R}^m : |x| \le 1\}$, obtaining a manifold with
boundary $X$, $\partial X$ being diffeomorphic to $S^{\ell-1} \times S^{m-1}$. Then attach to $X$ a copy
of $B^\ell \times S^{m-1}$, sewing them together along their boundaries, both diffeomorphic
to $S^{\ell-1} \times S^{m-1}$, to obtain $M_1$. Symbolically, we write


(8.23)  $M_0 = X\#H_0, \quad M_1 = X\#H_1.$

We say $M_1$ is obtained from $M_0$ by a surgery of type $(\ell, m)$.

We compare the way each side of (8.21) changes when $M$ changes from $M_0$
to $M_1$. We also look at how $\chi_c(M)$, defined to be the right side of (8.4), changes.
In fact, this definition easily yields


(8.24)  $\chi(X\#H_1) = \chi(X\#H_0) - \chi(H_0) + \chi(H_1).$


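For notational simplicity, we have dropped the "c" subscript. It is more convenient
to produce an identity involving only manifolds without boundary, so note that


(8.25)  $\begin{aligned} \chi(H_0\#H_0) &= 2\chi(H_0) - \chi(\partial H_0), \\ \chi(H_1\#H_1) &= 2\chi(H_1) - \chi(\partial H_1), \end{aligned}$


and, since $\partial H_0 = \partial H_1$, we have

(8.26)  $\chi(H_1) - \chi(H_0) = \tfrac{1}{2}\chi(H_1\#H_1) - \tfrac{1}{2}\chi(H_0\#H_0),$

hence


(8.27)  $\chi(M_1) = \chi(M_0) + \tfrac{1}{2}\chi(H_1\#H_1) - \tfrac{1}{2}\chi(H_0\#H_0).$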




Note that $H_0\#H_0 = S^{\ell-1} \times S^m$, $H_1\#H_1 = S^\ell \times S^{m-1}$. To compute the
Euler characteristic of these two spaces, we can use multiplicativity of $\chi$. Note
that products of cells in $Y_1$ and $Y_2$ give cells in $Y_1 \times Y_2$, and


(8.28)  $\nu_j(Y_1 \times Y_2) = \sum_{i+k=j} \nu_i(Y_1)\, \nu_k(Y_2);$


then from (8.4) it follows that


(8.29)  $\chi(Y_1 \times Y_2) = \sum_{j \ge 0} (-1)^j \sum_{i+k=j} \nu_i(Y_1)\, \nu_k(Y_2) = \chi(Y_1)\, \chi(Y_2).$


Using the fairly elementary result that


(8.30)  $\chi(S^j) = \begin{cases} 2 & \text{if } j \text{ is even}, \\ 0 & \text{if } j \text{ is odd}, \end{cases}$

we have $\chi(H_0\#H_0) - \chi(H_1\#H_1)$ equal to 4 if $\ell$ is odd and $m$ even, $-4$ if $\ell$ is
even and $m$ odd, and 0 if $\ell$ and $m$ have the same parity (which does not arise if
$\dim M$ is even).
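
The bookkeeping in (8.27)-(8.30) is easy to mechanize. The sketch below (not from the text) assumes the standard cell structure on $S^j$ (one 0-cell and one $j$-cell, or two points when $j = 0$), builds product cell counts as in (8.28), and evaluates the change in the Euler characteristic under a surgery of type $(\ell, m)$ via (8.27). All function names are illustrative.

```python
def sphere_cells(j):
    """Cell counts (nu_0, ..., nu_j) for S^j: two points if j == 0, else 1 + 1 cells."""
    if j == 0:
        return [2]
    return [1] + [0] * (j - 1) + [1]

def product_cells(a, b):
    """Cell counts of a product, as in (8.28): nu_j = sum over i + k = j."""
    out = [0] * (len(a) + len(b) - 1)
    for i, ni in enumerate(a):
        for k, nk in enumerate(b):
            out[i + k] += ni * nk
    return out

def chi(cells):
    """Alternating sum of cell counts, the right side of (8.4)."""
    return sum((-1) ** j * n for j, n in enumerate(cells))

def surgery_change(l, m):
    """chi(M_1) - chi(M_0) for a surgery of type (l, m), l, m >= 1, from (8.27)."""
    h1_double = chi(product_cells(sphere_cells(l), sphere_cells(m - 1)))   # S^l x S^(m-1)
    h0_double = chi(product_cells(sphere_cells(l - 1), sphere_cells(m)))   # S^(l-1) x S^m
    return (h1_double - h0_double) // 2

# Attaching a handle to S^2 (type (1, 2)) lowers chi by 2, giving the torus.
torus_chi = chi(sphere_cells(2)) + surgery_change(1, 2)
```

The value of `surgery_change` is $-2$ when $\ell$ is odd and $m$ even, $+2$ when $\ell$ is even and $m$ odd, and $0$ when the parities agree, in line with the differences $\pm 4$ and $0$ noted above.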

The change in $\chi_c(M)$ just derived in fact coincides with the change in $\chi(M)$,
defined by (8.5). This follows from results on deRham cohomology obtained in
Chap. 5. In fact (B.8) of Chap. 5 implies (8.24), from which (8.25)–(8.27) follow;
(8.52) of Chap. 5 implies (8.29) when $Y_j$ are smooth, compact manifolds; and
(8.56) and (8.57) of Chap. 5 imply (8.30).

Thus, for $e(M) = \int_M e(\Omega)$ to change the same way as $\chi(M)$ under a surgery,
we need the following properties in addition to "functoriality." We need


(8.31)  $e(S^j \times S^k) = \begin{cases} 0 & \text{if } j \text{ or } k \text{ is odd}, \\ 4 & \text{if } j \text{ and } k \text{ are even}. \end{cases}$


If $e(\Omega)$ is locally defined, we have, upon giving $X$, $H_0$, and $H_1$ coherent
orientations,


(8.32)  $\int_{M_1} e(\Omega) = \int_{M_0} e(\Omega) - \int_{H_0} e(\Omega) + \int_{H_1} e(\Omega),$


parallel to (8.24). Place metrics on $M_j$ that are product metrics on
$(-\varepsilon,\varepsilon) \times S^{\ell-1} \times S^{m-1}$ on a small neighborhood of $\partial X$. If we place a metric on $H_j\#H_j$
which is symmetric with respect to the natural involution, we will have


(8.33)  $\int_{H_j} e(\Omega) = \frac{1}{2} \int_{H_j\#H_j} e(\Omega),$


provided $e(\Omega)$ has the following property. Given an oriented Riemannian
manifold $Y$, let $Y^\#$ be the same manifold with orientation reversed, and let the
associated curvature forms be denoted by $\Omega_Y$ and $\Omega_{Y^\#}$. We require


(8.34)  $e(\Omega_Y) = -e(\Omega_{Y^\#}).$


Now $e(\Omega) = \mathrm{Pf}(\Omega/2\pi)$ certainly satisfies (8.34), in view of the dependence
on orientation built into (8.16). To see that (8.31) holds in this case, we need
only note that $S^j \times S^k$ can be smoothly imbedded as a hypersurface in $\mathbb{R}^{j+k+1}$.
This can be done via imbedding $S^j \times I \times B^k$ into $\mathbb{R}^{j+k+1}$ and taking its boundary
(and smoothing it out). In that case, since $\mathrm{Pf}(\Omega/2\pi)$ is a characteristic class whose
integral is independent of the choice of metric, we can use the metric induced from
the imbedding. We now have (8.31)–(8.33). Furthermore, for such a hypersurface
$M = H_j\#H_j$, we know that the right side of (8.21) is equal to $\chi_c(H_j\#H_j)$, by
the argument preceding the statement of Theorem 8.1, and since (8.29) and (8.30)
are both valid for both $\chi$ and $\chi_c$, we also have this quantity equal to $\chi(H_j\#H_j)$.

It follows that (8.21) holds for any $M$ obtainable from $S^n$ by a finite number of
surgeries. With one extra wrinkle we can establish (8.21) for all compact, oriented 
M. The idea for using this technique is one the author learned from J. Cheeger, 
who uses a somewhat more sophisticated variant in work on analytic torsion [Ch]. 

Assume $M$ is connected. Give $M \times \mathbb{R}$ the product Riemannian metric, fix a
point $p \in M$, and, with $q = (p,0) \in M \times \mathbb{R}$, consider on $M \times \mathbb{R}$ the function
$f_0(x,t) = \operatorname{dist}((x,t),q)^2$. For $R$ sufficiently large, $f_0^{-1}(R)$ is diffeomorphic to
two copies of $M$, under the map $(x,t) \mapsto x$. For $r > 0$ sufficiently small, $f_0^{-1}(r)$
is diffeomorphic to the sphere $S^n$.

Our argument will use some basic results of Morse theory. A Morse function
$f : Z \to \mathbb{R}$ is a smooth function on a manifold $Z$ all of whose critical points
are nondegenerate, that is, if $\nabla f(z) = 0$, then $D^2 f(z)$ is an invertible $\nu \times \nu$
matrix, $\nu = \dim Z$. One also assumes that $f$ takes different values at distinct
critical points and that $f^{-1}(K)$ is compact for every compact $K \subset \mathbb{R}$. Now the
function $f_0$ above may not be a Morse function on $Z = M \times \mathbb{R}$, but there will
exist a smooth perturbation $f$ of $f_0$ which is a Morse function. A proof is given
in Proposition 4.3 of Appendix B. The new $f$ will share with $f_0$ the property that
$f^{-1}(r)$ is diffeomorphic to $S^n$ and $f^{-1}(R)$ is diffeomorphic to two copies of $M$.
Note that an orientation on $M$ induces an orientation on $M \times \mathbb{R}$, and hence an
orientation on any level set $f^{-1}(c)$ that contains no critical points. In particular,
$f^{-1}(R)$ is a union of two copies of $M$ with opposite orientations. The following
is a basic tool in Morse theory.


Proposition 8.2. If $c_1 < c_2$ are regular values of a Morse function $f : Z \to \mathbb{R}$
and there is exactly one critical point $z_0$, with $c_1 < f(z_0) < c_2$, then $M_2 =
f^{-1}(c_2)$ is obtained from $M_1 = f^{-1}(c_1)$ by a surgery. In fact, if $D^2 f(z_0)$ has
signature $(\ell, m)$, $M_2$ is obtained from $M_1$ by a surgery of type $(m, \ell)$.


This is a consequence of the following result, known as the Morse lemma. 


Theorem 8.3. Let $f$ have a nondegenerate critical point at $p \in Z$. Then there is
a coordinate system $(x_1, \dots, x_\nu)$ centered at $p$ in which


(8.35) $$f(x) = f(p) + x_1^2 + \cdots + x_\ell^2 - x_{\ell+1}^2 - \cdots - x_{\ell+m}^2$$


near the origin, where $\ell + m = \nu = \dim Z$.


Proof. Suppose that in some coordinate system $D^2 f(p)$ is given by a nondegenerate,
symmetric, $\nu \times \nu$ matrix $A$. It will suffice to produce a coordinate system
in which


(8.36) $$f(x) = f(p) + \frac{1}{2} Ax \cdot x,$$


near the origin, since going from here to (8.35) is a simple exercise in linear 
algebra. We will arrange (8.36) by an argument, due to Palais, similar to the proof 
of Darboux’ theorem in Chap. 1, §14. 

Begin with any coordinate system centered at $p$. Let


(8.37) $$\omega_1 = df, \quad \omega_0 = dg, \quad \text{where } g(x) = \frac{1}{2} Ax \cdot x,$$


with $A = D^2 f(0)$ in this coordinate system. Set $\omega_t = t\omega_1 + (1-t)\omega_0$, which
vanishes at $p$ for each $t \in [0,1]$. The nondegeneracy hypothesis on $A$ implies that
the components of each $\omega_t$ have linearly independent gradients at $p$; hence there
exists a smooth, time-dependent vector field $X_t$ (not unique), such that


(8.38) $$\omega_t \rfloor X_t = g - f.$$

Let $F_t$ be the flow generated by $X_t$, with $F_0 = \mathrm{Id}$. Note that $F_t$ fixes $p$. It is then
an easy computation using (8.38), plus the identity $\mathcal{L}_X \omega = d(\omega \rfloor X) + (d\omega) \rfloor X$,
that


(8.39) $$\frac{d}{dt} \bigl( F_t^* \omega_t \bigr) = 0.$$


Hence $F_1^* \omega_1 = \omega_0$, so $f \circ F_1 = g$ and the proof of Theorem 8.3 is complete.
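
The passage from (8.36) to (8.35) is the linear algebra step of reading off the signature $(\ell, m)$ of $A$. Here is a minimal Python sketch of that step, using exact rational arithmetic. It assumes that every leading principal minor of $A$ is nonzero, so that symmetric elimination needs no pivoting; by Sylvester's law of inertia the pivot signs then match the signs of the squares in (8.35). The function name and the sample matrices are ours, for illustration only.

```python
from fractions import Fraction


def signature(A):
    """Return (ell, m): the numbers of positive and negative pivots of symmetric A.

    Assumption: all leading principal minors of A are nonzero, so no
    pivoting is needed during elimination.
    """
    n = len(A)
    M = [[Fraction(entry) for entry in row] for row in A]
    ell = m = 0
    for k in range(n):
        pivot = M[k][k]
        if pivot == 0:
            raise ValueError("zero pivot: this sketch assumes nonzero leading minors")
        if pivot > 0:
            ell += 1
        else:
            m += 1
        # Eliminate below the pivot; the remaining block stays symmetric.
        for i in range(k + 1, n):
            factor = M[i][k] / pivot
            for j in range(k, n):
                M[i][j] -= factor * M[k][j]
    return ell, m


# Hessian of the saddle x^2 - y^2 at the origin.
print(signature([[2, 0], [0, -2]]))
# A non-diagonal indefinite example and a positive definite one.
print(signature([[2, 1], [1, -3]]))
print(signature([[2, 1, 0], [1, 2, 1], [0, 1, 2]]))
```

In the setting of Proposition 8.2, the pair returned for $D^2 f(z_0)$ is the signature that determines the type of the surgery.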


From Proposition 8.2, it follows that, given any compact, oriented, connected $M$,
of dimension $n$, a finite number of surgeries on $S^n$ yields two copies of $M$, with
opposite orientations, say $M$ and $M^\#$. Hence (8.21) holds with $M$ replaced by
the disjoint union $M \cup M^\#$. But, in view of (8.34), both sides of the resulting
identity are equal to twice the corresponding sides of (8.21); for $\chi_c$, this follows
easily from (8.4), and for $\chi$ it follows immediately from (8.5). We hence have the
Chern–Gauss–Bonnet formula and also the identity $\chi(M) = \chi_c(M)$, modulo the
task of showing the invariance of the right side of (8.21) under changes of metric
on $M$.


We turn to the task of demonstrating such invariance. Say $g_0$ and $g_1$ are
two Riemannian metric tensors on $M$, with associated $SO(n)$-bundles $P_0 \to M$,
$P_1 \to M$, having curvature forms $\Omega_0$ and $\Omega_1$. We want to show that
$\mathrm{Pf}(\Omega_1) - \mathrm{Pf}(\Omega_0)$ is exact on $M$. To do this, consider the family of metrics
$g_t = t g_1 + (1-t) g_0$ on $M$, with associated $SO(n)$-bundles $P_t \to M$, for $t \in [0,1]$.
These bundles fit together to produce a principal $SO(n)$-bundle $P \to M \times [0,1]$.
We know there exists a connection on this principal bundle. Let $T = \partial/\partial t$ on
$M \times [0,1]$, and let $\tilde{T}$ denote its horizontal lift (with respect to a connection chosen
on $P$). The flow generated by $\tilde{T}$ commutes with the $SO(n)$-action on $P$. Flowing
along one unit of time then yields a diffeomorphism $\Phi : P_0 \to P_1$, commuting
with the $SO(n)$-action, hence giving an isomorphism of $SO(n)$-bundles. Now
applying Proposition 7.2 to the original connection on $P_0$ and to that pulled back
from $P_1$ gives the desired invariance.

Before Chern’s work, H. Hopf had established Theorem 8.1 when $M$ is a
compact hypersurface in $\mathbb{R}^{2k+1}$. Then C. Allendoerfer [Al] and W. Fenchel [Fen]
proved (8.21) for the case when $M$ is isometrically imbedded in $\mathbb{R}^{n+k}$, by relating
the integral on the right to the integral over $\partial T$ of the Gauss curvature of
the boundary of a small tubular neighborhood $T$ of $M$, and using the known
result that $\chi(\partial T) = 2\chi(M)$. At that time it was not known that every compact
Riemannian manifold could be isometrically imbedded in Euclidean space. By
other means, Allendoerfer and A. Weil [AW] proved Theorem 8.1, at least for real
analytic metrics, via a triangulation and local isometric imbedding. Chern then
produced an intrinsic proof of Theorem 8.1 and initiated a new understanding of
characteristic classes.

In Chern’s original paper [Cher], it is established that $\int_M \mathrm{Pf}(\Omega/2\pi)$ is equal
to the index of a vector field $X$ on $M$, by a sophisticated variant of the argument
establishing Proposition 5.4, involving a differential form on the unit-sphere bundle
of $M$ related to, but more complicated than, the transgressed form (7.14). An
exposition of this argument can also be found in [Poo] and in [Wil]. When
$\dim M = 2$, one can identify the unit-sphere bundle and the frame bundle, and in that
case the form coincides with the transgressed form and the argument becomes
equivalent to that used to prove Proposition 5.4. An exposition of the proof of
Theorem 8.1 using tubes can be found in [Gr].

The Chern—Gauss—Bonnet theorem can also be considered as a special case of 
an index theorem for an elliptic differential operator. A proof in that spirit is given 
in Chap. 10, §7. More material on the Pfaffian is also developed there. 

We mention a further generalization of the Gauss–Bonnet formula. If $E \to X$
is an $SO(2k)$-bundle over a compact manifold $X$ (say of dimension $n$), with
metric connection $\nabla$ and associated curvature $\Omega$, then $\mathrm{Pf}(\Omega/2\pi)$ is defined as a
$(2k)$-form on $X$. This gives a class $\mathrm{Pf}(E) \in H^{2k}(X)$, independent of the choice
of connection on $E$, as long as it is a metric connection. There is an extension of
Theorem 8.1, describing the cohomology class of $\mathrm{Pf}(E)$ in $H^{2k}(X)$. Treatments
of this can be found in [KN] and in [Spi].


Exercises 


1. Verify that when $\Omega$ is the curvature 2-form arising from the standard metric on $S^{2k}$,
then

$$\int_{S^{2k}} \mathrm{Pf}(\Omega/2\pi) = 2.$$


2. Generalize Theorem 8.1 to the nonorientable case. (Hint: If $M$ is not orientable, look at
its orientable double cover $\tilde{M}$. Use (8.4) to show that $\chi(\tilde{M}) = 2\chi(M)$.) Using (8.16)
as a local identity, define a measure $\mathrm{Pf}(\Omega)$ in the nonorientable case.

3. Let $M$ be a compact, complex manifold of complex dimension $n$ (i.e., real dimension
$2n$). Denote by $T$ its tangent bundle, regarded as a complex vector bundle, with fibers
$T_p$ of complex dimension $n$. Show that

$$\int_M c_n(T) = \chi(M),$$

where $c_n(T)$ is the top Chern class, defined by (7.19) and (7.20).
4. If $M_j$ are compact Riemannian manifolds with curvature forms $\Omega_j$ and $M_1 \times M_2$ has
the product metric, with curvature form $\Omega$, show directly that

$$\pi_1^* \mathrm{Pf}(\Omega_1) \wedge \pi_2^* \mathrm{Pf}(\Omega_2) = \mathrm{Pf}(\Omega),$$

where $\pi_j$ projects $M_1 \times M_2$ onto $M_j$. If $\dim M_j$ is odd, set $\mathrm{Pf}(\Omega_j) = 0$. Use this to
reprove (8.31) when $e(\Omega) = \mathrm{Pf}(\Omega)$.


5. Show directly that the right sides of (8.2) and (8.3) both vanish when $M$ is a
hypersurface of odd dimension in $\mathbb{R}^{n+1}$.


9. Kähler manifolds and their curvature

Let $M$ be a Riemannian manifold with almost complex structure $J$ satisfying
$\langle X, Y \rangle = \langle JX, JY \rangle$, where $\langle\ ,\ \rangle$ is the Riemannian inner product. For
$X, Y \in T_pM$, we set

(9.1) $$(X, Y) = \langle X, Y \rangle + i \langle X, JY \rangle,$$

and see that


(9.2) $$(X,X) = \langle X,X \rangle, \quad (JX,Y) = i(X,Y), \quad (X,Y) = \overline{(Y,X)},$$


so we have a Hermitian metric. We call such $M$ a Hermitian manifold. We also
define the 2-form $\omega$ on $M$ by


(9.3) $$\omega(X,Y) = \langle X, JY \rangle.$$
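
The identities in (9.2) can be checked by hand; as an illustration, here is a minimal Python sketch that verifies them on the model case $\mathbb{R}^{2n} \approx \mathbb{C}^n$. It assumes the standard complex structure acting on each coordinate pair by $(a, b) \mapsto (-b, a)$ and the Euclidean inner product. The helper names and the sample vectors are ours, chosen for illustration.

```python
def J(v):
    """Standard complex structure on R^{2n}: (a, b) -> (-b, a) on each coordinate pair."""
    out = []
    for k in range(0, len(v), 2):
        out += [-v[k + 1], v[k]]
    return out


def riem(x, y):
    """Euclidean (Riemannian) inner product <x, y>."""
    return sum(a * b for a, b in zip(x, y))


def herm(x, y):
    """Hermitian form (x, y) = <x, y> + i <x, Jy>, as in (9.1)."""
    return complex(riem(x, y), riem(x, J(y)))


def omega(x, y):
    """The 2-form omega(x, y) = <x, Jy>, as in (9.3)."""
    return riem(x, J(y))


X = [1, 2, -3, 4]
Y = [1, 0, 2, -1]

assert riem(J(X), J(Y)) == riem(X, Y)          # J is an isometry
assert herm(X, X) == riem(X, X)                # first identity in (9.2)
assert herm(J(X), Y) == 1j * herm(X, Y)        # second identity in (9.2)
assert herm(X, Y) == herm(Y, X).conjugate()    # third identity in (9.2)
assert omega(X, Y) == -omega(Y, X)             # omega is antisymmetric
```

All entries are small integers, so the complex arithmetic here is exact.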


As seen in Chapter 12, §10, the condition that the almost complex structure be
integrable is the vanishing of the tensor field $N$, defined by


(9.4) $$N(X,Y) = J\bigl([X,Y] - [JX,JY]\bigr) - \bigl([JX,Y] + [X,JY]\bigr).$$

Furthermore, if $\nabla$ is the Levi-Civita connection on $M$,

(9.5) $$2\langle (\nabla_X J)Y, Z \rangle = (d\omega)(X, JY, JZ) - (d\omega)(X, Y, Z) + \langle N(Y,Z), X \rangle.$$


Definition. The Hermitian almost complex manifold $M$ is a Kähler manifold
provided $\nabla J = 0$.


Proposition 9.1. The manifold $M$ is Kähler if and only if its almost complex
structure is integrable and $d\omega = 0$.


Proof. To see $\Leftarrow$, note that the hypotheses $N = 0$, $d\omega = 0$ imply the right side
of (9.5) vanishes, hence the left side vanishes. For the converse, note that


(9.6) $$\nabla J = 0 \Longrightarrow \nabla\omega = 0 \Longrightarrow d\omega = 0,$$


and then (9.5) gives

(9.7) $$\nabla J = 0 \Longrightarrow N = 0.$$


Corollary 9.2. Given that $M$ is a complex manifold with a Hermitian metric, it
is Kähler if and only if $d\omega = 0$.


Proof. On the one hand, we have (9.6), so $\nabla J = 0 \Rightarrow d\omega = 0$. On the other, we
have (9.5) with $N = 0$, so $d\omega = 0 \Rightarrow \nabla J = 0$.


To celebrate this result, we call $\omega$ the Kähler form.
Note that $\nabla J = 0$ if and only if


(9.8) $$\nabla_Y (JX) = J\nabla_Y X,$$


for all vector fields $X$ and $Y$ on $M$. Another equivalent condition is that parallel
transport along each smooth curve in $M$ is $\mathbb{C}$-linear.

As seen in the exercises at the end of §12.10, each Hermitian symmetric space
is a Kähler manifold. Examples include the complex projective spaces $P^n(\mathbb{C})$, and
more generally the complex Grassmannians $G_{k,n}(\mathbb{C})$, the space of $k$-dimensional
$\mathbb{C}$-linear subspaces of $\mathbb{C}^n$. We also have the following.


Proposition 9.3. If $M$ is a Kähler manifold and $X$ is a complex submanifold,
then $X$, with the induced Riemannian metric, is also a Kähler manifold.


Returning to the conclusion of Corollary 9.2, we note that since $J$ is invertible,
$\omega$ is nondegenerate. Hence, if $M$ has complex dimension $n$ (hence real dimension
$2n$), then, for $1 \le j \le n$,

(9.9) $$\omega^j \in \Lambda^{2j}(M), \quad \omega^j \ne 0, \quad d\omega^j = 0.$$

We claim that, for $1 \le j \le n$, $\omega^j$ is not cohomologous to 0 if $M$ is compact.
Clearly $\omega^n$ is not; it is nowhere vanishing, so (if $M$ is connected) $\int_M \omega^n \ne 0$. On
the other hand, if $j < n$,


(9.10) $$\omega^j = d\beta \Longrightarrow \omega^n = \omega^{n-j} \wedge d\beta = d(\omega^{n-j} \wedge \beta),$$


giving a contradiction. We have:

Proposition 9.4. If $M$ is a compact Kähler manifold, of real dimension $2n$,

(9.11) $$H^{2j}(M) \ne 0, \quad \text{for } 1 \le j \le n.$$


This enables us to identify some compact, complex manifolds that do not have
Kähler structures, such as (for $n \ge 2$)


(9.12) $$M = (\mathbb{C}^n \setminus 0)/\sim, \quad z \sim 2^j z, \ \forall\, j \in \mathbb{Z},$$

which is diffeomorphic to $S^{2n-1} \times S^1$, forcing (9.11) to fail.
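
To see concretely why (9.11) fails, one can compute the Betti numbers of $S^{2n-1} \times S^1$. The following is a minimal Python sketch, assuming the Künneth formula in the form that the Poincaré polynomial of a product is the product of the Poincaré polynomials, and that the sphere $S^k$ has Poincaré polynomial $1 + t^k$. The function names are ours, for illustration only.

```python
def sphere_poincare(k):
    """Coefficient list of the Poincare polynomial 1 + t^k of S^k (k >= 1)."""
    coeffs = [0] * (k + 1)
    coeffs[0] += 1
    coeffs[k] += 1
    return coeffs


def product_poincare(p, q):
    """Multiply two Poincare polynomials (Kunneth formula for a product space)."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def hopf_betti(n):
    """Betti numbers b_0, ..., b_{2n} of S^{2n-1} x S^1."""
    return product_poincare(sphere_poincare(2 * n - 1), sphere_poincare(1))


for n in (2, 3, 4):
    betti = hopf_betti(n)
    # (9.11) would require b_{2j} != 0 for 1 <= j <= n; already b_2 = 0.
    assert betti[2] == 0
    print(n, betti)
```

For $n \ge 2$ the second Betti number vanishes, so $H^2(M) = 0$, which is incompatible with (9.11) for $j = 1$.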
We now look at some special properties of the Riemann tensor and the Ricci
tensor on $M$, when $M$ is a Kähler manifold.


Proposition 9.5. If $M$ is a Kähler manifold, with Riemann tensor $R$, then


(9.13) $$R(X,Y)JZ = JR(X,Y)Z,$$
(9.14) $$R(X, JY) = -R(JX,Y),$$
and

(9.15) $$R(JX, JY) = R(X,Y).$$


Proof. To start,


(9.16) $$\begin{aligned} R(X,Y)JZ &= \nabla_X \nabla_Y JZ - \nabla_Y \nabla_X JZ - \nabla_{[X,Y]} JZ \\ &= J\nabla_X \nabla_Y Z - J\nabla_Y \nabla_X Z - J\nabla_{[X,Y]} Z, \end{aligned}$$


the second identity by (9.8). This gives (9.13). Next,


(9.17) $$\begin{aligned} \langle R(X, JY)Z, W \rangle &= \langle R(Z,W)X, JY \rangle \\ &= -\langle JR(Z,W)X, Y \rangle \\ &= -\langle R(Z,W)JX, Y \rangle \\ &= -\langle R(JX,Y)Z, W \rangle, \end{aligned}$$


the third identity by (9.13). The first and fourth identities follow from the basic
symmetry


(9.18) $$\langle R(U,V)Z, W \rangle = \langle R(Z,W)U, V \rangle.$$

This gives (9.14), from which (9.15) readily follows.


We next consider the Ricci tensor, which, we recall, is given by


(9.19) $$\mathrm{Ric}(X,Y) = \sum_j \langle R(X, E_j)E_j, Y \rangle$$


at $x \in M$, where $\{E_j\}$ is an orthonormal basis of $T_xM$. We have the following
invariance of Ric.


Proposition 9.6. If $M$ is a Kähler manifold, with Ricci tensor Ric, then

(9.20) $$\mathrm{Ric}(JX, JY) = \mathrm{Ric}(X,Y).$$


Proof. Using (9.19), with $X, Y$ replaced by $JX, JY$ and the orthonormal set
$\{E_j\}$ replaced by $\{JE_j\}$, we have


(9.21) $$\begin{aligned} \mathrm{Ric}(JX, JY) &= \sum_j \langle R(JX, JE_j)JE_j, JY \rangle \\ &= \sum_j \langle R(X, E_j)JE_j, JY \rangle, \end{aligned}$$


the second identity by (9.15). Then (9.13) gives $R(X, E_j)JE_j = JR(X, E_j)E_j$,
and we get (9.20), via $\langle JU, JV \rangle = \langle U, V \rangle$.


Corollary 9.7. When $M$ is a Kähler manifold, the tensor $\rho$ defined by


(9.22) $$\rho(X, Y) = \mathrm{Ric}(X, JY)$$

satisfies

(9.23) $$\rho(X,Y) = -\rho(Y,X).$$


Proof. We have from (9.20) that


(9.24) $$\mathrm{Ric}(X, JY) = -\mathrm{Ric}(JJX, JY) = -\mathrm{Ric}(JX, Y) = -\mathrm{Ric}(Y, JX),$$


the last identity by the symmetry of Ric, and this gives (9.23).

Hence $\rho$ defines a 2-form on $M$, if $M$ is Kähler:


(9.25) $$\rho \in \Lambda^2 M.$$

This is called the Ricci form of $M$. It has the following key property.


Proposition 9.8. If $M$ is a compact Kähler manifold, then its Ricci form $\rho$ is
closed, hence defines a cohomology class $[\rho] \in H^2(M)$. Furthermore,


(9.26) $$[\rho] = 2\pi c_1(M).$$

For a proof, we refer to [Bes], Chapter 2, Proposition 2.75. We mention that
the Bianchi identity for Ric plays a key role in the proof that $d\rho = 0$.

If $M$ is a Kähler manifold that is also an Einstein manifold, i.e.,

(9.27) $$\mathrm{Ric} = \lambda g,$$

for some real $\lambda$, where $g$ denotes the metric tensor, or equivalently,

(9.28) $$\rho = \lambda\omega,$$

we say $M$ is a Kähler-Einstein manifold. Recall from Proposition 3.3 that if (9.27)
holds, then $\lambda$ must be constant (this via the Ricci identity, which follows from the
Bianchi identity). In such a case, (9.28) directly leads to $d\rho = 0$.

Note that if we define the (1,1) tensor field Ric by

(9.29) $$\langle \mathrm{Ric}(X), Y \rangle = \mathrm{Ric}(X,Y),$$

then the Einstein condition becomes (for general Riemannian manifolds)

(9.30) $$\mathrm{Ric} = \lambda I.$$
Here is an alternative formulation of Proposition 9.6.

Proposition 9.9. If $M$ is a Kähler manifold,

(9.31) $$\mathrm{Ric}(JX) = J\,\mathrm{Ric}(X).$$

Proof. By (9.20),


(9.32) $$\langle \mathrm{Ric}(X), Y \rangle = \langle \mathrm{Ric}(JX), JY \rangle = -\langle J\,\mathrm{Ric}(JX), Y \rangle,$$


hence $\mathrm{Ric}(X) = -J\,\mathrm{Ric}(JX)$, which implies (9.31).


We have the following simple path to showing that $P^n(\mathbb{C})$ is a Kähler-Einstein
manifold.


Proposition 9.10. Let $M$ be a Kähler manifold. Given $p \in M$, denote by $G_p$ the
group of diffeomorphisms of $M$, fixing $p$, and preserving the Hermitian metric
(9.1) (and hence the complex structure $J$). Thus we have an action


(9.33) $$\pi_p : G_p \longrightarrow \mathcal{L}(T_pM), \quad \pi_p(\varphi) = D\varphi(p),$$


of $\mathbb{C}$-linear transformations on $T_pM$ (with complex structure determined by $J$). If
this action is irreducible, for each $p$, then $M$ is Einstein.


Proof. The action (9.33) commutes with $\mathrm{Ric}_p \in \mathcal{L}(T_pM)$, for each $p$, so (9.30)
holds, with $\lambda_p = a_p + b_p J$, $a_p, b_p \in \mathbb{R}$. Since Ric is self-adjoint with respect to
the Riemannian inner product $\langle\ ,\ \rangle$, we have $b_p = 0$, so (9.30) holds with $\lambda$ real.
Arguments described above imply $\lambda$ is constant.


Corollary 9.11. The spaces $P^n(\mathbb{C})$ are Kähler-Einstein manifolds.


Proof. In these cases, the action (9.33) is equivalent to the standard action of
$U(n)$ on $\mathbb{C}^n$.


Further calculation shows that $\lambda > 0$ in (9.27) when $M = P^n(\mathbb{C})$. One says
$P^n(\mathbb{C})$ is a positive Kähler-Einstein manifold. Other such manifolds, arising as
coadjoint orbits in $\mathfrak{g}^*$, the dual to the Lie algebra of a compact Lie group, are
discussed in Chapter 8 of [Bes]. A compact Kähler-Einstein manifold satisfying
(9.27) with $\lambda < 0$ is called a negative Kähler-Einstein manifold. One satisfying
(9.27) with $\lambda = 0$ is called a Calabi-Yau manifold. See Chapter 11 of [Bes] for a
description of existence results for these classes. Constructions of Kähler-Einstein
metrics in these cases give rise to complex Monge-Ampère equations. See Chapter
14, §15 for further discussion and references.


References 


[AbM] R. Abraham and J. Marsden, Foundations of Mechanics, Benjamin/Cummings,
Reading, MA, 1978
[Al] C. Allendoerfer, The Euler number of a Riemannian manifold, Am. J. Math.
62(1940), 243-248
[AW] C. Allendoerfer and A. Weil, The Gauss–Bonnet theorem for Riemannian
polyhedra, Trans. AMS 53(1943), 101-129
[Ar] V. Arnold, Mathematical Methods of Classical Mechanics, Springer, New York,
1978
[BGM] M. Berger, P. Gauduchon, and E. Mazet, Le Spectre d’une Variété Riemannienne,
LNM no. 194, Springer, New York, 1971
[Bes] A. Besse, Einstein Manifolds, Springer, New York, 1987
[Car] M. doCarmo, Riemannian Geometry, Birkhäuser, Boston, 1992
[Ch] J. Cheeger, Analytic torsion and the heat equation, Ann. Math. 109(1979), 259-322
[CE] J. Cheeger and D. Ebin, Comparison Theorems in Riemannian Geometry, North
Holland, Amsterdam, 1975
[Cher] S. S. Chern, A simple intrinsic proof of the Gauss–Bonnet formula for closed
Riemannian manifolds, Ann. Math. 45(1944), 747-752
[Chv] C. Chevalley, Theory of Lie Groups, Princeton University Press, Princeton, NJ,
1946
[EGH] T. Eguchi, P. Gilkey, and A. Hanson, Gravitation, Gauge Theories, and Differential
Geometry, Physics Reports, Vol. 66, no. 6 (1980)
[Fen] W. Fenchel, On total curvature of Riemannian manifolds, J. London Math. Soc.
15(1940), 15-22
[FU] D. Freed and K. Uhlenbeck, Instantons and Four-Manifolds, Springer, New York,
1984
[Gil] P. Gilkey, The Index Theorem and the Heat Equation, Publish or Perish, Boston,
1974
[Gr] A. Gray, Tubes, Addison-Wesley, New York, 1990
[Gu] R. Gunning, Lectures on Riemann Surfaces, Princeton University Press, Princeton,
NJ, 1967
[Gun] M. Günther, On the perturbation problem associated to isometric embeddings of
Riemannian manifolds, Ann. Global Anal. Geom. 7(1989), 69-77
[Hi] N. Hicks, Notes on Differential Geometry, Van Nostrand, New York, 1965
[Jos] J. Jost, Nonlinear Methods in Riemannian and Kählerian Geometry, Birkhäuser,
Boston, 1988
[Kaz] J. Kazdan, Prescribing the curvature of a Riemannian manifold, CBMS Reg. Conf.
Ser. in Math., no. 57, AMS, Providence, RI, 1985
[KN] S. Kobayashi and N. Nomizu, Foundations of Differential Geometry, Interscience,
New York, Vol. 1, 1963; Vol. 2, 1969
[LM] H. B. Lawson and M. L. Michelsohn, Spin Geometry, Princeton University Press,
Princeton, NJ, 1989
[Mil] J. Milnor, Morse Theory, Princeton University Press, Princeton, NJ, 1963
[Mil2] J. Milnor, Topology from the Differentiable Viewpoint, University Press of Virginia,
Charlottesville, 1965
[MiS] J. Milnor and J. Stasheff, Characteristic Classes, Princeton University Press,
Princeton, NJ, 1974
[Pal] R. Palais, ed., Seminar on the Atiyah–Singer Index Theorem, Princeton University
Press, Princeton, NJ, 1963
[Poo] W. Poor, Differential Geometric Structures, McGraw-Hill, New York, 1981
[Spi] M. Spivak, A Comprehensive Introduction to Differential Geometry, Vols. 1-5,
Publish or Perish Press, Berkeley, CA, 1979
[Stb] S. Sternberg, Lectures on Differential Geometry, Prentice Hall, Englewood Cliffs,
NJ, 1964
[Str] D. Struik, Lectures on Classical Differential Geometry, Addison-Wesley, Reading,
MA, 1964
[Wh] H. Whitney, Geometric Integration Theory, Princeton University Press, Princeton,
NJ, 1957
[Wil] T. Willmore, Total Curvature in Riemannian Geometry, Wiley, New York, 1982


Index 


A-hat genus, 357 

Abelian differential, 372 

adjoint, 12 

almost complex structure, 364, 582 
integrable, 583, 593 

angular momentum, 155 

anticommutation relation, 326 

asymptotic expansion, 13, 66 

Atiyah—Singer index formula, 349 

Atiyah—Singer index theorem, 558 


Bergman kernel function, 555 
Bergman metric, 559 
Bergman projection, 554 
Berry’s phase, 193 
Bessel equation, 280 
Bessel function, 169, 206 
Bianchi’s identity, 611, 613, 615, 619, 660, 
663 

Blumenthal’s 0-1 law, 436
boundary problem 

regular, 52 
bounded geometry, 505 
Brownian motion, 406, 464 
Brownian scaling, 412 


Calderón projector, 558
Calderón-Vaillancourt theorem, 22
Cameron-Martin-Girsanov formula, 489 
capacitary potential, 442 
capacity, 311, 440, 442 

inner, 452 

outer, 452 
Casimir operator, 157 
Chapman—Kolmogorov equation, 485 
characteristic, 31 
characteristic class, 357, 662, 669, 675 


Chern character, 357, 666 
Chern class, 358, 366, 371, 665, 676 
Chern—Gauss—Bonnet formula, 358 
Chern—Gauss—Bonnet theorem, 674 
Chern—Weil construction, 662 
Clifford algebra, 325, 331 
Clifford connection, 329 
Clifford module, 326 
Clifford multiplication, 329 
Codazzi’s equation, 631 
commutator, 15 
complex manifold, 583 
conditional expectation, 426 
cone, 168 
conic set, 31 
connection, 608, 656 
1-form, 609, 655 
coefficients, 609 
metric, 609 
contact manifold, 581 
Coulomb force, 149 
covariant derivative, 608, 656 
curvature, 344, 609, 617, 657 
2-form, 609, 658 
principal, 640 
scalar, 346 


D bar-complex, 531 
D bar-Neumann problem, 533 
Darboux theorem, 581 
deformation tensor, 347 
deRham complex, 526 
derivation, 611 
deuteron, 162 
diffraction, 181 
diffusion, 480 
on Riemannian manifolds, 491 
with drift, 501, 513 


© The Editor(s) (if applicable) and The Author(s), under exclusive license
to Springer Nature Switzerland AG 2023
M. E. Taylor, Partial Differential Equations I, Applied Mathematical Sciences 116,
https://doi.org/10.1007/978-3-031-33700-0


Dirac operator, 341, 364 

twisted, 341, 349 
Dirac-type operator, 325 
Dirichlet boundary condition, 114, 121 
Dirichlet problem, 44, 417, 538 
distribution 

tempered, 106 
divergence theorem, 330 
divisor, 373 
domain of holomorphy, 552 
dominated convergence theorem, 440 
Duhamel’s principle, 518 
duplication formula, 175 


Egorov’s theorem, 28, 197 
eigenvalue, 119, 447 
Einstein tensor, 620 
elliptic, 17 

strongly, 18 
elliptic complex, 526 
elliptic regularization, 546 
energy 

conservation, 239 

local decay, 239 
ergodic theory, 203 
Euler characteristic, 358, 371, 642, 667, 

668 

Euler’s formula, 646 
exterior derivative, 326, 526 


Feynman—Kac formula, 413, 455 

Feynman-Kac formula, 499 

filtration, 335 

finite propagation speed, 114, 128, 244, 
257, 509 

first exit time, 421, 436 

first hitting time, 441 

flux, 225 

Fokker-Planck equation, 486 

Fourier inversion formula, 2, 105 

Fourier transform, 105, 228, 244 

Fredholm, 265, 349 

Friedrichs method, 110, 113 

Frobenius’s theorem, 610 

fundamental solution, 127, 180 

fundamental theorem, 638 


gamma function, 175 

Gauss curvature, 348, 620, 632, 640 

Gauss map, 626, 641, 667 

Gauss—Bonnet formula, 362, 371, 385 
generalized, 667 

Gauss—Bonnet theorem, 642 

Gauss—Codazzi equation, 631 

Gaussian integral, 65, 81 

Gegenbauer polynomial, 131 

genus, 371 

Green function, 221 

Green’s formula, 41, 423 

Gronwall inequality, 478 

Gronwall’s inequality, 26 

Gårding’s inequality, 24, 577


half-estimate, 538 
Hankel function, 219, 280 
Hankel transform, 170, 207 


harmonic oscillator, 86, 139, 357, 387, 416 


harmonic polynomial, 130 
heat equation, 119, 127, 350, 406 
heat kernel estimate, 511 
heat semigroup, 491 
Heisenberg group, 5, 581 
Hermite polynomial, 141 
Hermite semigroup, 142 
Hessian, 614 
Hodge decomposition, 549 
Hodge Laplacian, 327 
Hodge star operator, 328 
homotopic symbols, 388 
Hurwitz’ formula, 379 
Huygens principle, 128, 245, 253, 258 
hydrogen atom, 149 
hyperbolic 
strictly, 27 
symmetric, 25 
symmetrizable, 26 
hyperbolic space, 136, 512 
hypergeometric equation 
confluent, 153 
hypergeometric function, 175, 189 
confluent, 154, 160 
hypoelliptic, 491, 572 


ill-posed, 294 


index, 265, 349, 649, 667
    of a vector field, 642
index formula, 349
    local, 351
inverse problem, 289, 294
    uniqueness, 290
irreducible representation, 133, 134
Ito’s formula, 463, 465, 468, 483


Kähler form, 677
Kähler manifold, 677
Kähler-Einstein manifold, 680
Killing field, 348, 626
    conformal, 348
Kohn Laplacian, 575
Kolmogorov diffusion equation
    backward, 487
Kolmogorov’s diffusion equation, 484
Kolmogorov’s inequality, 425
Kähler manifold, 593


Langevin equation, 464, 480
Laplace operator, 119, 227, 573
Laplace transform, 207
layer potential, 38, 259, 559
Legendre function, 179
Legendre polynomial, 131, 133, 288
Lenz vector, 155
Levi form, 536, 563, 573, 584, 594
Levi-Civita connection, 329, 616
Lidskii’s theorem, 315
Lie algebra, 142
limiting absorption principle, 219
line bundle
    holomorphic, 368
Lippmann-Schwinger equation, 226
Lipschitz domain, 48


Markov property, 427, 428
    strong, 431, 438, 450
martingale, 429, 469
maximal inequality, 430
maximum principle, 45, 122, 224
    strong, 118
mean ergodic theorem, 198, 203
    L², 204, 205
mean value property, 438
Mehler’s formula, 144, 357, 416, 491
Mehler-Dirichlet formula, 132
Mellin transform, 159, 188
microlocal, 32
monotone convergence theorem, 418
Morrey’s inequality, 534, 584
Morse lemma, 673


negligible, 455
Neumann boundary condition, 114, 123
Neumann operator, 42, 60, 236, 268, 281, 292, 533, 571, 598
Neumann problem, 46
Newlander-Nirenberg theorem, 590
Nijenhuis tensor, 593


optical theorem, 234, 276
oscillatory integral, 6


Palatini identity, 627
parallel transport, 191, 608
parametrix, 17, 59, 67, 76, 119
Pfaffian, 360, 669
Picard iteration, 473
Planck’s constant, 167
plurisubharmonic, 537
Poisson bracket, 16, 75
Poisson integral, 44, 126, 423, 539
Pontrjagin class, 358, 666
potential well, 162
principal bundle, 654
principal symbol, 3, 35, 343
pseudodifferential operator, 3, 265, 572
pseudolocal, 6


quadratic differential, 379
quantization, 92
    positive, 92
quantum adiabatic limit, 191
quantum ergodic theorem, 195, 201
quantum mechanics, 149


radiation condition, 217

Radon inversion formula, 253 
Radon transform, 252, 258 
reciprocity relation, 236 
regular boundary point, 440 
Wiener criterion, 450 
regularity, 17 
microlocal, 32 
relativistic diffusion, 512 
Rellich formula, 51 
representation, 157 
Ricci form, 680 
Ricci tensor, 80, 345, 362, 619 
Riemann surface, 367 
Riemann tensor, 80, 617 
Riemann’s inequality, 374 
Riemann—Lebesgue lemma, 241 
Riemann-Roch formula, 371, 385 
Rodrigues formula, 133 


scalar curvature, 80, 619 
scattering, 217 
scattering amplitude, 223, 233, 265, 285, 
289, 455 

scattering character, 257, 273 
scattering cross section, 237 
scattering operator, 232, 248, 265, 285 
scattering phase, 278 
scattering pole, 263, 273 
Schrödinger operator, 116
Schwartz kernel, 5 
Schwartz space, 104 
second fundamental form, 596, 628 
sectional curvature, 633 
self-adjoint, 103, 227 

essentially, 114 
semigroup, 481 

Lax—Phillips, 256 
signature, 328, 362 
singular integral operator, 559 
Sobolev space, 21 
spectral measure, 109 
spectral representation, 103, 108, 112, 244 
spectral theorem, 103, 227 
spherical function, 132 
spin manifold, 339 
spin structure, 339 
spin-c structure, 363 
spinor, 336, 339 


stationary phase method, 275 
stochastic continuity, 496 
stochastic differential equation, 464, 473 
stochastic integral, 455, 457, 466 
equation, 473, 480 
Stone’s theorem, 109 
Stone-Weierstrass theorem, 121, 230, 407, 
419, 435 
stopping time, 431, 438 
strongly pseudoconvex, 536 
subelliptic estimate, 542, 545 
subordination identity, 417 
supermartingale, 469 
supertrace, 352 
surgery, 671 
symbol, 3, 599 
symmetric space 
Hermitian, 593 


Tauberian theorem, 120, 133, 196, 279 

Theorema Egregium, 626, 631 

Toeplitz operator, 556 

trace, 66 

trace class, 22 

trace formula, 273 

translation representation, 251 

transport equation, 29, 56, 61 

triangulation, 646, 667 

Trotter product formula, 413, 455, 500, 
514 


uncertainty principle, 159 
uniformization theorem, 378 
uniformly rectifiable domain, 559 
unitary group, 115 

unitary operator, 104, 228, 239 
unitary trick, 108 

UR domain, 49 


vector bundle, 608, 654 


wave equation, 127, 180, 237, 506 
shifted, 205 

wave front set, 31 

wave operators, 245
    completeness of, 248, 251
    Schrödinger, 250
Weingarten formula, 630
Weingarten map, 573, 598, 629, 641
Weitzenböck formula, 343, 355
Weyl calculus, 72, 145, 380
Weyl law, 195
white noise, 464 
Whittaker’s equation, 153 


Wiener measure, 406, 407, 495 
with drift, 511 
Wronskian, 161, 286 


Zaremba’s principle, 45 
zeta function, 148 
zonal function, 132 




